Suchergebnisse
Suchergebnisse:
Even though primarily created for language learning, the PAISÀ corpus also provides a rich resource for research. This web site will serve a learner-oriented interface for online access to the corpus. The interface will offer different modes for accessing the corpus, ranging from precompiled searches to fully flexible search options for ...
- welcome to PAISÀ
Welcome to PAISÀ. On these pages we present the corpus...
- general info & download
The Paisà corpus is a large collection of Italian web texts,...
- construction steps
The PAISÀ documents were selected in two different ways. A...
- description
The overall objective of the project PAISÀ (Piattaforma per...
- partnership
Partnership of Paisà. The project is a joint effort of:...
- help pages / manuals
Getting started with browsing the PAISÀ corpus. This...
- Italiano
Queste pagine web sono dedicate al corpus PAISÀ, un’ampia...
- welcome to PAISÀ
Pur essendo stato nato principalmente per l’apprendimento, il corpus PAISÀ rappresenta anche una preziosa risorsa per diverse attività di ricerca linguistica. Il sito intende offrire un’interfaccia per gli apprendenti attraverso cui accedere al corpus online.
6. Jan. 2009 · Raw and annotated versions of the corpus are freely made available for download. In addition, direct access to the data will be provided via a multifaceted query interface for learners and users of Italian, thus fostering free online access to concrete contexts of use of contemporary Italian.
11. Dez. 2018 · PAISÀ: Corpus italiano. Im Projekt PAISÀ (Piattaforma per l’Apprendimento dell’Italiano Su corpora Annotati) wurden Texte aus dem Internet zusammengestellt und linguistisch dokumentiert, um authentische Texte für den Sprachunterricht zur Verfügung zu stellen.
- Italienisch
- ca. 250 Mio. Wörter
- schriftlich (Internet)
- Standard
PDF | PAISA' is a Creative Commons licensed, large web corpus of contemporary Italian. We describe the design, harvesting, and processing steps involved... | Find, read and cite all the...
A large (250 million tokens) corpus of authentic Italian contemporary texts from the web, freely available and freely distributable, fully annotated in CoNNL format, and openly accessible and searchable through an advanced, learner-oriented interface (ILC-CNR carried out the linguistic annotation of texts).