TechRxiv
DiFelippoEtAl2022_RevistadaAbralin.pdf (262.66 kB)
Download file

THE DANTESTOCKS CORPUS: AN ANALYSIS OF THE DISTRIBUTION OF UNIVERSAL DEPENDENCIES-BASED PART OF SPEECH TAGS

Download (262.66 kB)
preprint
posted on 2022-11-28, 21:24 authored by Ariani Di FelippoAriani Di Felippo, Norton Trevisan Roman, Thiago A. S. Pardo, Lucas Panta de Moura

In the paper, we characterise DANTEStocks - a corpus of stock market tweets annotated with morphosyntactic information - in terms of the distribution of the PoS tags present in it. This effort provides a benchmark against which to compare other corpora and may support the investigation of the syntactic relations called dependencies, since some of them usually co-occur with specific PoS tags.

Funding

FAPESP 2019/07665-4

History

Email Address of Submitting Author

ariani@ufscar.br

ORCID of Submitting Author

0000-0002-4566-9352

Submitting Author's Institution

Federal University of São Carlos

Submitting Author's Country

  • Brazil

Usage metrics

    Licence

    Exports