Published: 2025-12-31

A Frequency-Based Algorithm for Argument Extraction from Russian Treebanks

Sergey Say, Ilja Seržant

Abstract

Arguments, unlike adjuncts, are typically understood as verb-specific dependents; in particular, the morphosyntactic devices used to encode arguments are determined by individual verbs. Building on this observation, we operationalize arguments as dependents whose encoding device occurs with a given verb at a significantly higher-than-average frequency. We apply an argument extraction algorithm to a dataset of 132,221 verb dependents from Russian treebanks available in the Universal Dependencies (UD) platform. To evaluate the algorithm’s performance, we compare its results to a manually annotated subset, informed by The Active Dictionary and a detailed semantic understanding of argumenthood. The frequency-based algorithm achieves acceptable precision (approx. 0.83), with particularly few false positives, making it a promising tool for cross-linguistic applications in typologically diverse languages with UD treebanks. Theoretically, we argue that a quantitative distributional approach to valency, originally proposed in Ju. D. Apresjan’s early pioneering work, broadly aligns with the in-depth semantic analyses of individual verbs and their meanings found in his later works, including The Active Dictionary.
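
The core idea of the abstract, flagging a dependent as argument-like when its encoding device is significantly more frequent with one verb than on average, can be illustrated with a minimal sketch. The abstract does not say which statistical test or threshold the authors use, so the one-sided exact binomial test and the 0.05 cut-off below are assumptions, the function names are hypothetical, and the toy counts are invented for illustration rather than taken from the paper's dataset.

```python
from collections import Counter
from math import comb


def binom_upper_tail(k: int, n: int, p: float) -> float:
    """Exact P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def extract_arguments(dependents, alpha=0.05):
    """Return the (verb, device) pairs flagged as argument-like.

    `dependents` is an iterable of (verb_lemma, encoding_device) pairs,
    one per verb dependent. A pair is flagged when the device's share
    among the verb's dependents exceeds its share in the whole dataset
    and the excess is significant under a one-sided binomial test
    (an assumed test, not necessarily the one used in the paper).
    """
    dependents = list(dependents)
    total = len(dependents)
    device_counts = Counter(device for _, device in dependents)
    verb_counts = Counter(verb for verb, _ in dependents)
    pair_counts = Counter(dependents)

    flagged = set()
    for (verb, device), k in pair_counts.items():
        baseline = device_counts[device] / total  # average rate of the device
        n = verb_counts[verb]                     # dependents of this verb
        if k / n > baseline and binom_upper_tail(k, n, baseline) < alpha:
            flagged.add((verb, device))
    return flagged


# Invented toy data: three verbs, each with one verb-specific device
# and the same share of adverbial dependents ("Adv").
toy = (
    [("verit", "v+Acc")] * 6 + [("verit", "Adv")] * 4
    + [("chitat", "Acc")] * 6 + [("chitat", "Adv")] * 4
    + [("pomogat", "Dat")] * 6 + [("pomogat", "Adv")] * 4
)
flagged = extract_arguments(toy)
print(sorted(flagged))
```

On this toy input the three verb-specific devices are flagged, while "Adv", which occurs at the same rate with every verb, is not. A real application would need many more observations per verb and some correction for multiple comparisons.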

Citation rules

Say, S., & Seržant, I. (2025). A Frequency-Based Algorithm for Argument Extraction from Russian Treebanks. Neophilologica, 1–32. https://doi.org/10.31261/NEO.2025.37.07

Licence

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


The Copyright Owners of the submitted texts grant the Reader the right to use the PDF documents under the provisions of the Creative Commons 4.0 International License: Attribution-ShareAlike (CC BY-SA). The user can copy and redistribute the material in any medium or format and remix, transform, and build upon the material for any purpose.

1. License

The University of Silesia Press provides immediate open access to the journal’s content under the Creative Commons BY-SA 4.0 license (http://creativecommons.org/licenses/by-sa/4.0/). Authors who publish with this journal retain all copyrights and agree to the terms of the above-mentioned CC BY-SA 4.0 license.

2. Author’s Warranties

The author warrants that the article is original, written by stated author/s, has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author/s.

If the article contains illustrative material (drawings, photos, graphs, maps), the author declares that these works are the author’s own, that they do not infringe the rights of third parties (including personal rights, inter alia the authorization to reproduce physical likeness), and that the author holds the exclusive proprietary copyrights. The author publishes the above works as part of the article under the licence "Creative Commons Attribution-ShareAlike 4.0 International".

ATTENTION! If the legal status of the illustrative material has not been determined and the necessary consent has not been granted by the holders of the proprietary copyrights, the submitted material will not be accepted for the editorial process. The author also takes full responsibility for providing false data (this includes covering the costs incurred by the University of Silesia Press and any financial claims of third parties).

3. User Rights

Under the CC BY-SA 4.0 license, the users are free to share (copy, distribute and transmit the contribution) and adapt (remix, transform, and build upon the material) the article for any purpose, provided they attribute the contribution in the manner specified by the author or licensor.

4. Co-Authorship

If the article was prepared jointly with other authors, the signatory of this form warrants that he/she has been authorized by all co-authors to sign this agreement on their behalf, and agrees to inform his/her co-authors of the terms of this agreement.

As the author of the proposed text, I hereby declare that in the event of withdrawal of the text from the publishing process or submitting it to another publisher without agreement from the editorial office, I agree to cover all costs incurred by the University of Silesia in connection with my application.

2025
Published: 2021-10-12


ISSN: 0208-5550
eISSN: 2353-088X
DOI: 10.31261/NEO

Publisher
Wydawnictwo Uniwersytetu Śląskiego | University of Silesia Press
