https://doi.org/10.31261/TAPSLA.16594
Extensive vocabulary acquisition is a cornerstone of second language (L2) proficiency, directly influencing both receptive and productive language skills. However, research on the productive vocabulary size of L2 learners transitioning to higher education, particularly their mastery of high-frequency words, remains limited. This study investigated the productive Hebrew vocabulary size and frequency distribution of Arabic-speaking learners entering higher education programs where Hebrew is the primary language of instruction. The research employed a corpus-driven approach, analyzing 156 Hebrew-language argumentative essays (18,054 orthographic words) written by native Arabic-speaking students during a college entrance examination. Automated tools were used to add contextual vocalization and disambiguate homographs, followed by manual annotation mapping each word to its corresponding lemma. To chart the vocabulary profile of the research population, the identified lemmas were then compared to established written and spoken Hebrew frequency lists. The study determined that learners had a productive vocabulary size of approximately 1,000 lemmas, despite completing over 1,000 hours of formal L2 instruction. The comparison indicated that 50% of the identified lemmas fell within the 1,000 most frequent Hebrew lemmas. The learners also exhibited a typical vocabulary profile in which use declined with each successive frequency band: they employed more lemmas from the 1k band (the 1,000 most frequent words) than from the 2k band (words ranked 1,001–2,000), more from the 2k band than from the 3k band (words ranked 2,001–3,000), and more from the 3k band than from the 4k band (words ranked 3,001–4,000).
These findings highlight the learners’ significant reliance on high-frequency vocabulary in L2 writing, emphasizing the need for targeted academic vocabulary instruction as they transition to higher education.
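The band comparison described in the abstract can be illustrated with a minimal sketch. It assumes a frequency list is available as a mapping from lemma to rank; the names `band_of`, `lexical_profile`, and `rank_by_lemma`, as well as the toy data, are hypothetical and are not taken from the study's tooling or corpus.

```python
from collections import Counter


def band_of(rank, band_size=1000):
    """Return the 1-based band for a 1-based rank (1-1000 -> 1, 1001-2000 -> 2)."""
    return (rank - 1) // band_size + 1


def lexical_profile(lemmas, rank_by_lemma, band_size=1000, max_band=4):
    """Count distinct lemmas per frequency band.

    Lemmas missing from the frequency list, or ranked beyond max_band,
    are counted as "off-list".
    """
    profile = Counter()
    for lemma in set(lemmas):  # vocabulary size counts each lemma once
        rank = rank_by_lemma.get(lemma)
        if rank is None or band_of(rank, band_size) > max_band:
            profile["off-list"] += 1
        else:
            profile[f"{band_of(rank, band_size)}k"] += 1
    return profile


# Toy data (placeholder tokens, not real Hebrew lemmas or real ranks).
rank_by_lemma = {"a": 1, "b": 1000, "c": 1001, "d": 2500, "e": 3999, "f": 4001}
essay_lemmas = ["a", "a", "b", "c", "d", "e", "f", "z"]

profile = lexical_profile(essay_lemmas, rank_by_lemma)
vocabulary_size = len(set(essay_lemmas))
share_in_1k = profile["1k"] / vocabulary_size
```

Counting distinct lemmas rather than running tokens matches the abstract's framing of vocabulary size; a token-based profile would simply drop the `set(...)` call.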
Licence

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
The Copyright Holders of the submitted texts are the Authors. The Reader is granted the rights to use the material available on the TAPSLA websites and in PDF documents under the provisions of the Creative Commons 4.0 International License: Attribution - Share Alike (CC BY-SA 4.0). The user is free to copy and redistribute the material in any medium or format, and to remix, transform, and build upon the material for any purpose, even commercially.
1. License
The University of Silesia Press provides immediate open access to the journal’s content under the Creative Commons BY-SA 4.0 license (http://creativecommons.org/licenses/by-sa/4.0/). Authors who publish with this journal retain all copyrights and agree to the terms of the above-mentioned CC BY-SA 4.0 license.
2. Author’s Warranties
The author warrants that the article is original, written by stated author/s, has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author/s.
If the article contains illustrative material (drawings, photos, graphs, maps), the author declares that these works are the author's own, that they do not infringe the rights of any third party (including personal rights, i.a. the authorization to reproduce physical likeness), and that the author holds exclusive proprietary copyrights. The author publishes the above works as part of the article under the licence "Creative Commons Attribution-ShareAlike 4.0 International".
ATTENTION! When the legal situation of the illustrative material has not been determined and the necessary consent has not been granted by the proprietary copyright holders, the submitted material will not be accepted for the editorial process. At the same time, the author takes full responsibility for providing false data (this also regards covering the costs incurred by the University of Silesia Press and financial claims of third parties).
3. User Rights
Under the CC BY-SA 4.0 license, the users are free to share (copy, distribute and transmit the contribution) and adapt (remix, transform, and build upon the material) the article for any purpose, provided they attribute the contribution in the manner specified by the author or licensor.
4. Co-Authorship
If the article was prepared jointly with other authors, the signatory of this form warrants that he/she has been authorized by all co-authors to sign this agreement on their behalf, and agrees to inform his/her co-authors of the terms of this agreement.
I hereby declare that in the event of withdrawal of the text from the publishing process or submitting it to another publisher without agreement from the editorial office, I agree to cover all costs incurred by the University of Silesia in connection with my application.
Vol. 11 No. 1 (2025)
Published: 2025-06-26
10.31261/tapsla
