A Corpus-Based Study Based on a Structural and Functional Analysis of Lexical Bundles in Online Business News

Piriya Thaksanan

Abstract


While numerous studies have explored formulaic language in English for Specific Purposes (ESP) across genres and registers, the linguistic demands of rapid digitalization, especially in online business news, remain under-investigated. Unlike previous studies, this study analyzes lexical bundles in business news articles published in 2025, capturing shifts in register driven by trade policy and tariffs. Specifically, its objectives were to investigate the frequency of lexical bundles used in online business news from January to May 2025 and to examine the extent to which a functional taxonomy can characterize these bundles. A total of 450 business news articles from three news agencies (BBC, CNN, and Reuters), comprising 321,225 running-word tokens, were compiled and analyzed using the AntConc software. The analysis identified 67 four-word lexical bundles meeting the frequency and range criteria. Structurally, noun phrase-based bundles, e.g., the world’s largest and a global trade war, were the most frequent, followed by prepositional phrase-based, verb phrase-based, and clause-based bundles. Functionally, referential bundles, e.g., in the first quarter and at the same time, were predominant, particularly those indicating identification/focus and time/place/text deixis. Stance bundles and special conversational bundles, especially reporting expressions, e.g., said in a statement, were also notable. The findings offer instructors a resource for teaching lexical bundles, specifically those found in online business news. Furthermore, they suggest that ESP instructors should prioritize noun-phrase structures to help students master the linguistic style of modern digital reporting.
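The extraction step summarized above, keeping only four-word sequences that satisfy both a frequency criterion and a range criterion, can be illustrated with a minimal Python sketch. It is not the study's procedure, which used AntConc. The function names `tokenize` and `extract_bundles`, the simple tokenizer, and the default cut-offs (`min_per_million=20.0`, `min_range=5`) are assumptions chosen for illustration, since the abstract does not state the thresholds applied.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens (simplified rule)."""
    return re.findall(r"[a-z0-9]+(?:['’][a-z]+)?", text.lower())


def extract_bundles(texts, n=4, min_per_million=20.0, min_range=5):
    """Return (bundle, raw_count, text_count) for n-grams passing both cut-offs.

    min_per_million and min_range are hypothetical thresholds; the abstract
    does not report the values used in the study.
    """
    freq = Counter()                # raw frequency of each n-gram
    text_range = defaultdict(set)   # indices of texts containing each n-gram
    total_tokens = 0

    for idx, text in enumerate(texts):
        tokens = tokenize(text)
        total_tokens += len(tokens)
        # Simplification: n-grams may span sentence boundaries.
        for i in range(len(tokens) - n + 1):
            gram = " ".join(tokens[i:i + n])
            freq[gram] += 1
            text_range[gram].add(idx)

    if total_tokens == 0:
        return []

    results = []
    for gram, count in freq.items():
        per_million = count * 1_000_000 / total_tokens
        if per_million >= min_per_million and len(text_range[gram]) >= min_range:
            results.append((gram, count, len(text_range[gram])))

    # Most frequent first; ties broken alphabetically.
    return sorted(results, key=lambda r: (-r[1], r[0]))
```

Normalizing the frequency cut-off per million words lets the same threshold be applied to corpora of different sizes, and the range criterion filters out sequences that recur within only one or two articles.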

 

Keywords: Business English; corpus linguistics; formulaic language; lexical bundles; online business news

 

ABSTRACT

Although many studies have explored formulaic language in English for Specific Purposes (ESP) across various genres and registers, the linguistic demands arising from expanding digitalization, particularly in online business news, remain under-examined. Unlike previous studies, this study analyzes lexical bundles in business news articles published in 2025 to identify shifts in register driven by trade policy and tariffs. The objectives of the study are to investigate the frequency of lexical bundles used in online business news from January to May 2025 and to examine the extent to which a functional taxonomy can characterize these bundles in business news reporting. A total of 450 business news articles from BBC, CNN, and Reuters, comprising 321,225 word tokens, were collected and analyzed using the AntConc software. The analysis identified 67 four-word lexical bundles that met the set frequency and range criteria. Structurally, noun phrase-based bundles (e.g., 'the world’s largest', 'a global trade war') were the most frequently used, followed by prepositional phrase-based, verb phrase-based, and clause-based bundles. Functionally, referential bundles (e.g., 'in the first quarter', 'at the same time') were more dominant, particularly those indicating identification/focus and time/place/text deixis. Stance bundles and special conversational bundles, especially reporting clauses (e.g., 'said in a statement'), were also found in the data. The findings provide a meaningful resource for educators seeking to improve students' learning of lexical bundles, particularly in online business news. In addition, the findings suggest that ESP educators should give priority to noun phrase structures to help students master the linguistic style of modern digital reporting.

Keywords: Business English; corpus linguistics; formulaic language; lexical bundles; online business news




eISSN : 2550-2131

ISSN : 1675-8021