Sentiment Analysis of China-Related News in The Star Online Newspaper

Hong Wu, Kesumawati A. Bakar, Azhar Jaludin, Norsimah Mat Awal

Abstract


As China and Malaysia approach their 47th year of diplomatic relations, cooperation and trust between the two countries have deepened in areas ranging from politics to the economy. Despite this mutual reliance, the relationship between Malaysia and China is not without conflict, and such conflict is often manifested in media reports. How China is presented in Malaysian news has scarcely been explored. As part of wider research on media sentiment towards China, this study investigates the general sentiment of China-related news in Malaysian media through sentiment analysis of selected news coverage. China-related news published in The Star Online from 2012 to 2021 was selected as the data for investigation. The Excel add-in Azure Machine Learning was used to generate the polarity of these news reports automatically, and the corpus tool WordSmith was used to analyse the news discourse. A total of 137,475 news items were collected as the research sample. The findings reveal that: 1) despite the large proportion of news with negative sentiment in China-related news in The Star Online, the monthly trend of sentiment shows a slight increase in positive sentiment over time; 2) an investigation of the keyword lists for the three months with the highest proportion of negative sentiment, together with the collocates of the top keywords, shows that the negative sentiment of the news may be due to a global conflict at that particular time and does not necessarily indicate negative sentiment towards China. Combining sentiment analysis with a corpus approach to the study of China-related news in Malaysian media enriches the study of news discourse from the perspective of corpus linguistics.
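The monthly aggregation described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the polarity labels in the study came from the Azure Machine Learning Excel add-in, whereas the code below assumes each news item has already been reduced to a hypothetical (month, label) pair with labels "positive", "neutral" or "negative". The function names and the toy records are invented for illustration only.

```python
from collections import defaultdict


def monthly_negative_share(records):
    """Return {month: share of items labelled 'negative'}, sorted by month.

    `records` is an iterable of (month, label) pairs, e.g. ("2012-01", "negative").
    The pair format and label names are assumptions made for this sketch.
    """
    totals = defaultdict(int)
    negatives = defaultdict(int)
    for month, label in records:
        totals[month] += 1
        if label == "negative":
            negatives[month] += 1
    return {m: negatives[m] / totals[m] for m in sorted(totals)}


def linear_slope(values):
    """Least-squares slope of `values` against their positions 0..n-1."""
    n = len(values)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    num = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den


def top_negative_months(shares, k=3):
    """Return the k months with the highest negative share."""
    return sorted(shares, key=shares.get, reverse=True)[:k]


# Toy data, invented for illustration; not taken from the study.
records = [
    ("2012-01", "negative"), ("2012-01", "negative"),
    ("2012-01", "positive"), ("2012-01", "negative"),
    ("2012-02", "negative"), ("2012-02", "positive"),
    ("2012-03", "positive"), ("2012-03", "positive"),
    ("2012-03", "negative"), ("2012-03", "positive"),
]

shares = monthly_negative_share(records)
slope = linear_slope(list(shares.values()))
print(shares)
print(slope)
print(top_negative_months(shares, k=2))
```

A negative slope in the monthly negative share corresponds to the kind of trend the abstract reports, namely sentiment becoming slightly more positive over time. The months returned by `top_negative_months` would be the natural candidates for the keyword and collocation analysis.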


Keywords


sentiment analysis; polarity; China; Malaysian English news; The Star Online

DOI: http://dx.doi.org/10.17576/gema-2022-2203-09

eISSN : 2550-2131

ISSN : 1675-8021