Browse > Article
http://dx.doi.org/10.4217/OPR.2019.41.2.107

Frequency Analysis of Scientific Texts on the Hypoxia Using Bibliographic Data  

Lee, GiSeop (Ocean Data Science Section, KIOST)
Lee, JiYoung (Marine Environment Research Division, National Institute of Fisheries Science)
Cho, HongYeon (Ocean Data Science Section, KIOST)
Publication Information
Ocean and Polar Research / v.41, no.2, 2019 , pp. 107-120 More about this Journal
Abstract
The frequency analysis of scientific terms using bibliographic information is a simple concept, but as relevant data become more widespread, manual analysis of all data is practically impossible or only possible to a very limited extent. In addition, as the scale of oceanographic research has expanded to become much more comprehensive and widespread, the allocation of research resources on various topics has become an important issue. In this study, the frequency analysis of scientific terms was performed using text mining. The data used in the analysis is a general-purpose scholarship database, totaling 2,878 articles. Hypoxia, which is an important issue in the marine environment, was selected as a research field and the frequencies of related words were analyzed. The most frequently used words were 'Organic matter', 'Bottom water', and 'Dead zone' and specific areas showed high frequency. The results of this research can be used as a basis for the allocation of research resources to the frequency of use of related terms in specific fields when planning a large research project represented by single word.
Keywords
bibliographic data; data mining; text mining; hypoxia;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Wu Y, Xie L, Huang SL, Li P, Yuan Z, Liu W (2018) Using social media to strengthen public awareness of wildlife conservation. Ocean Coast Manage 153:76-83   DOI
2 Yang D, Kleissl J, Gueymard CA, Pedro HT, Coimbra CF (2018) History and trends in solar irradiance and PV power forecasting: a preliminary assessment and review using text mining. Sol Energy 168:60-101   DOI
3 Zhang W, Qian W, Ho YS (2009) A bibliometric analysis of research related to ocean circulation. Scientometrics 80(2):305-316   DOI
4 Lee GS, Cho HY, Han JR (2018) Text mining analysis on the research field of the coastal and ocean engineering based on the SCOPUS bibliographic information. J Korean Soc Coast Ocean Eng 30(1):19-28   DOI
5 Jung YB, Park ES (2015) Keyword analysis of two SCI journals on rock engineering by using text mining. Kor Soc Rock Mech 25(4):303-319
6 Cho GH, Lim SY, Hur S (2014) An analysis of the research methodologies and techniques in the industrial engineering using text mining. J Kor Ins Ind Eng 40(1):52-59   DOI
7 Cho SG, Kim SB (2012) Finding meaningful pattern of key words in IIE transactions using text mining. J Kor Ins Ind Eng 38(1):67-73   DOI
8 Aria M, Cuccurullo C (2017) Bliometrix: an R-tool for comprehensive science mapping analysis. J Informetr 11(4):959-975   DOI
9 Bartlett MS (1947) The use of transformations. Biometrics 3(1):39-52   DOI
10 Boussalis C, Coan TG (2016) Text-mining the signals of climate change doubt. Global Environ Chang 36:89-100   DOI
11 Costa S, Caldeira R (2018) Bibliometric analysis of ocean literacy: an underrated term in the scientific literature. Mar Policy 87:149-157   DOI
12 Dawei L, Guan-tin C (2018) Wordcloud2: create word cloud by 'htmlwidget'. R package version 0.2.1. https://CRAN.R-project.org/package=wordcloud2 Accessed 21 Feb 2019
13 Diaz RJ, Rosenberg R (2008) Spreading dead zones and consequences for marine ecosystems. Science 321(5891):926-929   DOI
14 Landhuis E (2016) Scientific literature: information overload. Nature 535(7612):457-458   DOI
15 Donoho D (2017) 50 years of data science. J Comput Graph Stat 26(4):745-766   DOI
16 Gattuso JP, Dawson NA, Duarte CM, Middelburg JJ (2005) Patterns of publication effort in coastal biogeochemistry: a bibliometric survey (1971 to 2003). Mar Ecol-Prog Ser 294:9-22   DOI
17 Gavish M, Donoho D (2012) Three dream applications of verifiable computational results. Comput Sci Eng 14(4):26-31   DOI
18 Hui I (2017) Shaping the coast with permits: making the state regulatory permitting process transparent with text mining. Coast Manage 45(3):179-198   DOI
19 Keeling RF, Kortzinger A, Gruber N (2010) Ocean deoxygenation in a warming world. Annu Rrev Mar Sci 2:199-229   DOI
20 Meyer D, Hornik K, Feinerer I (2008) Text mining infrastructure in R. J Stat Softw 25(5):1-54
21 Ming F, Wong F, Liu Z, Chiang M (2014) Stock market prediction from WSJ: text mining via sparse matrix factorization. In: IEEE International Conference on Data Mining, Shenzhen, 14-17 Dec 2014
22 Oschlies A, Brandt P, Stramma L, Schmidtko S (2018) Drivers and mechanisms of ocean deoxygenation. Nat Geosci 11(7):467-473   DOI
23 Peterson RA (2017) Estimating normalization transformations with bestNormalize. https://github.com/petersonR/bestNormalize Accessed 12 Jun 2019
24 Pita P, Villasante S, Arlinghaus R, Gomes P, Strehlow HV, Veiga P, Vingada J, Hyder K (2018) A matter of scales: does the management of marine recreational fisheries follow the ecosystem approach to fisheries in Europe? Mar Policy 97:61-71   DOI
25 Robertson S (2004) Understanding inverse document frequency: on theoretical arguments for IDF. J Doc 60(5):503-520   DOI
26 Sparck JK (1972) A statistical interpretation of term specificity and its application in retrieval. J Doc 28(1):11-21   DOI
27 Rudd MA (2017) What a decade (2006-15) of journal abstracts can tell us about trends in ocean and coastal sustainability challenges and solutions. Front Mar Sci 4:170. doi:10.3389/fmars.2017.00170   DOI
28 Rudis B, Embrey B (2016) Pluralize: pluralize and singularize any (English) word, R package version 0.1.0. https://github.com/hrbrmstr/pluralize Accessed 21 Feb 2019
29 Silge J, Robinson D (2016) Tidytext: text mining and analysis using tidy data principles in R. JOSS 1(3):37. doi:10.21105/joss.00037   DOI
30 Sun J, Wang MH, Ho YS (2012) A historical review and bibliometric analysis of research on estuary pollution. Mar Pollut Bull 64(1):13-21   DOI
31 Tapio I, Fischer D, Blasco L, Tapio M, Wallace RJ, Bayat AR, Ventto L, Kahala M, Negussie E, Shingrield KJ, Vilkki J (2017) Taxon abundance, diversity, co-occurrence and network analysis of the ruminal microbiota in response to dietary changes in dairy cows. PLoS One 12(7):e0180260   DOI
32 Van der Waerden BL (1952) Order tests for the two-sample problem and their power. Indagat Math 55:453-458   DOI
33 Van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JA (2006) A text-mining analysis of the human phenome. Eur J Hum Genet 14(5):535-542   DOI
34 Westergaard D, Staerfeldt HH, Tonsberg C, Jensen LJ, Brunak S (2018) A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts. PLoS Comput Biol 14(2):e1005962. doi:10.1371/journal.pcbi.1005962   DOI
35 Wickham H (2017) Tidyverse: easily install and load the 'Tidyverse'. R package version 1.2.1. https://CRAN.R-project.org/package=tidyverse Accessed 21 Feb 2019