• Title/Summary/Keyword: Text processing

Search Result 1,191, Processing Time 0.028 seconds

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.155-174
    • /
    • 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

    ) were the topic modeling results for each research topic (
    ) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.

  • A Study on The 'Kao Zheng Pai'(考證派) of The Traditional Medicine of Japan (일본 '고증파(考證派)' 의학에 관한 연구)

    • Park, Hyun-Kuk;Kim, Ki-Wook
      • Journal of Korean Medical classics
      • /
      • v.20 no.4
      • /
      • pp.211-250
      • /
      • 2007
    • 1. The 'Kao Zheng Pai(考證派) comes from the 'Zhe Zhong Pai' and is a school that is influenced by the confucianism of the Qing dynasty. In Japan Inoue Kinga(井上金娥), Yoshida Koton(吉田篁墩) became central members, and the rise of the methodology of historical research(考證學) influenced the members of the 'Zhe Zhong Pai', and the trend of historical research changed from confucianism to medicine, making a school of medicine based on the study of texts and proving that the classics were right. 2. Based on the function of 'Nei Qu Li '(內驅力) the 'Kao Zheng Pai', in the spirit of 'use confucianism as the base', researched letters, meanings and historical origins. Because they were influenced by the methodology of historical research(考證學) of the Qing era, they valued the evidential research of classic texts, and there was even one branch that did only historical research, the 'Rue Xue Kao Zheng Pai'(儒學考證派). Also, the 'Yi Xue Kao Zheng Pai'(醫學考證派) appeared by the influence of Yoshida Kouton and Kariya Ekisai(狩谷掖齋). 3. In the 'Kao Zheng Pai(考證派)'s theories and views the 'Yi Xue Kao Zheng Pai' did not look at medical scriptures like the "Huang Di Nei Jing"("黃帝內經") and did not do research on 'medical' related areas like acupuncture, the meridian and medicinal herbs. Since they were doctors that used medicine, they naturally were based on 'formulas'(方劑) and since their thoughts were based on the historical ideologies, they valued the "Shang Han Ja Bing Lun" which was revered as the 'ancestor of all formulas'(衆方之祖). 4. The lives of the important doctors of the 'Kao Zheng Pai' Meguro Dotaku(目黑道琢) Yamada Seichin(山田正珍), Yamada Kyoko(山田業廣), Mori Ritsi(森立之) Kitamura Naohara(喜多村直寬) are as follows. 1) Meguro Dotaku(目黑道琢 1739${\sim}$1798) was born of lowly descent but, using his intelligence and knowledge, became a professor as a Shi Jing Yi(市井醫) and as a professor for 34 years at Ji Shou Guan mastered the "Huang Di Nei Jing" after giving over 300 lectures. Since his pupil, Isawara Ken taught the Lan Men Wu Zhe(蘭門五哲) and Shibue Chusai, Mori Ritsi(森立之), Okanishi Gentei(岡西玄亭), Kiyokawa Gendoh(淸川玄道) and Yamada Kyoko(山田業廣), Meguro Dotaku is considered the founder of the 'Yi Xue Kao Zheng Pai'. 2) The family of Yamada Seichin(山田正珍 1749${\sim}$1787) had been medical officials in the Makufu(幕府) and the many books that his ancestors had left were the base of his art. Seichin learned from Shan Ben Bei Shan(山本北山), a 'Zhe Zhong Pai' scholar, and put his efforts into learning, teaching and researching the "Shang Han Lun"("傷寒論"). Living in a time between 'Gu Fang Pai'(古方派) member Nakanishi Goretada(中西惟忠) and 'Kao Zheng Pai' member Taki Motohiro(多紀元簡), he wrote 11 books, 2 of which express his thoughts and research clearly, the "Shang Han Lun Ji Cheng"("傷寒論集成") and "Shang Han Kao"("傷寒考"). His comparison of the 'six meridians'(3 yin, 3 yang) between the "Shang Han Lun" and the "Su Wen Re Lun"("素問 熱論) and his acknowledgement of the need and rationality of the concept of Yin-Yang and Deficient-Replete distinguishes him from the other 'Gu Fang Pai'. Also, his dissertation of the need for the concept doesn't use the theories of latter schools but uses the theory of the "Shang Han Lun" itself. He even researched the historical parts, such as terms like 'Shen Nong Chang Bai Cao'(神農嘗百草) and 'Cheng Qi Tang'(承氣湯) 3) The ancestor of Yamada Kyoko(山田業廣) was a court physician, and learned confucianism from Kao Zheng Pai 's Ashikawa Genan(朝川善庵) and medicine from Isawa Ranken and Taki Motokata(多紀元堅), and the secret to smallpox from Ikeda Keisui(池田京水). He later became a lecturer at the Edo Yi Xue Guan(醫學館) and was invited as the director to the Ji Zhong(濟衆) hospital. He also became the first owner of the Wen Zhi She(溫知社), whose main purpose was the revival of kampo, and launched the monthly magazine Wen Zi Yi Tan(溫知醫談). He also diagnosed and prescribed for the prince Ming Gong(明宮). His works include the "Jing Fang Bian"("經方辨"), "Shang Han Lun Si Ci"("傷寒論釋司"), "Huang Zhao Zhu Jia Zhi Yan Ji Yao"("皇朝諸家治驗集要") and "Shang Han Ja Bing Lun Lei Juan"("傷寒雜病論類纂"). of these, the "Jing Fang Bian"("經方辨") states that the Shi Gao(石膏) used in the "Shang Han Lun" had three meanings-Fa Biao(發表), Qing Re(淸熱), Zi Yin(滋陰)-which were from 'symptoms', and first deducted the effects and then told of the reason. Another book, the "Jiu Zhe Tang Du Shu Ji"("九折堂讀書記") researched and translated the difficult parts of the "Shang Han Lun", "Jin Qui Yao Lue", "Qian Jin Fang"("千金方"), and "Wai Tai Mi Yao"("外臺秘要"). He usually analyzed the 'symptoms' of diseases but the composition, measurement, processing and application of medicine were all in the spectrum of 'analystic research' and 'researching analysis'. 4) The ancestors of Mori Rits(森立之 1807${\sim}$ 1885) were warriors but he became a doctor by the will of his mother, and he learned from Shibue Chosai(澁江抽齋) and Isawaran Ken and later became a pupil of Shou Gu Yi Zhai, a historical research scholar. He then became a lecturer of medical herbs at the Yi Xue Guan, and later participated in the proofreading of "Yi Xin Fang"("醫心方") and with Chosai compiled the "Jing Ji Fang Gu Zhi"("神農本草經"). He visited the Chinese scholar Yang Shou Jing(楊守敬) in 1881 and exchanged books and ideas. Of his works, there are the collections(輯複本) of "Shen Nong Ben Cao Jing"(神農本草經) and "You Xiang Yi Hwa"("遊相醫話") and the records, notes, poems, and diaries such as "Zhi Yuan Man Lu"("枳園漫錄") and "Zhi Yuan Sui Bi"("枳園隨筆") that were not published. His thoughts were that in restoring the "Shen Nong Ben Cao Jing", "the herb to the doctor is like the "Shuo Wen Jie Zi"("說文解字") to the scholar", and he tried to restore the ancient herbal text using knowledge of medicine and investigation(考據). Also with Chosai he compiled the "Jing Ji Fang Gu Zhi"("經籍訪古志") using knowledge of ancient text. Ritzi left works on pure investigation, paid much attention to social problems, and through 12 years of poverty treated all people and animals in all branches of medicine, so he is called a 'half confucianist half doctor'(半儒半醫). 5) Kitamurana Ohira(喜多村直寬 1804${\sim}$1876) learned scriptures and ancient texts from confucian scholar Asaka Gonsai, and learned medicine from his father Huai Yaun(槐園). He became a teacher in the Yi Xue Guan in his middle ages, and to repay his country, he printed 266 volumes of "Yi Fang Lei Ju("醫方類聚") and 1000 volumes of "Tai Ping Yu Lan"("太平禦覽") and devoted it to his country to be spread. His works are about 40 volumes including "Jin Qui Yao Lue Shu Yi" and "Lao Yi Zhi Yan" but most of them are researches on the "Shang Han Za Bing Lun". In his "Shang Han Lun Shu Yi"("傷寒論疏義") he shows the concept of the six meridians through the Yin-Yang, Superficial or internal, cold or hot, deficient or replete state of diseases, but did not match the names with the six meridians of the meridian theory, and this has something in common with the research based on the confucianism of Song(宋儒). In clinical treatment he was positive toward old and new methods and also the experience of civilians, but was negative toward western medicine. 6) The ancestor of the Taki family Tanbano Yasuyori(丹波康賴 912-955) became a Yi Bo Shi(醫博士) by his medical skills and compiled the "Yi Xin Fang"("醫心方"). His first son Tanbano Shigeaki(丹波重明) inherited the Shi Yao Yuan(施藥院) and the third son Tanbano Masatada(丹波雅忠) inherited the Dian You Tou(典藥頭). Masatada's descendents succeeded him for 25 generations until the family name was changed to Jin Bao(金保) and five generations later it was changed again to Duo Ji(多紀). The research scholar Taki Motohiro was in the third generation after the last name was changed to Taki, and his family kept an important part in the line of medical officers in Japan. Taki Motohiro(多紀元簡 1755-1810) was a teacher in the Yi Xue Guan where his father was residing, and became the physician for the general Jia Qi(家齊). He had a short temper and was not good at getting on in the world, and went against the will of the king and was banished from Ao Yi Shi(奧醫師). His most famous works, the "Shang Han Lun Ji Yi" and "Jin Qui Yao Lue Ji Yi" are the work of 20 years of collecting the theories of many schools and discussing, and is one of the most famous books on the "Shang Han Lun" in Japan. "Yi Sheng" is a collection of essays on research. Also there are the "Su Wen Shi"("素問識"), "Ling Shu Shi"("靈樞識"), and the "Guan lu Fang Yao Bu"("觀聚方要補"). Taki Motohiro(多紀元簡)'s position was succeeded by his third son Yuan Yin(元胤 1789-1827), and his works include works of research such as "Nan Jing Shu Jeng"("難經疏證"), "Ti Ya"("體雅"), "Yao Ya"("藥雅"), "Ji Ya"("疾雅"), "Ming Yi Gong An"("名醫公案"), and "Yi Ji Kao"("醫籍考"). The "Yi Ji Kao" is 80 volumes in length and lists about 3000 books on medicine in China before the Qing Dao Guang(道光), and under each title are the origin, number of volumes, state of existence, and, if possible, the preface, Ba Yu(跋語) and biography of the author. The younger sibling of Yuan Yin(元胤 1789-1827), Yuan Jian(元堅 1795-1857) expounded ancient writings at the Yi Xue Guan only after he reached middle age, was chosen for the Ao Yi Shi(奧醫師) and later became a Fa Yan(法眼), Fa Yin(法印) and Yu Chi(樂匙). He left about 15 texts, including "Su Wen Shao Shi"("素間紹識"), "Yi Xin Fang"("醫心方"), published in school, "Za Bing Guang Yao"("雜病廣要"), "Shang Han Guang Yao"(傷寒廣要), and "Zhen Fu Yao Jue"("該腹要訣"). On the Taki family's founding and working of the Yi Xue Guan Yasuka Doumei(失數道明) said they were "the people who took the initiative in Edo era kampo medicine" and evaluated their deeds in the fields of 'research of ancient text', 'the founding of Ji Shou Guan and medical education', 'publication business', 'writing of medical text'. 5. The doctors of the 'Kao Zheng Pai ' based their operations on the Edo Yi Xue Guan, and made groups with people with similar ideas to them, making a relationship 'net'. For example the three families of Duo Ji(多紀), Tang Chuan(湯川) and Xi Duo Cun(喜多村) married and adopted with and from each other and made prefaces and epitaphs for each other. Thus, the Taki family, the state science of the Makufu, the tendency of thinking, one's own interests and glory, one's own knowledge, the need of the society all played a role in the development of kampo medicine in the 18th and 19th century.

    • PDF

    A Study on The 'Kao Zheng Pai'(考證派) of The Traditional Medicine of Japan (일본 '고증파(考證派)' 의학에 관한 연구)

    • Park, Hyun-Kuk;Kim, Ki-Wook
      • The Journal of Dong Guk Oriental Medicine
      • /
      • v.10
      • /
      • pp.1-40
      • /
      • 2008
    • 1.The 'Kao Zheng Pai'(考證派) comes from the 'Zhe Zhong Pai(折衷派)' and is a school that is influenced by the confucianism of the Qing dynasty. In Japan Inoue Kinga(井上金峨), Yoshida Koton(古田篁墩 $1745{\sim}1798$) became central members, and the rise of the methodology of historical research(考證學) influenced the members of the 'Zhe Zhong Pai', and the trend of historical research changed from confucianism to medicine, making a school of medicine based on the study of texts and proving that the classics were right. 2. Based on the function of 'Nei Qu Li'(內驅力) the 'Kao Zheng Pai', in the spirit of 'use confucianism as the base', researched letters, meanings and historical origins. Because they were influenced by the methodology of historical research(考證學) of the Qing era, they valued the evidential research of classic texts, and there was even one branch that did only historical research, the 'Rue Xue Kao Zheng Pai'(儒學考證派). Also, the 'Yi Xue Kao Zheng Pai'(醫學考證派) appeared by the influence of Yoshida Kouton and Kariya Ekisai(狩谷掖齋). 3. In the 'Kao Zheng Pai(考證派)'s theories and views the 'Yi Xue Kao Zheng Pai' did not look at medical scriptures like the "Huang Di Nei Jing"("黃帝內經") and did not do research on 'medical' related areas like acupuncture, the meridian and medicinal herbs. Since they were doctors that used medicine, they naturally were based on 'formulas'(方劑) and since their thoughts were based on the historical ideologies, they valued the "Shang Han Ja Bing Lun" which was revered as the 'ancestor of all formulas'(衆方之祖). 4. The lives of the important doctors of the 'Kao Zheng Pai' Meguro Dotaku(目黑道琢) Yamada Seichin(山田正珍), Yamada Kyoko(山田業廣), Mori Ritsi(森立之) Kitamura Naohara(喜多村直寬) are as follows. 1) Meguro Dotaku(目黑道琢 $1739{\sim}1798$) was born of lowly descent but, using his intelligence and knowledge, became a professor as a Shi Jing Yi(市井醫) and as a professor for 34 years at Ji Shou Guan(躋壽館) mastered the "Huang Di Nei Jing" after giving over 300 lectures. Since his pupil, Isawara Ken(伊澤蘭軒) taught the Lan Men Wu Zhe(蘭門五哲) and Shibue Chusai(澀江抽齋), Mori Ritsi(森立之), Okanishi Gentei(岡西玄亭), Kiyokawa Gendoh(淸川玄道) and Yamada Kyoko(山田業廣), Meguro Dotaku is considered the founder of the 'Yi Xue Kao Zheng Pai'. 2) The family of Yamada Seichin(山田正珍 $1749{\sim}1787$) had been medical officials in the Makufu(幕府) and the many books that his ancestors had left were the base of his art. Seichin learned from Shan Ben Bei Shan(山本北山), a 'Zhe Zhong Pai' scholar, and put his efforts into learning, teaching and researching the "Shang Han Lun"("傷寒論"). Living in a time between 'Gu Fang Pai'(古方派) member Nakanishi Goretada(中西惟忠) and 'Kao Zheng Pai' member Taki Motohiro(多紀元簡), he wrote 11 books, 2 of which express his thoughts and research clearly, the "Shang Han Lun Ji Cheng"("傷寒論集成") and "Shang Han Kao"("傷寒考"). His comparison of the 'six meridians'(3 yin, 3 yang) between the "Shang Han Lun" and the "Su Wen Re Lun"("素問 熱論") and his acknowledgement of the need and rationality of the concept of Yin-Yang and Deficient-Replete distinguishes him from the other 'Gu Fang Pai'. Also, his dissertation of the need for the concept doesn't use the theories of latter schools but uses the theory of the "Shang Han Lun" itself. He even researched the historical parts, such as terms like 'Shen Nong Chang Bai Cao'(神農嘗百草) and 'Cheng Qi Tang'(承氣湯). 3) The ancestor of Yamada Kyoko(山田業廣) was a court physician, and learned confucianism from Kao Zheng Pai's Ashikawa Genan(朝川善庵) and medicine from Isawa Ranken(伊澤蘭軒) and Taki Motokata(多紀元堅), and the secret to smallpox from Ikeda Keisui(池田京水). He later became a lecturer at the Edo Yi Xue Guan(醫學館) and was invited as the director to the Ji Zhong(濟衆) hospital. He also became the first owner of the Wen Zhi She(溫知社), whose main purpose was the revival of kampo, and launched the monthly magazine Wen Zi Yi Tan(溫知醫談). He also diagnosed and prescribed for the prince Ming Gong(明宮). His works include the "Jing Fang Bian"("經方辨"), "Shang Han Lun Si Ci"("傷寒論釋詞"), "Huang Zhao Zhu Jia Zhi Yan Ji Yao"("皇朝諸家治驗集要") and "Shang Han Ja Bing Lun Lei Juan"("傷寒雜病論類纂"). of these, the "Jing Fang Bian"("經方辨") states that the Shi Gao(石膏) used in the "Shang Han Lun" had three meanings-Fa Biao(發表), Qing Re(淸熱), Zi Yin(滋陰)-which were from 'symptoms', and first deducted the effects and then told of the reason. Another book, the "Jiu Zhe Tang Du Shu Ji"("九折堂讀書記") researched and translated the difficult parts of the "Shang Han Lun", "Jin Qui Yao Lue"("金匱要略"), "Qian Jin Fang"("千金方"), and "Wai Tai Mi Yao"("外臺秘要"). He usually analyzed the 'symptoms' of diseases but the composition, measurement, processing and application of medicine were all in the spectrum of 'analystic research' and 'researching analysis'. 4) The ancestors of Mori Ritsi(森立之 $1807{\sim}1885$) were warriors but he became a doctor by the will of his mother, and he learned from Shibue Chosai(澁江抽齋) and Isawaran Ken(伊澤蘭軒) and later became a pupil of Shou Gu Yi Zhai(狩谷掖齋), a historical research scholar. He then became a lecturer of medical herbs at the Yi Xue Guan, and later participated in the proofreading of "Yi Xin Fang"("醫心方") and with Chosai compiled the "Jing Ji Fang Gu Zhi"("經籍訪古志"). He visited the Chinese scholar Yang Shou Jing(楊守敬) in 1881 and exchanged books and ideas. Of his works, there are the collections(輯複本) of "Shen Nong Ben Cao Jing"("神農本草經") and "You Xiang Yi Hwa"("遊相醫話") and the records, notes, poems, and diaries such as "Zhi Yuan Man Lu"("枳園漫錄") and "Zhi Yuan Sui Bi"(枳園隨筆) that were not published. His thoughts were that in restoring the "Shen Nong Ben Cao Jing", "the herb to the doctor is like the "Shuo Wen Jie Zi"(說文解字) to the scholar", and he tried to restore the ancient herbal text using knowledge of medicine and investigation(考據), Also with Chosai he compiled the "Jing Ji Fang Gu Zhi"("經籍訪古志") using knowledge of ancient text. Ritzi left works on pure investigation, paid much attention to social problems, and through 12 years of poverty treated all people and animals in all branches of medicine, so he is called a 'half confucianist half doctor'(半儒半醫). 5) Kitamurana Ohira(喜多村直寬, $1804{\sim}1876$) learned scriptures and ancient texts from confucian scholar Asaka Gonsai(安積艮齋), and learned medicine from his father Huai Yaun(槐園), He became a teacher in the Yi Xue Guan in his middle ages, and to repay his country, he printed 266 volumes of "Yi Fang Lei Ju"("醫方類聚") and 1000 volumes of "Tai Ping Yu Lan"("太平禦覽") and devoted it to his country to be spread. His works are about 40 volumes including "Jin Qui Yao Lue Shu Yi"("金匱要略疏義") and "Lao Yi Zhi Yan"(老醫巵言) but most of them are researches on the "Shang Han Za Bing Lun". In his "Shang Han Lun Shu Yi"("傷寒論疏義") he shows the concept of the six meridians through the Yin-Yang, Superficial or internal, cold or hot, deficient or replete state of diseases, but did not match the names with the six meridians of the meridian theory, and this has something in common with the research based on the confucianism of Song(宋儒). In clinical treatment he was positive toward old and new methods and also the experience of civilians, but was negative toward western medicine. 6) The ancestor of the Taki family Tanbano Yasuyori(丹波康賴 $912{\sim}955$) became a Yi Bo Shi(醫博士) by his medical skills and compiled the "Yi Xin Fang"("醫心方"). His first son Tanbano Shigeaki(丹波重明) inherited the Shi Yao Yuan(施藥院) and the third son Tanbano Masatada(丹波雅忠) inherited the Dian You Tou(典藥頭). Masatada's descendents succeeded him for 25 generations until the family name was changed to Jin Bao(金保) and five generations later it was changed again to Duo Ji(多紀). The research scholar Taki Motohiro was in the third generation after the last name was changed to Taki, and his family kept an important part in the line of medical officers in Japan. Taki Motohiro(多紀元簡 $1755{\sim}1810$) was a teacher in the Yi Xue Guan where his father was residing, and became the physician for the general Jia Qi(家齊). He had a short temper and was not good at getting on in the world, and went against the will of the king and was banished from Ao Yi Shi(奧醫師). His most famous works, the "Shang Han Lun Ji Yi"("傷寒論輯義") and "Jin Qui Yao Lue Ji Yi"("金匱要略輯義") are the work of 20 years of collecting the theories of many schools and discussing, and is one of the most famous books on the "Shang Han Lun" in Japan. "Yi Sheng"("醫勝") is a collection of essays on research. Also there are the "Su Wen Shi"(素問識), "Ling Shu Shi"("靈樞識"), and the "Guan Ju Fang Yao Bu"("觀聚方要補"). Taki Motohiro(多紀元簡)'s position was succeeded by his third son Yuan Yin(元胤 $1789{\sim}1827$), and his works include works of research such as "Nan Jing Shu Jeng"(難經疏證), "Ti Ya"("體雅"), "Yao Ya"("藥雅"), "Ji Ya"(疾雅), "Ming Yi Gong An"(名醫公案), and "Yi Ji Kao"(醫籍考). The "Yi Ji Kao" is 80 volumes in length and lists about 3000 books on medicine in China before the Qing Dao Guang(道光), and under each title are the origin, number of volumes, state of existence, and, if possible, the preface, Ba Yu(跋語) and biography of the author. The younger sibling of Yuan Yin(元胤 $1789{\sim}1827$), Yuan Jian(元堅 $1795{\sim}1857$) expounded ancient writings at the Yi Xue Guan only after he reached middle age, was chosen for the Ao Yi Shi(奧醫師) and later became a Fa Yan(法眼), Fa Yin(法印) and Yu Chi(禦匙). He left about 15 texts, including "Su Wen Shao Shi"("素問紹識"), "Yi Xin Fang"("醫心方"), published in school, "Za Bing Guang Yao"("雜病廣要"), "Shang Han Guang Yao"("傷寒廣要"), and "Zhen Fu Yao Jue"("診腹要訣"). On the Taki family's founding and working of the Yi Xue Guan Yasuka Doumei(矢數道明) said they were "the people who took the initiative in Edo era kampo medicine" and evaluated their deeds in the fields of 'research of ancient text', the founding of Ji Shou Guan(躋壽館) and medical education', 'publication business', 'writing of medical text'. 5. The doctors of the 'Kao Zheng Pai' based their operations on the Edo Yi Xue Guan, and made groups with people with similar ideas to them, making a relationship 'net'. For example the three families of Duo Ji(多紀), Tang Chuan(湯川) and Xi Duo Cun(喜多村) married and adopted with and from each other and made prefaces and epitaphs for each other. Thus, the Taki family, the state science of the Makufu, the tendency of thinking, one's own interests and glory, one's own knowledge, the need of the society all played a role in the development of kampo medicine in the 18th and 19th century.

    • PDF

    A Mobile Landmarks Guide : Outdoor Augmented Reality based on LOD and Contextual Device (모바일 랜드마크 가이드 : LOD와 문맥적 장치 기반의 실외 증강현실)

    • Zhao, Bi-Cheng;Rosli, Ahmad Nurzid;Jang, Chol-Hee;Lee, Kee-Sung;Jo, Geun-Sik
      • Journal of Intelligence and Information Systems
      • /
      • v.18 no.1
      • /
      • pp.1-21
      • /
      • 2012
    • In recent years, mobile phone has experienced an extremely fast evolution. It is equipped with high-quality color displays, high resolution cameras, and real-time accelerated 3D graphics. In addition, some other features are includes GPS sensor and Digital Compass, etc. This evolution advent significantly helps the application developers to use the power of smart-phones, to create a rich environment that offers a wide range of services and exciting possibilities. To date mobile AR in outdoor research there are many popular location-based AR services, such Layar and Wikitude. These systems have big limitation the AR contents hardly overlaid on the real target. Another research is context-based AR services using image recognition and tracking. The AR contents are precisely overlaid on the real target. But the real-time performance is restricted by the retrieval time and hardly implement in large scale area. In our work, we exploit to combine advantages of location-based AR with context-based AR. The system can easily find out surrounding landmarks first and then do the recognition and tracking with them. The proposed system mainly consists of two major parts-landmark browsing module and annotation module. In landmark browsing module, user can view an augmented virtual information (information media), such as text, picture and video on their smart-phone viewfinder, when they pointing out their smart-phone to a certain building or landmark. For this, landmark recognition technique is applied in this work. SURF point-based features are used in the matching process due to their robustness. To ensure the image retrieval and matching processes is fast enough for real time tracking, we exploit the contextual device (GPS and digital compass) information. This is necessary to select the nearest and pointed orientation landmarks from the database. The queried image is only matched with this selected data. Therefore, the speed for matching will be significantly increased. Secondly is the annotation module. Instead of viewing only the augmented information media, user can create virtual annotation based on linked data. Having to know a full knowledge about the landmark, are not necessary required. They can simply look for the appropriate topic by searching it with a keyword in linked data. With this, it helps the system to find out target URI in order to generate correct AR contents. On the other hand, in order to recognize target landmarks, images of selected building or landmark are captured from different angle and distance. This procedure looks like a similar processing of building a connection between the real building and the virtual information existed in the Linked Open Data. In our experiments, search range in the database is reduced by clustering images into groups according to their coordinates. A Grid-base clustering method and user location information are used to restrict the retrieval range. Comparing the existed research using cluster and GPS information the retrieval time is around 70~80ms. Experiment results show our approach the retrieval time reduces to around 18~20ms in average. Therefore the totally processing time is reduced from 490~540ms to 438~480ms. The performance improvement will be more obvious when the database growing. It demonstrates the proposed system is efficient and robust in many cases.

    A Study on Automatic Classification Model of Documents Based on Korean Standard Industrial Classification (한국표준산업분류를 기준으로 한 문서의 자동 분류 모델에 관한 연구)

    • Lee, Jae-Seong;Jun, Seung-Pyo;Yoo, Hyoung Sun
      • Journal of Intelligence and Information Systems
      • /
      • v.24 no.3
      • /
      • pp.221-241
      • /
      • 2018
    • As we enter the knowledge society, the importance of information as a new form of capital is being emphasized. The importance of information classification is also increasing for efficient management of digital information produced exponentially. In this study, we tried to automatically classify and provide tailored information that can help companies decide to make technology commercialization. Therefore, we propose a method to classify information based on Korea Standard Industry Classification (KSIC), which indicates the business characteristics of enterprises. The classification of information or documents has been largely based on machine learning, but there is not enough training data categorized on the basis of KSIC. Therefore, this study applied the method of calculating similarity between documents. Specifically, a method and a model for presenting the most appropriate KSIC code are proposed by collecting explanatory texts of each code of KSIC and calculating the similarity with the classification object document using the vector space model. The IPC data were collected and classified by KSIC. And then verified the methodology by comparing it with the KSIC-IPC concordance table provided by the Korean Intellectual Property Office. As a result of the verification, the highest agreement was obtained when the LT method, which is a kind of TF-IDF calculation formula, was applied. At this time, the degree of match of the first rank matching KSIC was 53% and the cumulative match of the fifth ranking was 76%. Through this, it can be confirmed that KSIC classification of technology, industry, and market information that SMEs need more quantitatively and objectively is possible. In addition, it is considered that the methods and results provided in this study can be used as a basic data to help the qualitative judgment of experts in creating a linkage table between heterogeneous classification systems.

    Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

    • Park, Hee Jin;Jang, Kyoung Ye;Lee, Youn Ho;Kim, Woo Je;Kang, Pil Sung
      • KIPS Transactions on Software and Data Engineering
      • /
      • v.4 no.11
      • /
      • pp.509-520
      • /
      • 2015
    • College Scholastic Ability Test(CSAT) is a primary test to evaluate the study achievement of high-school students and used by most universities for admission decision in South Korea. Because its level of difficulty is a significant issue to both students and universities, the government makes a huge effort to have a consistent difficulty level every year. However, the actual levels of difficulty have significantly fluctuated, which causes many problems with university admission. In this paper, we build two types of data-driven prediction models to predict correct answer rate and to identify significant factors for CSAT English test through accumulated test data of CSAT, unlike traditional methods depending on experts' judgments. Initially, we derive candidate question-specific factors that can influence the correct answer rate, such as the position, EBS-relation, readability, from the annual CSAT practices and CSAT for 10 years. In addition, we drive context-specific factors by employing topic modeling which identify the underlying topics over the text. Then, the correct answer rate is predicted by multiple linear regression and level of difficulty is predicted by classification tree. The experimental results show that 90% of accuracy can be achieved by the level of difficulty (difficult/easy) classification model, whereas the error rate for correct answer rate is below 16%. Points and problem category are found to be critical to predict the correct answer rate. In addition, the correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of expected correct answer rate for both question-level and entire test-level, which will help CSAT examiners to control the level of difficulties.

    Evaluation of Preference by Bukhansan Dulegil Course Using Sentiment Analysis of Blog Data (블로그 데이터 감성분석을 통한 북한산둘레길 구간별 선호도 평가)

    • Lee, Sung-Hee;Son, Yong-Hoon
      • Journal of the Korean Institute of Landscape Architecture
      • /
      • v.49 no.3
      • /
      • pp.1-10
      • /
      • 2021
    • This study aimed to evaluate preferences of Bukhansan dulegil using sentiment analysis, a natural language processing technique, to derive preferred and non-preferred factors. Therefore, we collected blog articles written in 2019 and produced sentimental scores by the derivation of positive and negative words in the texts for 21 dulegil courses. Then, content analysis was conducted to determine which factors led visitors to prefer or dislike each course. In blogs written about Bukhansan dulegil, positive words appeared in approximately 73% of the content, and the percentage of positive documents was significantly higher than that of negative documents for each course. Through this, it can be seen that visitors generally had positive sentiments toward Bukhansan dulegil. Nevertheless, according to the sentiment score analysis, all 21 dulegil courses belonged to both the preferred and non-preferred courses. Among courses, visitors preferred less difficult courses, in which they could walk without a burden, and in which various landscape elements (visual, auditory, olfactory, etc.) were harmonious yet distinct. Furthermore, they preferred courses with various landscapes and landscape sequences. Additionally, visitors appreciated the presence of viewpoints, such as observation decks, as a significant factor and preferred courses with excellent accessibility and information provisions, such as information boards. Conversely, the dissatisfaction with the dulegil courses was due to noise caused by adjacent roads, excessive urban areas, and the inequality or difficulty of the course which was primarily attributed to insufficient information on the landscape or section of the course. The results of this study can serve not only serve as a guide in national parks but also in the management of nearby forest green areas to formulate a plan to repair and improve dulegil. Further, the sentiment analysis used in this study is meaningful in that it can continuously monitor actual users' responses towards natural areas. However, since it was evaluated based on a predefined sentiment dictionary, continuous updates are needed. Additionally, since there is a tendency to share positive content rather than negative views due to the nature of social media, it is necessary to compare and review the results of analysis, such as with on-site surveys.

    Analysis of Social Trends for Electric Scooters Using Dynamic Topic Modeling and Sentiment Analysis (동적 토픽 모델링과 감성 분석을 활용한 전동킥보드에 대한 사회적 동향 분석)

    • Kyoungok, Kim;Yerang, Shin
      • KIPS Transactions on Software and Data Engineering
      • /
      • v.12 no.1
      • /
      • pp.19-30
      • /
      • 2023
    • An electric scooter(e-scooter), one popularized micro-mobility vehicle has shown rapidly increasing use in many cities. In South Korea, the use of e-scooters has greatly increased, as some companies have launched e-scooter sharing services in a few large cities, starting with Seoul in 2018. However, the use of e-scooters is still controversial because of issues such as parking and safety. Since the perception toward the means of transportation affects the mode choice, it is necessary to track the trends for electric scooters to make the use of e-scooters more active. Hence, this study aimed to analyze the trends related to e-scooters. For this purpose, we analyzed news articles related to e-scooters published from 2014 to 2020 using dynamic topic modeling to extract issues and sentiment analysis to investigate how the degree of positive and negative opinions in news articles had changed. As a result of topic modeling, it was possible to extract three different topics related to micro-mobility technologies, shared e-scooter services, and regulations for micro-mobility, and the proportion of the topic for regulations for micro-mobility increased as shared e-scooter services increased in recent years. In addition, the top positive words included quick, enjoyable, and easy, whereas the top negative words included threat, complaint, and ilegal, which implies that people satisfied with the convenience of e-scooter or e-scooter sharing services, but safety and parking issues should be addressed for micro-mobility services to become more active. In conclusion, this study was able to understand how issues and social trends related to e-scooters have changed, and to determine the issues that need to be addressed. Moreover, it is expected that the research framework using dynamic topic modeling and sentiment analysis will be helpful in determining social trends on various areas.

    Open Digital Textbook for Smart Education (스마트교육을 위한 오픈 디지털교과서)

    • Koo, Young-Il;Park, Choong-Shik
      • Journal of Intelligence and Information Systems
      • /
      • v.19 no.2
      • /
      • pp.177-189
      • /
      • 2013
    • In Smart Education, the roles of digital textbook is very important as face-to-face media to learners. The standardization of digital textbook will promote the industrialization of digital textbook for contents providers and distributers as well as learner and instructors. In this study, the following three objectives-oriented digital textbooks are looking for ways to standardize. (1) digital textbooks should undertake the role of the media for blended learning which supports on-off classes, should be operating on common EPUB viewer without special dedicated viewer, should utilize the existing framework of the e-learning learning contents and learning management. The reason to consider the EPUB as the standard for digital textbooks is that digital textbooks don't need to specify antoher standard for the form of books, and can take advantage od industrial base with EPUB standards-rich content and distribution structure (2) digital textbooks should provide a low-cost open market service that are currently available as the standard open software (3) To provide appropriate learning feedback information to students, digital textbooks should provide a foundation which accumulates and manages all the learning activity information according to standard infrastructure for educational Big Data processing. In this study, the digital textbook in a smart education environment was referred to open digital textbook. The components of open digital textbooks service framework are (1) digital textbook terminals such as smart pad, smart TVs, smart phones, PC, etc., (2) digital textbooks platform to show and perform digital contents on digital textbook terminals, (3) learning contents repository, which exist on the cloud, maintains accredited learning, (4) App Store providing and distributing secondary learning contents and learning tools by learning contents developing companies, and (5) LMS as a learning support/management tool which on-site class teacher use for creating classroom instruction materials. In addition, locating all of the hardware and software implement a smart education service within the cloud must have take advantage of the cloud computing for efficient management and reducing expense. The open digital textbooks of smart education is consdered as providing e-book style interface of LMS to learners. In open digital textbooks, the representation of text, image, audio, video, equations, etc. is basic function. But painting, writing, problem solving, etc are beyond the capabilities of a simple e-book. The Communication of teacher-to-student, learner-to-learnert, tems-to-team is required by using the open digital textbook. To represent student demographics, portfolio information, and class information, the standard used in e-learning is desirable. To process learner tracking information about the activities of the learner for LMS(Learning Management System), open digital textbook must have the recording function and the commnincating function with LMS. DRM is a function for protecting various copyright. Currently DRMs of e-boook are controlled by the corresponding book viewer. If open digital textbook admitt DRM that is used in a variety of different DRM standards of various e-book viewer, the implementation of redundant features can be avoided. Security/privacy functions are required to protect information about the study or instruction from a third party UDL (Universal Design for Learning) is learning support function for those with disabilities have difficulty in learning courses. The open digital textbook, which is based on E-book standard EPUB 3.0, must (1) record the learning activity log information, and (2) communicate with the server to support the learning activity. While the recording function and the communication function, which is not determined on current standards, is implemented as a JavaScript and is utilized in the current EPUB 3.0 viewer, ths strategy of proposing such recording and communication functions as the next generation of e-book standard, or special standard (EPUB 3.0 for education) is needed. Future research in this study will implement open source program with the proposed open digital textbook standard and present a new educational services including Big Data analysis.

    An Analytical Approach Using Topic Mining for Improving the Service Quality of Hotels (호텔 산업의 서비스 품질 향상을 위한 토픽 마이닝 기반 분석 방법)

    • Moon, Hyun Sil;Sung, David;Kim, Jae Kyeong
      • Journal of Intelligence and Information Systems
      • /
      • v.25 no.1
      • /
      • pp.21-41
      • /
      • 2019
    • Thanks to the rapid development of information technologies, the data available on Internet have grown rapidly. In this era of big data, many studies have attempted to offer insights and express the effects of data analysis. In the tourism and hospitality industry, many firms and studies in the era of big data have paid attention to online reviews on social media because of their large influence over customers. As tourism is an information-intensive industry, the effect of these information networks on social media platforms is more remarkable compared to any other types of media. However, there are some limitations to the improvements in service quality that can be made based on opinions on social media platforms. Users on social media platforms represent their opinions as text, images, and so on. Raw data sets from these reviews are unstructured. Moreover, these data sets are too big to extract new information and hidden knowledge by human competences. To use them for business intelligence and analytics applications, proper big data techniques like Natural Language Processing and data mining techniques are needed. This study suggests an analytical approach to directly yield insights from these reviews to improve the service quality of hotels. Our proposed approach consists of topic mining to extract topics contained in the reviews and the decision tree modeling to explain the relationship between topics and ratings. Topic mining refers to a method for finding a group of words from a collection of documents that represents a document. Among several topic mining methods, we adopted the Latent Dirichlet Allocation algorithm, which is considered as the most universal algorithm. However, LDA is not enough to find insights that can improve service quality because it cannot find the relationship between topics and ratings. To overcome this limitation, we also use the Classification and Regression Tree method, which is a kind of decision tree technique. Through the CART method, we can find what topics are related to positive or negative ratings of a hotel and visualize the results. Therefore, this study aims to investigate the representation of an analytical approach for the improvement of hotel service quality from unstructured review data sets. Through experiments for four hotels in Hong Kong, we can find the strengths and weaknesses of services for each hotel and suggest improvements to aid in customer satisfaction. Especially from positive reviews, we find what these hotels should maintain for service quality. For example, compared with the other hotels, a hotel has a good location and room condition which are extracted from positive reviews for it. In contrast, we also find what they should modify in their services from negative reviews. For example, a hotel should improve room condition related to soundproof. These results mean that our approach is useful in finding some insights for the service quality of hotels. That is, from the enormous size of review data, our approach can provide practical suggestions for hotel managers to improve their service quality. In the past, studies for improving service quality relied on surveys or interviews of customers. However, these methods are often costly and time consuming and the results may be biased by biased sampling or untrustworthy answers. The proposed approach directly obtains honest feedback from customers' online reviews and draws some insights through a type of big data analysis. So it will be a more useful tool to overcome the limitations of surveys or interviews. Moreover, our approach easily obtains the service quality information of other hotels or services in the tourism industry because it needs only open online reviews and ratings as input data. Furthermore, the performance of our approach will be better if other structured and unstructured data sources are added.


    (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.