• Title/Summary/Keyword: Collecting system

Search Result 1,552, Processing Time 0.023 seconds

A Proposal of a Keyword Extraction System for Detecting Social Issues (사회문제 해결형 기술수요 발굴을 위한 키워드 추출 시스템 제안)

  • Jeong, Dami;Kim, Jaeseok;Kim, Gi-Nam;Heo, Jong-Uk;On, Byung-Won;Kang, Mijung
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.1-23
    • /
    • 2013
  • To discover significant social issues such as unemployment, economy crisis, social welfare etc. that are urgent issues to be solved in a modern society, in the existing approach, researchers usually collect opinions from professional experts and scholars through either online or offline surveys. However, such a method does not seem to be effective from time to time. As usual, due to the problem of expense, a large number of survey replies are seldom gathered. In some cases, it is also hard to find out professional persons dealing with specific social issues. Thus, the sample set is often small and may have some bias. Furthermore, regarding a social issue, several experts may make totally different conclusions because each expert has his subjective point of view and different background. In this case, it is considerably hard to figure out what current social issues are and which social issues are really important. To surmount the shortcomings of the current approach, in this paper, we develop a prototype system that semi-automatically detects social issue keywords representing social issues and problems from about 1.3 million news articles issued by about 10 major domestic presses in Korea from June 2009 until July 2012. Our proposed system consists of (1) collecting and extracting texts from the collected news articles, (2) identifying only news articles related to social issues, (3) analyzing the lexical items of Korean sentences, (4) finding a set of topics regarding social keywords over time based on probabilistic topic modeling, (5) matching relevant paragraphs to a given topic, and (6) visualizing social keywords for easy understanding. In particular, we propose a novel matching algorithm relying on generative models. The goal of our proposed matching algorithm is to best match paragraphs to each topic. Technically, using a topic model such as Latent Dirichlet Allocation (LDA), we can obtain a set of topics, each of which has relevant terms and their probability values. In our problem, given a set of text documents (e.g., news articles), LDA shows a set of topic clusters, and then each topic cluster is labeled by human annotators, where each topic label stands for a social keyword. For example, suppose there is a topic (e.g., Topic1 = {(unemployment, 0.4), (layoff, 0.3), (business, 0.3)}) and then a human annotator labels "Unemployment Problem" on Topic1. In this example, it is non-trivial to understand what happened to the unemployment problem in our society. In other words, taking a look at only social keywords, we have no idea of the detailed events occurring in our society. To tackle this matter, we develop the matching algorithm that computes the probability value of a paragraph given a topic, relying on (i) topic terms and (ii) their probability values. For instance, given a set of text documents, we segment each text document to paragraphs. In the meantime, using LDA, we can extract a set of topics from the text documents. Based on our matching process, each paragraph is assigned to a topic, indicating that the paragraph best matches the topic. Finally, each topic has several best matched paragraphs. Furthermore, assuming there are a topic (e.g., Unemployment Problem) and the best matched paragraph (e.g., Up to 300 workers lost their jobs in XXX company at Seoul). In this case, we can grasp the detailed information of the social keyword such as "300 workers", "unemployment", "XXX company", and "Seoul". In addition, our system visualizes social keywords over time. Therefore, through our matching process and keyword visualization, most researchers will be able to detect social issues easily and quickly. Through this prototype system, we have detected various social issues appearing in our society and also showed effectiveness of our proposed methods according to our experimental results. Note that you can also use our proof-of-concept system in http://dslab.snu.ac.kr/demo.html.

Animal Infectious Diseases Prevention through Big Data and Deep Learning (빅데이터와 딥러닝을 활용한 동물 감염병 확산 차단)

  • Kim, Sung Hyun;Choi, Joon Ki;Kim, Jae Seok;Jang, Ah Reum;Lee, Jae Ho;Cha, Kyung Jin;Lee, Sang Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.137-154
    • /
    • 2018
  • Animal infectious diseases, such as avian influenza and foot and mouth disease, occur almost every year and cause huge economic and social damage to the country. In order to prevent this, the anti-quarantine authorities have tried various human and material endeavors, but the infectious diseases have continued to occur. Avian influenza is known to be developed in 1878 and it rose as a national issue due to its high lethality. Food and mouth disease is considered as most critical animal infectious disease internationally. In a nation where this disease has not been spread, food and mouth disease is recognized as economic disease or political disease because it restricts international trade by making it complex to import processed and non-processed live stock, and also quarantine is costly. In a society where whole nation is connected by zone of life, there is no way to prevent the spread of infectious disease fully. Hence, there is a need to be aware of occurrence of the disease and to take action before it is distributed. Epidemiological investigation on definite diagnosis target is implemented and measures are taken to prevent the spread of disease according to the investigation results, simultaneously with the confirmation of both human infectious disease and animal infectious disease. The foundation of epidemiological investigation is figuring out to where one has been, and whom he or she has met. In a data perspective, this can be defined as an action taken to predict the cause of disease outbreak, outbreak location, and future infection, by collecting and analyzing geographic data and relation data. Recently, an attempt has been made to develop a prediction model of infectious disease by using Big Data and deep learning technology, but there is no active research on model building studies and case reports. KT and the Ministry of Science and ICT have been carrying out big data projects since 2014 as part of national R &D projects to analyze and predict the route of livestock related vehicles. To prevent animal infectious diseases, the researchers first developed a prediction model based on a regression analysis using vehicle movement data. After that, more accurate prediction model was constructed using machine learning algorithms such as Logistic Regression, Lasso, Support Vector Machine and Random Forest. In particular, the prediction model for 2017 added the risk of diffusion to the facilities, and the performance of the model was improved by considering the hyper-parameters of the modeling in various ways. Confusion Matrix and ROC Curve show that the model constructed in 2017 is superior to the machine learning model. The difference between the2016 model and the 2017 model is that visiting information on facilities such as feed factory and slaughter house, and information on bird livestock, which was limited to chicken and duck but now expanded to goose and quail, has been used for analysis in the later model. In addition, an explanation of the results was added to help the authorities in making decisions and to establish a basis for persuading stakeholders in 2017. This study reports an animal infectious disease prevention system which is constructed on the basis of hazardous vehicle movement, farm and environment Big Data. The significance of this study is that it describes the evolution process of the prediction model using Big Data which is used in the field and the model is expected to be more complete if the form of viruses is put into consideration. This will contribute to data utilization and analysis model development in related field. In addition, we expect that the system constructed in this study will provide more preventive and effective prevention.

A study on Huh-Joon's medical thoughts in Dong-Eui-Bo-Kham (동의보감(東醫寶鑑)을 통한 허준의 의학사상에 관한 고찰)

  • Kwon, Hak-Cheol;Park, Chan-Guk
    • Journal of Korean Medical classics
    • /
    • v.6
    • /
    • pp.89-130
    • /
    • 1993
  • Huh-joon's medical thoughts shown on his medical book of the Doog-Eui-Bo-Kham can be summerized as follows. 1. The general trend of medical science in Koryo dynasty is that much more interests were concentrated upon the books about curative means rather than upon the books about theoretical knowledge of medical science. With the development of Hyang Yak(鄕樂) (the term referring either various kinds of domestic medical stuffs such as herbs or the curative methods using those stuffs) and the writing of books on Hyang Yak, independent medical science of the nation's own was established in late Koryo dynasty. And the national medical science was continuously further developed until early Choson dynasty. Briskly-expanded mutual exchanges with China in early Choson dynasty provided Choson opportunities to import Chinese medical science and to examine it. Under this circumstances, he wrote the Dong-Eui-Bo-Kham. 2. As we look over the preface and Chip-Rae-Muo(集例文), we can find the characterstic of Doog-Eui-Bo-Kham is that the philosophical theory of Taoism was quoted in explaining the principles of his medical science and that the main idea of Naekyuog is the basis in explaining the way of curing diseases. 3. 83 kinds of medical books were quoted in the Doog-Eui-Bo-Kham. Besides, as many as 200 kinds of books including Tao-tzu's teaching books(道書), history books(史書), almanac(曆書), and Confucius' teaching books(儒家書籍) were quoted in total. Naekyuog and Eue-Hak-Ip-Mun, Dan-Kye-Sim-Bup were the most frequently quoted books among them. 4. Huh-Joon's medical thoughts about health care were like these. 1) The reason why Huh-Joon regarded the idea of health care as of great importance was that he laid much more emphasises on the preventive medicines rather than on the remedial medicines. The direct reason was that he was greatly influenced by profound knowledge of Taoist's study of discipline and who participated in the editing the books from the beginning. 2) Huh-Joon's outlook on human body started from the theory of "Unity of Heaven and Man"(天人合一論), which implied man was a kind of miniature universe. In addition to that, he largely theory of essence(精), vital force(氣), and spirit(神) which were regarded very important as the three most valuable properties in Taoism. However, he took his medical ground on practical and pragmatic idea that he did not discuss fundamental essence(元精), fundamental vital force(元氣), and fundamental spirit(元神) which were given by Heaven from the received only the theory of essence, vital force, and spirit which were acquired after birth and worked mainly on realistic activity of life. 3) Huh-loon accepted Do-In-Bup(導引法) sharply as a method to prevent and cure diseases. 5. Huh-loon's medical thoughts on remedial aspects are as 1) Naekyung was considered so important in Dong-Eui-Bo-Kham that not only each paragraph was begun with the Quotations from Nackyung but also the edited order of the content of the book the same with that of Naekyung. And differently from the former korean medical books he accepted at large and recorded the theories of the four noted physicians of the Geum-Won era(金元四大家) by Dong-Eui-Bo-Kham. 2) For the first time, Huh-Joon introduced the theory of Un-Ki (運氣論) in the Dong-Eui-Bo-Kahm. However, he accepted it as a pathological function of human body but he did not apply physical constitution, physiological function, pathological function, and remedial methods. 3) Huh-loon liked to use Hyang Yak that he recorded korean name of Hyang Yak(鄕名), places of the production(産地), the time of collecting(採取時月), and the way of drying herbs(陰陽乾正法) in the remedial method of a single medicine prescription for diseases at the end of each paragraph. By doing so, he developed, arranged, and revived Hyang Yak. 4) He believed that since the natural features of China were different from those of Korea the reasons of being attacked with its remedial methods couldn't be the same with different from Chinese medical books which primarily focused on paralysis and the injury of the cold has his own structure in his book that he founded independent science of this nation. He consulted enormous documents He discovered and wrote the theory and therefore concrete methods for diseases so that the book hadthe principles of outbreak of diseases(理), methods of cure(法), prescription(方), and a single medicine prescription(藥) and set system of medical science in a good order. By doing so, he and pragmatic development of medical science.

  • PDF

Domestic and International Experts' Perception of Policy and Direction on STEAM Education (융합인재교육(STEAM)의 정책과 실행 방향에 대한 국내외 전문가들의 인식)

  • Jung, Jaehwa;Jeon, Jaedon;Lee, Hyonyong
    • Journal of Science Education
    • /
    • v.39 no.3
    • /
    • pp.358-375
    • /
    • 2015
  • The purposes of this study were to investigate the value, necessity and legitimacy of STEAM Education and to propose practical approaching methods for STEAM Education to be applicable in Korea through a variety of literature review, case studies and collecting suggestions from domestic and international educational experts. The research questions are as follows: (1) To investigate the perception, understanding and recognitions of domestic and foreign professionals in STEAM education. (2) To analyze policy implications for an improvement in STEAM. The following aspects of STEAM were found to be challenges in our current STEAM policy after analyzing multiple questionnaires with the professionals and case studies including their experiences, understanding, supports and directions of the policy from the governments. The results indicate that (1) there was a lack of precise and conceptual understanding of STEAM in respect to experience. Training sessions for teachers in this field to help transform their perception is necessary. Development of practical programs with an easy access is also required. It is important to get the aims of related educational activities recognized by the professionals and established standards for an evaluation. The experts perceived that a theme-based learning is the most preferred and effective approaching method and the programs that develop creative thinking and learning applicable to practice are required to promote. (2) The results indicate that there was a lack of programs and inducements for supporting outstanding STEAM educators. It is shown that making an appropriate environment for STEAM education takes the first priority before training numbers of teachers unilaterally, thus securing enough budget seems critical. The professionals also emphasize on developing specialized teaching materials that include diverse inter-related subjects such as science technology, engineering, arts and humanities and social science with diverse viewpoints and advanced technology. This work requires a STEAM network for teachers to link up and share their materials, documents and experiences. It is necessary to get corporations, universities, and research centers participated in the network. (3) With respect to direction, it is necessary to propose policy that makes STEAM education ordinary and more practical in the present education system. The professionals have recommended training sessions that help develop creative thinking and amalgamative problem-solving techniques. They require reducing the workload of teachers and changing teachers' perspectives towards STEAM. They further urge a tight cooperation between departments of the government related with STEAM.

  • PDF

The Current Status of Recycling Process and Problems of Recycling according to the Packaging Waste of Korea (국내 포장 폐기물에 따른 재질별 재활용 공정 현황 및 재활용 문제점)

  • Ko, Euisuk;Shim, Woncheol;Lee, Hakrae;Kang, Wookgeon;Shin, Jihyeon;Kwon, Ohcheol;Kim, Jaineung
    • KOREAN JOURNAL OF PACKAGING SCIENCE & TECHNOLOGY
    • /
    • v.24 no.2
    • /
    • pp.65-71
    • /
    • 2018
  • Paper packs, glass bottles, metal cans, and plastic materials are classified according to packaging material recycling groups that are Extended Producer Responsibility (EPR). In the case of waste paper pack, the compressed cartons are dissociated to separate polyethylene films and other foreign substance, and then these are washed, pulverized and dried to produce toilet paper. Glass bottle for recycling is provided to the bottle manufacturers after the process of collecting the waste glass bottle, removing the foreign substance, sorting by color, crushing, raw materializing process. Waste glass recycling technology of Korea is largely manual, except for removal of metal components and low specific gravity materials. Metal can is classified into iron and aluminum cans through an automatic sorting machine, compressed, and reproduced as iron and aluminum through a blast furnace. In the case of composite plastic material, the selected compressed product is crushed and then recycled through melt molding and refined products are produced through solid fuel manufacturing steps through emulsification and compression molding through pyrolysis. In the recycling process of paper packs, glass bottles, metal cans, and plastic materials, the influx of recycled materials and other substances interferes with the recycling process and increases the recycling cost and time. Therefore, the government needs to improve the legal system which is necessary to use materials and structure that are easy to recycle from the design stage of products or packaging materials.

Comparision of Family Environment, Health Behavior and Health State of Elementary Students in Urban and Rural Areas (도시.농촌 지역 초등학생의 가족환경, 건강행위 및 건강상태에 관한 비교)

  • Bae, Yeon-Suk;Park, Kyung-Min
    • Research in Community and Public Health Nursing
    • /
    • v.9 no.2
    • /
    • pp.502-517
    • /
    • 1998
  • This research intends to survey family environment, health behavior and health status of the students in urban-rural elementary schools and analyze those factors comparatively, and use the result as basic material for school health teacher to teach health education in connection with family and regional areas. It also intends to improve a pupil's self-abilitiy in health care. The subjects involve 2,774 students of urban elementary schools and 583 student in rural ones, who were selected by means of a multi -stage probability sampling. Using the questionnaire and school documents, we collected data on family environment, health behavior and health status for 19 days. Feb. 2nd 1998 through Feb. 20th 1998. The R -form of Family Environment Scale (Moos, 1974) was used in the analysis of family environment(Cronbach's Alpha =0.80). Questionnaires of Health Behavior in School-aged children used by the WHO in Europe(Aaro et al., 1986) and the ones developed by the Health Promotion Committee of the Western Pacific(WHO, 1995)(adapted by long Young-suk and Moon Young-hee(1996)) were used in the analysis of health behavior, as well documents on absences due to sickness, school health room-visits, levels of physical strength, height, weight and degree of obesity were used to determine health status. In next step, We used them with an $X^2$-test, t-test, Odds Ratio, and a 95% Confidence Interval. 1. In two dimensions of three, family-relationship (t=3.41, p=0.001) and system -maintenances(t= 2.41, p=0.0l6) the mean score of urban children were significantly higher than those of rural ones. In the personal development dimension however, there was little significant difference. Assorting family environment into 10 sub-fields and analyzing them, we recognized that urban children were superior to rural children in the sub-fields of expressiveness (t =3.47, p=0.001), conflict (t=0.48, p=0.001), active-recreational orientation (t = 1.97, p=0.049) and organization (t=4.33, p=0.000). 2. Referring to the Odds Ratios of urban-rural children's health behaviors, urban children set up more desirable behavior than rural children wear ing safety belts (Odds Ratio =0.32, p=0.000), washing hands after meals(Odds Ratio = 0.43, p= 0.000), washing hands after excreting (Odds Ratio = 0.39, p=O.OOO), washing hands after coming - home ( Odds Ratio = 0.75, p = 0.003), brushing teeth before sleeping(Odds Ratio =0.45, p=0.000), brushing teeth more than once a day (Odds Ratio =0.73, p=0.0l2), drinking boiled water (Odds Ratio = 0.49, p=0.000), collecting garbage at home(Odds Ratio=0.31, p=0.000) and in the school(Odds Ratio =0. 67, p=0.000). All these led to significant differences. As to taking milk(Odds Ratio = 1.50, p=0.000), taking care of eyesight(Odds Ratio=1.41, p=0.001) and getting physical exercise in(Odds Ratio = 1.33, p=0.0l9) and outside the school(Odds Ratio = 1.32, p=0.005), rural children had more desirable behavior which also revealed a significant difference. There was little significant difference in smoking, but the smoking rate of rural children(5.5%) was larger than that of urban children(3.9%). 3. Health status was analyzed in terms of absences, school health room-visits, levels of physical strength, and the degree of obesity, height and weight. Considering Odds Ratios of the health status of urban-rural children, the health status of rural children was significantly better than that of the urban ones in the level of physical strength(t=1.51, p=0.000) and the degree of obesity(t=1.84, p=0.000). The mean height of urban children ($150.4{\pm}7.5cm$) is taller than that of their counterparts($149.5{\pm}7.9$), which revealed a significant difference (t =2.47, p=0.0l4). The mean weight of urban children($42.9{\pm}8.6kg$) is larger than that of their counterparts($41.8{\pm}9.0kg$), which was also a significant difference(t=2.81, p=0.005). Considering the results above, we can recognize that there are significant differences in family environment, health behavior, and health status in urban-rural children. These results also suggestion ideas for health education. What we would suggest for the health program of elementary schools is that school health teachers should play an active role in promoting the need and importance of health education, develop the appropriate programs which correspond to the regional characteristics, and incorporate them into schools to improve children's ability to manage their own health management.

  • PDF

Self Production of Radioisotope and Radiopharmaceuticals Divider (방사성동위원소 및 방사성의약품 분주장치의 자체제작)

  • Hong, Sung-Tack;Park, Kwang-Seo;Kim, Seok-Ki;Won, Woo-Jae
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.14 no.2
    • /
    • pp.177-180
    • /
    • 2010
  • Purpose: As PET test came to be covered by the pay system of medical insurance (July 1, 2006) and the needs for it becoming increased for laboratory purpose, it became necessary to purchase expensive medical equipments to solve those problems. However, as most of equipments that are operated by cyclotron are very expensive as to amount from tens of millions up to hundreds of millions of won, it is difficult to purchase those equipments from the point of medical organizations. It may be possible to self manufacture those equipments with least costs if their parts functions that meets the operators demands. The Nuclear Medicine department of National Cancer Center (NCC) is trying to manufacture and use equipments that can be made with least costs, including introducing 2 medical equipments that can improves the operator's works. Materials and Methods: Example 1: Self production of radioisotope($^{18}F$) divider was fabricated. The NCC's Nuclear Medicine department acquired one acrylic panel, seven 3-way valve, tubing etc. that can be found in the market to make the main body of divider in cooperation with biomedical engineering, and placed them inside hot cell, and installed switching box outside of hot cell to make it possible to control them from outside. This main body of divider were placed in radioisotope transfer line that are manufactured in the cyclotron. Example 2: Self production of $^{18}F$-FDG automated divider was fabricated. The NCC's Nuclear Medicine department used cavro pump syringe that consists the main body of divider in cooperation with biomedical engineering, biomedical engineering developed programs that divides a certain amount. $^{18}F$-FDG automated divider is placed inside hot cell, and cable chords were used in the equipment, and then it was connected to PC outside hot cell to make it possible to control the $^{18}F$-FDG automated divider. Results: From the NCC's Nuclear Medicine department tests that were carried out from March, 2007 until now, we found out that radioisotope can be sent to radiopharmaceuticals composite module we want, and from the tests that are carried out at NCC's Nuclear Medicine department using $^{18}F$-FDG automated divider since August, 2009 it was possible to distribute radiopharmaceuticals into vial intended. Conclusion: Through the two examples above, we found out that costs can be reduced by self manufacturing expensive equipments from NCC's cyclotron room with least costs. Also, it decreased radiation exposure dose on workers, and set up problem solving processes in cooperation with lots of parties related.

  • PDF

Exploring Influence of Network Structure, Organizational Learning Culture, and Knowledge Management Participation on Individual Creativity and Performance: Comparison of SI Proposal Team and R&D Team (네트워크 구조와 조직학습문화, 지식경영참여가 개인창의성 및 성과에 미치는 영향에 관한 실증분석: SI제안팀과 R&D팀의 비교연구)

  • Lee, Kun-Chang;Seo, Young-Wook;Chae, Seong-Wook;Song, Seok-Woo
    • Asia pacific journal of information systems
    • /
    • v.20 no.4
    • /
    • pp.101-123
    • /
    • 2010
  • Recently, firms are operating a number of teams to accomplish organizational performance. Especially, ad hoc teams like proposal preparation team are quite different from permanent teams like R&D team in the sense of how the team forms network structure and deals with organizational learning culture and knowledge management participation efforts. Moreover, depending on the team characteristics, individual creativity will differ from each other, which will lead to organizational performance eventually. Previous studies in the field of creativity are lacking in this issue. So main objectives of this study are organized as follows. First, the issue of how to improve individual creativity and organizational performance will be analyzed empirically. This issue will be performed depending on team characteristics such as ad hoc team and permanent team. Antecedents adopted for this research objective are cultural and knowledge factors such as organizational learning culture, and knowledge management participation. Second, the network structure such as degree centrality, and structural hole is used to analyze its influence on individual creativity and organizational performance. SI (System Integration) companies are facing severely tough requirements from clients to submit very creative proposals. Also, R&D teams are widely accepted as relatively creative teams because their responsibilities are focused on suggesting innovative techniques to make their companies remain competitive in the market. SI teams are usually ad hoc, while R&D teams are permanent on an average. By taking advantage of these characteristics of the two kinds of teams, we will prove the validity of the proposed research questions. To obtain the survey data, we accessed 7 SI teams (74 members), and 6 R&D teams (63 members), collecting 137 valid questionnaires. PLS technique was applied to analyze the survey data. Results are as follows. First, in case of SI teams, organizational learning culture affects individual creativity significantly. Meanwhile, knowledge management participation has a significant influence on Individual creativity for the permanent teams. Second, degree centrality Influences individual creativity significantly in case of SI teams. This is comparable with the fact that structural hole has a significant impact on individual creativity for the R&D teams. Practical implications can be summarized as follows: First, network structure of ad hoc team should be designed differently from one of permanent team. Ad hoc team is supposed to show a high creativity in a rather short period, implying that network density among team members should be improved, and those members with high degree centrality should be encouraged to show their Individual creativity and take a leading role by allowing them to get heavily engaged in knowledge sharing and diffusion. In contrast, permanent team should be designed to take advantage of structural hole instead of focusing on network density. Since structural hole can be utilized very effectively in the permanent team, strong arbitrators' merits in the permanent team will increase and therefore helps increase both network efficiency and effectiveness too. In this way, individual creativity in the permanent team is likely to lead to organizational creativity in a seamless way. Second, way of Increasing individual creativity should be sought from the perspective of organizational culture and knowledge management. Organization is supposed to provide a cultural atmosphere in which Innovative idea suggestions and active discussion among team members are encouraged. In this way, trust builds up among team members, facilitating the formation of organizational learning culture. Third, in the ad hoc team, organizational looming culture should be built such a way that individual creativity can grow up fast in a rather short period. Since time is tight, reasonable compensation policy, leader's Initiatives, and learning culture formation should be done In a short period so that mutual trust is built among members quickly, and necessary knowledge and information can be learnt rapidly. Fourth, in the permanent team, it should be kept in mind that the degree of participation in knowledge management determines level of Individual creativity. Therefore, the team ought to facilitate knowledge circulation process such as knowledge creation, storage, sharing, utilization, and learning among team members, which will lead to team performance. In this way, firms must control knowledge networks in permanent team and ad hoc team in a way mentioned above so that individual creativity as well as team performance can be maximized.

A Study on Automatic Classification Model of Documents Based on Korean Standard Industrial Classification (한국표준산업분류를 기준으로 한 문서의 자동 분류 모델에 관한 연구)

  • Lee, Jae-Seong;Jun, Seung-Pyo;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.221-241
    • /
    • 2018
  • As we enter the knowledge society, the importance of information as a new form of capital is being emphasized. The importance of information classification is also increasing for efficient management of digital information produced exponentially. In this study, we tried to automatically classify and provide tailored information that can help companies decide to make technology commercialization. Therefore, we propose a method to classify information based on Korea Standard Industry Classification (KSIC), which indicates the business characteristics of enterprises. The classification of information or documents has been largely based on machine learning, but there is not enough training data categorized on the basis of KSIC. Therefore, this study applied the method of calculating similarity between documents. Specifically, a method and a model for presenting the most appropriate KSIC code are proposed by collecting explanatory texts of each code of KSIC and calculating the similarity with the classification object document using the vector space model. The IPC data were collected and classified by KSIC. And then verified the methodology by comparing it with the KSIC-IPC concordance table provided by the Korean Intellectual Property Office. As a result of the verification, the highest agreement was obtained when the LT method, which is a kind of TF-IDF calculation formula, was applied. At this time, the degree of match of the first rank matching KSIC was 53% and the cumulative match of the fifth ranking was 76%. Through this, it can be confirmed that KSIC classification of technology, industry, and market information that SMEs need more quantitatively and objectively is possible. In addition, it is considered that the methods and results provided in this study can be used as a basic data to help the qualitative judgment of experts in creating a linkage table between heterogeneous classification systems.

A Document Collection Method for More Accurate Search Engine (정확도 높은 검색 엔진을 위한 문서 수집 방법)

  • Ha, Eun-Yong;Gwon, Hui-Yong;Hwang, Ho-Yeong
    • The KIPS Transactions:PartA
    • /
    • v.10A no.5
    • /
    • pp.469-478
    • /
    • 2003
  • Internet information search engines using web robots visit servers conneted to the Internet periodically or non-periodically. They extract and classify data collected according to their own method and construct their database, which are the basis of web information search engines. There procedure are repeated very frequently on the Web. Many search engine sites operate this processing strategically to become popular interneet portal sites which provede users ways how to information on the web. Web search engine contacts to thousands of thousands web servers and maintains its existed databases and navigates to get data about newly connected web servers. But these jobs are decided and conducted by search engines. They run web robots to collect data from web servers without knowledge on the states of web servers. Each search engine issues lots of requests and receives responses from web servers. This is one cause to increase internet traffic on the web. If each web server notify web robots about summary on its public documents and then each web robot runs collecting operations using this summary to the corresponding documents on the web servers, the unnecessary internet traffic is eliminated and also the accuracy of data on search engines will become higher. And the processing overhead concerned with web related jobs on web servers and search engines will become lower. In this paper, a monitoring system on the web server is designed and implemented, which monitors states of documents on the web server and summarizes changes of modified documents and sends the summary information to web robots which want to get documents from the web server. And an efficient web robot on the web search engine is also designed and implemented, which uses the notified summary and gets corresponding documents from the web servers and extracts index and updates its databases.