• Title/Summary/Keyword: information classification

Search Result 8,309, Processing Time 0.04 seconds

Changes in Korean Consumers' Perception on Food Preservatives by a Risk Communication Booklet

  • Kim, Suna;Kim, Ji-Sun;Kang, Hee-Jin;Lee, Gunyoung;Lim, Ho Soo;Yun, Sang Soon;Kim, Jeong-Weon
    • Journal of Food Hygiene and Safety
    • /
    • v.33 no.6
    • /
    • pp.417-426
    • /
    • 2018
  • Food preservatives are very important food additives for the biological and chemical safety of processed foods. The purposes of this study were to investigate Korean consumer's perception and information needs on food preservatives, to develop an educational booklet as a risk communication material on food preservatives, and to assess the educational effect of the developed booklet. To understand perception on food preservatives, a self-administered questionnaire survey was conducted by 381 parents having elementary school students at Seoul and Geoynggi area in Korea. Based on the survey results, brain storming of the authors along with consultation from the professionals, we developed a risk communication booklet about food preservatives. It was exposed to 35 parents of elementary school children, and their evaluation was collected by using a questionnaire and analyzed statistically. Respondents considered food safety (44.8%) as the most important factor while purchasing processed foods. They still perceived food additives as the most hazardous one (41.5%), and among those, food preservatives were the most concerned (45.9%). Total 67.7% of the respondents considered the consumption of food preservatives as hazardous or very hazardous. However, 90.6% of respondents did not have any educational experience about food additives and food preservatives. Based on their information needs, a science-based booklet consisting of the definition, classification, safety, intake, and management of food preservatives was developed. When the booklet titled as 'Food preservatives, Just Know Them!' was exposed to the parents via elementary school teacher, their negative perceptions on food additives and food preservatives were changed positively by increasing the understanding level on preservatives from 18.9% to 90.9% and obtaining 72.7% positive answers on their safety. Therefore, it could be used as an effective risk communication material on food preservatives.

The Effect of Data Size on the k-NN Predictability: Application to Samsung Electronics Stock Market Prediction (데이터 크기에 따른 k-NN의 예측력 연구: 삼성전자주가를 사례로)

  • Chun, Se-Hak
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.239-251
    • /
    • 2019
  • Statistical methods such as moving averages, Kalman filtering, exponential smoothing, regression analysis, and ARIMA (autoregressive integrated moving average) have been used for stock market predictions. However, these statistical methods have not produced superior performances. In recent years, machine learning techniques have been widely used in stock market predictions, including artificial neural network, SVM, and genetic algorithm. In particular, a case-based reasoning method, known as k-nearest neighbor is also widely used for stock price prediction. Case based reasoning retrieves several similar cases from previous cases when a new problem occurs, and combines the class labels of similar cases to create a classification for the new problem. However, case based reasoning has some problems. First, case based reasoning has a tendency to search for a fixed number of neighbors in the observation space and always selects the same number of neighbors rather than the best similar neighbors for the target case. So, case based reasoning may have to take into account more cases even when there are fewer cases applicable depending on the subject. Second, case based reasoning may select neighbors that are far away from the target case. Thus, case based reasoning does not guarantee an optimal pseudo-neighborhood for various target cases, and the predictability can be degraded due to a deviation from the desired similar neighbor. This paper examines how the size of learning data affects stock price predictability through k-nearest neighbor and compares the predictability of k-nearest neighbor with the random walk model according to the size of the learning data and the number of neighbors. In this study, Samsung electronics stock prices were predicted by dividing the learning dataset into two types. For the prediction of next day's closing price, we used four variables: opening value, daily high, daily low, and daily close. In the first experiment, data from January 1, 2000 to December 31, 2017 were used for the learning process. In the second experiment, data from January 1, 2015 to December 31, 2017 were used for the learning process. The test data is from January 1, 2018 to August 31, 2018 for both experiments. We compared the performance of k-NN with the random walk model using the two learning dataset. The mean absolute percentage error (MAPE) was 1.3497 for the random walk model and 1.3570 for the k-NN for the first experiment when the learning data was small. However, the mean absolute percentage error (MAPE) for the random walk model was 1.3497 and the k-NN was 1.2928 for the second experiment when the learning data was large. These results show that the prediction power when more learning data are used is higher than when less learning data are used. Also, this paper shows that k-NN generally produces a better predictive power than random walk model for larger learning datasets and does not when the learning dataset is relatively small. Future studies need to consider macroeconomic variables related to stock price forecasting including opening price, low price, high price, and closing price. Also, to produce better results, it is recommended that the k-nearest neighbor needs to find nearest neighbors using the second step filtering method considering fundamental economic variables as well as a sufficient amount of learning data.

A Study on the Management of Manhwa Contents Records and Archives (만화기록 관리 방안 연구)

  • Kim, Seon Mi;Kim, Ik Han
    • The Korean Journal of Archival Studies
    • /
    • no.28
    • /
    • pp.35-81
    • /
    • 2011
  • Manhwa is a mass media (to expose all faces of an era such as politics, society, cultures, etc with the methodology of irony, parody, etc). Since the Manhwa records is primary culture infrastructure, it can create the high value-added industry by connecting with fancy, character, game, movie, drama, theme park, advertising business. However, due to lack of active and systematic aquisition system, as precious Manhwa manuscript is being lost every year and the contents hard to preserve such as Manhwa content in the form of electronic records are increasing, the countermeasure of Manhwa contents management is needed desperately. In this study, based on these perceptions, the need of Manhwa records management is examined, and the characteristics and the components of Manhwa records were analyzed. And at the same time, the functions of record management process reflecting the characteristics of Manhwa records were extracted by analyzing various cases of overseas Cartoon Archives. And then, the framework of record-keeping regime was segmented into each of acquisition management service areas and the general Manhwa records archiving strategy, which manages the Manhwa contents records, was established and suggested. The acquired Manhwa content records will secure the context among records and warrant the preservation of records and provide diverse access points by reflecting multi classification and multi-level descriptive element. The Manhwa records completed the intellectual arrangement will be preserved after the conservation in an environment equipped with preservation facilities or preserved using digital format in case of electronic records or when there is potential risk of damaging the records. Since the purpose of the Manhwa records is to use them, the information may be provided to diverse classes of users through the exhibition, the distribution, and the development of archival information content. Since the term of "Manhwa records" is unfamiliar yet and almost no study has been conducted in the perspective of records management, it will be the limit of this study only presenting acquisition strategy, management and service strategy of Manhwa contents and suggesting simple examples. However, if Manhwa records management strategy are possibly introduced practically to Manhwa manuscript repositories through archival approach, it will allow systematic acquisition, preservation, arrangement of Manhwa records and will contribute greatly to form a foundation for future Korean culture contents management.

Considerations of Countermeasure Tasks in the Fields of Forest and Forestry in Korea through Case Study on "The Nagoya Protocol (Access to Genetic Resources and Benefit Sharing)" ("유전자원의 접근과 이익공유(ABS)" 사례연구를 통한 국내 산림·임업분야 대응과제 고찰)

  • Lee, Gwan Gyu;Kim, Jun Soon;Jung, Haw young
    • Journal of Korean Society of Forest Science
    • /
    • v.100 no.3
    • /
    • pp.522-534
    • /
    • 2011
  • The aim of this study is to draw forth the tasks for establishing the right of native biology in Korea through the case study on 'Access on genetic resources and Benefit Sharing'. For this purpose, this study decided on its research subject by selecting Hoodia, on which ABS treaty was made the most recently, through the examination of the representative ABS precedents on plant species. This study analyzed the process background of ABS on Hoodia, and compared & analyzed the ABS procedures of 'Bonn Guidelines' adopted by the 6th Conference of the Parties of the Convention on Biological Diversity in 2002 and Hoodia case. Together with the ABS major issues in common drawn as a result of this analysis, and "Nagoya Protocol" adopted by the 10th Conference of the Parties of the Convention on Biological Diversity, this study intended to shed a light on the impending tasks which Korea faces at present and its role relationship. The research results are as follows: 1. It is required that species habitats should be divided based on biological classification and its subsequent community should be established with the development of infrastructure such as a community's independent production, management and monitoring of bio-species. 2. There needs to be a designation of ABS National Focal Point for sharing of ABS-related general information, boosting of implementation of the relevant convention. 3. There needs to be the establishment of ABS convention system consequent on legislative, administrative, political procedures, and designation of the Competent National Authorities for the provision of the format of Prior Informed Consent (PIC) and Mutually Agreed Terms (MAT) and their contents assessment and confirmation. 4. There should be the establishment of integrated management system of ABS-related research and development of forest biological resources and its relevant research projects. 5. There should be information development through the distribution of responsibility and role between the ministries and offices concerned according to bio-resources, and there needs to be efforts in aiming for opening a working group of academic-industrial institutions for developing a mutually interchangeable system. 6. It's required that the efficient access between industrial circles and the people should be promoted by setting up ABS support center of biological resources in ministry and office's charge. 7. There should be a selection of a national supervisory organization for securement of the right of a local community and monitoring of ABS convention implementation, and a countermeasure system for preventing outflow of forest bioresources. Conclusively, it's judged that it will be possible to inquire into the countermeasures for the establishment of the native forest biology dominion through such research results.

Design and Implementation of a Similarity based Plant Disease Image Retrieval using Combined Descriptors and Inverse Proportion of Image Volumes (Descriptor 조합 및 동일 병명 이미지 수량 역비율 가중치를 적용한 유사도 기반 작물 질병 검색 기술 설계 및 구현)

  • Lim, Hye Jin;Jeong, Da Woon;Yoo, Seong Joon;Gu, Yeong Hyeon;Park, Jong Han
    • The Journal of Korean Institute of Next Generation Computing
    • /
    • v.14 no.6
    • /
    • pp.30-43
    • /
    • 2018
  • Many studies have been carried out to retrieve images using colors, shapes, and textures which are characteristic of images. In addition, there is also progress in research related to the disease images of the crop. In this paper, to be a help to identify the disease occurred in crops grown in the agricultural field, we propose a similarity-based crop disease search system using the diseases image of horticulture crops. The proposed system improves the similarity retrieval performance compared to existing ones through the combination descriptor without using a single descriptor and applied the weight based calculation method to provide users with highly readable similarity search results. In this paper, a total of 13 Descriptors were used in combination. We used to retrieval of disease of six crops using a combination Descriptor, and a combination Descriptor with the highest average accuracy for each crop was selected as a combination Descriptor for the crop. The retrieved result were expressed as a percentage using the calculation method based on the ratio of disease names, and calculation method based on the weight. The calculation method based on the ratio of disease name has a problem in that number of images used in the query image and similarity search was output in a first order. To solve this problem, we used a calculation method based on weight. We applied the test image of each disease name to each of the two calculation methods to measure the classification performance of the retrieval results. We compared averages of retrieval performance for two calculation method for each crop. In cases of red pepper and apple, the performance of the calculation method based on the ratio of disease names was about 11.89% on average higher than that of the calculation method based on weight, respectively. In cases of chrysanthemum, strawberry, pear, and grape, the performance of the calculation method based on the weight was about 20.34% on average higher than that of the calculation method based on the ratio of disease names, respectively. In addition, the system proposed in this paper, UI/UX was configured conveniently via the feedback of actual users. Each system screen has a title and a description of the screen at the top, and was configured to display a user to conveniently view the information on the disease. The information of the disease searched based on the calculation method proposed above displays images and disease names of similar diseases. The system's environment is implemented for use with a web browser based on a pc environment and a web browser based on a mobile device environment.

KB-BERT: Training and Application of Korean Pre-trained Language Model in Financial Domain (KB-BERT: 금융 특화 한국어 사전학습 언어모델과 그 응용)

  • Kim, Donggyu;Lee, Dongwook;Park, Jangwon;Oh, Sungwoo;Kwon, Sungjun;Lee, Inyong;Choi, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.2
    • /
    • pp.191-206
    • /
    • 2022
  • Recently, it is a de-facto approach to utilize a pre-trained language model(PLM) to achieve the state-of-the-art performance for various natural language tasks(called downstream tasks) such as sentiment analysis and question answering. However, similar to any other machine learning method, PLM tends to depend on the data distribution seen during the training phase and shows worse performance on the unseen (Out-of-Distribution) domain. Due to the aforementioned reason, there have been many efforts to develop domain-specified PLM for various fields such as medical and legal industries. In this paper, we discuss the training of a finance domain-specified PLM for the Korean language and its applications. Our finance domain-specified PLM, KB-BERT, is trained on a carefully curated financial corpus that includes domain-specific documents such as financial reports. We provide extensive performance evaluation results on three natural language tasks, topic classification, sentiment analysis, and question answering. Compared to the state-of-the-art Korean PLM models such as KoELECTRA and KLUE-RoBERTa, KB-BERT shows comparable performance on general datasets based on common corpora like Wikipedia and news articles. Moreover, KB-BERT outperforms compared models on finance domain datasets that require finance-specific knowledge to solve given problems.

Development of a water quality prediction model for mineral springs in the metropolitan area using machine learning (머신러닝을 활용한 수도권 약수터 수질 예측 모델 개발)

  • Yeong-Woo Lim;Ji-Yeon Eom;Kee-Young Kwahk
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.307-325
    • /
    • 2023
  • Due to the prolonged COVID-19 pandemic, the frequency of people who are tired of living indoors visiting nearby mountains and national parks to relieve depression and lethargy has exploded. There is a place where thousands of people who came out of nature stop walking and breathe and rest, that is the mineral spring. Even in mountains or national parks, there are about 600 mineral springs that can be found occasionally in neighboring parks or trails in the metropolitan area. However, due to irregular and manual water quality tests, people drink mineral water without knowing the test results in real time. Therefore, in this study, we intend to develop a model that can predict the quality of the spring water in real time by exploring the factors affecting the quality of the spring water and collecting data scattered in various places. After limiting the regions to Seoul and Gyeonggi-do due to the limitations of data collection, we obtained data on water quality tests from 2015 to 2020 for about 300 mineral springs in 18 cities where data management is well performed. A total of 10 factors were finally selected after two rounds of review among various factors that are considered to affect the suitability of the mineral spring water quality. Using AutoML, an automated machine learning technology that has recently been attracting attention, we derived the top 5 models based on prediction performance among about 20 machine learning methods. Among them, the catboost model has the highest performance with a prediction classification accuracy of 75.26%. In addition, as a result of examining the absolute influence of the variables used in the analysis through the SHAP method on the prediction, the most important factor was whether or not a water quality test was judged nonconforming in the previous water quality test. It was confirmed that the temperature on the day of the inspection and the altitude of the mineral spring had an influence on whether the water quality was unsuitable.

Digital Archives of Cultural Archetype Contents: Its Problems and Direction (디지털 아카이브즈의 문제점과 방향 - 문화원형 콘텐츠를 중심으로 -)

  • Hahm, Han-Hee;Park, Soon-Cheol
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.17 no.2
    • /
    • pp.23-42
    • /
    • 2006
  • This is a study of the digital archives of Culturecontent.com where 'Cultural Archetype Contents' are currently in service. One of the major purposes of our study is to point out problems in the current system and eventually propose improvements to the digital archives. The government launched a four-year project for developing the cultural archetype content sources and establishing its related business with the hope of enhancing the nation's competitiveness. More specifically, the project focuses on the production of source materials of cultural archetype contents in the subjects of Korea's history. tradition, everyday life. arts and general geographical books. In addition, through this project, the government also intends to establish a proper distribution system of digitalized culture contents and to control copyright issues. This paper analyzes the digital archives system that stores the culture content data that have been produced from 2002 to 2005 and evaluates the current system's weaknesses and strengths. The summary of our findings is as follows. First. the digital archives system does not contain a semantic search engine and therefore its full function is 1agged. Second, similar data is not classified into the same categories but into the different ones, thereby confusing and inconveniencing users. Users who want to find source materials could be disappointed by the current distributive system. Our paper suggests a better system of digital archives with text mining technology which consists of five significant intelligent process-keyword searches, summarization, clustering, classification and topic tracking. Our paper endeavors to develop the best technical environment for preserving and using culture contents data. With the new digitalized upgraded settings, users of culture contents data will discover a world of new knowledge. The technology we introduce in this paper will lead to the highest achievable digital intelligence through a new framework.

Predicting the Potential Habitat and Future Distribution of Brachydiplax chalybea flavovittata Ris, 1911 (Odonata: Libellulidae) (기후변화에 따른 남색이마잠자리 잠재적 서식지 및 미래 분포예측)

  • Soon Jik Kwon;Yung Chul Jun;Hyeok Yeong Kwon;In Chul Hwang;Chang Su Lee;Tae Geun Kim
    • Journal of Wetlands Research
    • /
    • v.25 no.4
    • /
    • pp.335-344
    • /
    • 2023
  • Brachydiplax chalybea flavovittata, a climate-sensitive biological indicator species, was first observed and recorded at Jeju Island in Korea in 2010. Overwintering was recently confirmed in the Yeongsan River area. This study was aimed to predict the potential distribution patterns for the larvae of B. chalybea flavovittata and to understand its ecological characteristics as well as changes of population under global climate change circumstances. Data was collected both from the Global Biodiversity Information Facility (GBIF) and by field surveys from May 2019 to May 2023. We used for the distribution model among downloaded 19 variables from the WorldClim database. MaxEnt model was adopted for the prediction of potential and future distribution for B. chalybea flavovittata. Larval distribution ranged within a region delimited by northern latitude from Jeju-si, Jeju Special Self-Governing Province (33.318096°) to Yeoju-si, Gyeonggi-do (37.366734°) and eastern longitude from Jindo-gun, Jeollanam-do (126.054925°) to Yangsan-si, Gyeongsangnam-do (129.016472°). M type (permanent rivers, streams and creeks) wetlands were the most common habitat based on the Ramsar's wetland classification system, followed by Tp type (permanent freshwater marshes and pools) (45.8%) and F type (estuarine waters) (4.2%). MaxEnt model presented that potential distribution with high inhabiting probability included Ulsan and Daegu Metropolitan City in addition to the currently discovered habitats. Applying to the future scenarios by Intergovernmental Panel on Climate Change (IPCC), it was predicted that the possible distribution area would expand in the 2050s and 2090s, covering the southern and western coastal regions, the southern Daegu metropolitan area and the eastern coastal regions in the near future. This study suggests that B. chalybea flavovittata can be used as an effective indicator species for climate changes with a monitoring of their distribution ranges. Our findings will also help to provide basic information on the conservation and management of co-existing native species.

Performance Evaluation of Monitoring System for Sargassum horneri Using GOCI-II: Focusing on the Results of Removing False Detection in the Yellow Sea and East China Sea (GOCI-II 기반 괭생이모자반 모니터링 시스템 성능 평가: 황해 및 동중국해 해역 오탐지 제거 결과를 중심으로)

  • Han-bit Lee;Ju-Eun Kim;Moon-Seon Kim;Dong-Su Kim;Seung-Hwan Min;Tae-Ho Kim
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_2
    • /
    • pp.1615-1633
    • /
    • 2023
  • Sargassum horneri is one of the floating algae in the sea, which breeds in large quantities in the Yellow Sea and East China Sea and then flows into the coast of Republic of Korea, causing various problems such as destroying the environment and damaging fish farms. In order to effectively prevent damage and preserve the coastal environment, the development of Sargassum horneri detection algorithms using satellite-based remote sensing technology has been actively developed. However, incorrect detection information causes an increase in the moving distance of ships collecting Sargassum horneri and confusion in the response of related local governments or institutions,so it is very important to minimize false detections when producing Sargassum horneri spatial information. This study applied technology to automatically remove false detection results using the GOCI-II-based Sargassum horneri detection algorithm of the National Ocean Satellite Center (NOSC) of the Korea Hydrographic and Oceanography Agency (KHOA). Based on the results of analyzing the causes of major false detection results, it includes a process of removing linear and sporadic false detections and green algae that occurs in large quantities along the coast of China in spring and summer by considering them as false detections. The technology to automatically remove false detection was applied to the dates when Sargassum horneri occurred from February 24 to June 25, 2022. Visual assessment results were generated using mid-resolution satellite images, qualitative and quantitative evaluations were performed. Linear false detection results were completely removed, and most of the sporadic and green algae false detection results that affected the distribution were removed. Even after the automatic false detection removal process, it was possible to confirm the distribution area of Sargassum horneri compared to the visual assessment results, and the accuracy and precision calculated using the binary classification model averaged 97.73% and 95.4%, respectively. Recall value was very low at 29.03%, which is presumed to be due to the effect of Sargassum horneri movement due to the observation time discrepancy between GOCI-II and mid-resolution satellite images, differences in spatial resolution, location deviation by orthocorrection, and cloud masking. The results of this study's removal of false detections of Sargassum horneri can determine the spatial distribution status in near real-time, but there are limitations in accurately estimating biomass. Therefore, continuous research on upgrading the Sargassum horneri monitoring system must be conducted to use it as data for establishing future Sargassum horneri response plans.