• Title/Summary/Keyword: library open data quality

Computer Vision Approach for Phenotypic Characterization of Horticultural Crops (컴퓨터 비전을 활용한 토마토, 파프리카, 멜론 및 오이 작물의 표현형 특성화)

  • Seungri Yoon;Minju Shin;Jin Hyun Kim;Ho Jeong Jeong;Junyoung Park;Tae In Ahn
    • Journal of Bio-Environment Control / v.33 no.1 / pp.63-70 / 2024
  • This study explored computer vision methods using the OpenCV open-source library to characterize the phenotypes of various horticultural crops. In the case of tomatoes, image color was examined to assess ripeness, while support vector machine (SVM) and histogram of oriented gradients (HOG) methods effectively identified ripe tomatoes. For sweet pepper, we visualized the color distribution and used the Gaussian mixture model for clustering to analyze its post-harvest color characteristics. For the quality assessment of netted melons, the LAB (lightness, a, b) color space, binary images, and depth mapping were used to measure the net patterns of the melon. In addition, a combination of depth and color data proved successful in identifying flowers of different sizes and distances in cucumber greenhouses. This study highlights the effectiveness of these computer vision strategies in monitoring the growth and development, ripening, and quality assessment of fruits and vegetables. For broader applications in agriculture, future researchers and developers should enhance these techniques with plant physiological indicators to promote their adoption in both research and practical agricultural settings.
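
The color-based ripeness check described for tomatoes can be illustrated with a minimal sketch. This is not the authors' OpenCV pipeline: it swaps in Python's standard `colorsys` module, and the function names, hue band, and thresholds below are all assumptions chosen only for illustration.

```python
import colorsys

def red_fraction(pixels, sat_min=0.4, val_min=0.2):
    """Return the share of pixels whose hue falls in an assumed red band.

    pixels: iterable of (r, g, b) tuples with 0-255 channels.
    sat_min / val_min are illustrative thresholds, not values from the study.
    """
    red = total = 0
    for r, g, b in pixels:
        h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
        total += 1
        # Red wraps around the hue circle, so accept hues near 0 or near 1.
        if s >= sat_min and v >= val_min and (h <= 0.05 or h >= 0.95):
            red += 1
    return red / total if total else 0.0

def ripeness_label(pixels, threshold=0.6):
    """Label a fruit region 'ripe' when enough of its pixels are red (assumed rule)."""
    return "ripe" if red_fraction(pixels) >= threshold else "unripe"
```

In practice the pixel list would come from a segmented fruit region, and a learned classifier such as the HOG and SVM combination named in the abstract would replace the fixed threshold.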

Estimation of Pollutant Sources in Dangjin Coal-Fired Power Plant Using Carbon Isotopes (탄소 안정동위원소를 이용한 석탄화력발전소 인근 오염원 기원 추정 : 당진시를 중심으로)

  • Yoon, Soohyang;Cho, Bong-Yeon
    • The Journal of the Korea Contents Association / v.21 no.3 / pp.567-575 / 2021
  • Residents of Dangjin, South Chungcheong Province, where large-scale emission facilities such as coal-fired power plants and steel mills are concentrated, remain highly concerned about their health despite the local government's aggressive efforts to improve air quality and reduce greenhouse gases. To understand the impact of coal-fired power plants and external factors on local air pollution, the origins of local pollutants were investigated using stable carbon isotopes, which are generally used as tracers of the provenance of fine or ultrafine dust. The origins of the pollutants were analyzed with back trajectory analysis and with a data library built using seasonal measurements from two separate locations, selected by their distance from the coal-fired power plant, and the analysis of previous studies. The stable isotope ratio analysis showed that concentrations tended to be highest in winter, followed by spring, fall, and summer (winter > spring > fall > summer). According to the data matching with the library, mobile pollution sources and open-air incineration had a relatively higher impact on local air pollution. As a pilot study, this work should be followed by continuous monitoring and data accumulation to secure the reliability of its results.

A Study on Automated Input of Attribute for Referenced Objects in Spatial Relationships of HD Map (정밀도로지도 공간관계 참조객체의 속성 입력 자동화에 관한 연구)

  • Dong-Gi SUNG;Seung-Hyun MIN;Yun-Soo CHOI;Jong-Min OH
    • Journal of the Korean Association of Geographic Information Studies / v.27 no.1 / pp.29-40 / 2024
  • Autonomous driving, one of the core technologies of the fourth industrial revolution, is developing rapidly, but sensor-based autonomous driving is showing limitations, such as accidents in unexpected situations. To compensate for this, the HD map is being used as a core infrastructure for autonomous driving, interest in the public and private sectors is increasing, and various studies and technology developments are being conducted to keep HD maps up to date and accurate. Currently, NGII is building new HD maps for urban areas and major roads across the country, including the metropolitan area, where self-driving cars are expected to run, and is working to minimize data error rates through quality verification. Therefore, this study analyzes the spatial relationships of reference objects in the attribute structuring process to support rapid and accurate renewal and production of the HD map under construction by NGII. The attribute input automation methodology for reference objects with established spatial relations was applied using the open-source PyQGIS library, and target sites were selected for each road type: high-speed national highways, general national highways, and C-ITS demonstration sections. With the attribute automation tool developed in this study, it took about 2 to 5 minutes per target site to automatically input the attributes of the spatial relationship reference objects. The automated attribute input achieved an accuracy of 86.4% for high-speed national highways, 79.7% for general national highways, and 82.4% for C-ITS sections, or 82.8% on average.
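
The idea of filling in reference-object attributes from spatial relationships can be sketched in plain Python. This is not the study's PyQGIS tool: it substitutes a simple nearest-neighbour rule between point features, and the record schema (`id`, `x`, `y`, `ref_id`), the `max_dist` cutoff, and both function names are hypothetical.

```python
import math

def fill_attributes(targets, references, max_dist=5.0):
    """Copy the id of the nearest reference object onto each target.

    targets / references: lists of dicts with 'id', 'x', 'y' keys
    (hypothetical schema). A target with no reference within max_dist
    keeps ref_id = None so it can be reviewed by hand.
    """
    filled = []
    for t in targets:
        best_id, best_d = None, max_dist
        for r in references:
            d = math.hypot(t["x"] - r["x"], t["y"] - r["y"])
            if d <= best_d:
                best_id, best_d = r["id"], d
        filled.append({**t, "ref_id": best_id})
    return filled

def accuracy(filled, truth):
    """Share of targets whose filled ref_id matches a hand-checked value."""
    hits = sum(1 for f in filled if truth.get(f["id"]) == f["ref_id"])
    return hits / len(filled) if filled else 0.0
```

An accuracy figure of the kind reported above would come from comparing the automated output with manually verified attributes. The reported 82.8% average is consistent with the unweighted mean of the three road-type figures, (86.4 + 79.7 + 82.4) / 3 ≈ 82.8.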

KOMUChat: Korean Online Community Dialogue Dataset for AI Learning (KOMUChat : 인공지능 학습을 위한 온라인 커뮤니티 대화 데이터셋 연구)

  • YongSang Yoo;MinHwa Jung;SeungMin Lee;Min Song
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.219-240 / 2023
  • Conversational AI that users can interact with satisfactorily is a long-standing research topic. Developing conversational AI requires training data that reflects real conversations between people, but current Korean datasets either are not in question-answer format or use honorifics, which makes it difficult for users to feel a sense of closeness. In this paper, we propose a conversation dataset (KOMUChat) consisting of 30,767 question-answer sentence pairs collected from online communities. The question-answer pairs were collected from the post titles and first comments of love and relationship counseling boards used by men and women. In addition, we removed abusive records through automatic and manual cleansing to build a high-quality dataset. To verify the validity of KOMUChat, we compared and analyzed the results of generative language models trained on KOMUChat and on a benchmark dataset. The results showed that our dataset outperformed the benchmark dataset in terms of answer appropriateness, user satisfaction, and fulfillment of conversational AI goals. The dataset is the largest open-source single-turn text dataset presented so far, and it is significant in that it builds a friendlier Korean dataset by reflecting the text styles of the online community.
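
The pairing and cleansing steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the post schema (`title`, `comments`), the `build_pairs` name, and the substring blocklist standing in for the automatic abuse filter are all hypothetical.

```python
def build_pairs(posts, blocklist):
    """Pair each post title with its first comment, dropping flagged text.

    posts: list of dicts with 'title' and 'comments' keys (hypothetical schema).
    blocklist: set of lowercase terms standing in for an abuse filter.
    """
    pairs = []
    for post in posts:
        comments = post.get("comments") or []
        if not comments:
            continue  # no answer available for this question
        question, answer = post["title"].strip(), comments[0].strip()
        if not question or not answer:
            continue  # skip empty titles or empty first comments
        text = f"{question} {answer}".lower()
        if any(term in text for term in blocklist):
            continue  # automatic cleansing; manual review would follow
        pairs.append({"question": question, "answer": answer})
    return pairs
```

Only the first comment is used, which yields single-turn pairs; later comments are ignored in this sketch.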

A research on the Construction and Sharing of Authority Record-focusing on the Case of Social Networks and Archival Context Project (전거레코드 구축 및 공유에 관한 연구 SNAC 프로젝트 사례를 중심으로)

  • Lee, Eun Yeong
    • The Korean Journal of Archival Studies / no.71 / pp.49-89 / 2022
  • Drawing on the case of the "Social Networks and Archival Context" (SNAC) project, this study argues for the necessity of, and proposes a domestic application plan for, a national authority database that promotes integrated access to, richer searching of, and better understanding of historical information sources and archival resources distributed among cultural heritage institutions. As the SNAC project was transformed into an international cooperative organization led by NARA, it was able to secure a sustainable operating system and realize cooperative authority control. In addition, SNAC authority records provide richer contextual information about life and history, as well as social and intellectual network information, than library authority records do. The case analysis leads to three suggestions. First, as with SNAC, a cooperative body led by the National Archives, with joint ownership by the National Library of Korea, should lead the development and expand the scope of participating institutions. Second, the cooperative method should adopt a structure in which work is divided by each institution's field of special strength, while major decisions are made through an administrative team in which the two organizations participate. Third, the platform requires scalable open-source software that can collect technical information in various formats when authority data are constructed, a design based on the structure and elements of archival authority records, functions to control the quality of authority records, user-friendly interfaces, and a design that reflects content elements.