• Title/Summary/Keyword: Pre Processing

Search Result 1,998, Processing Time 0.029 seconds

Implementation of CNN-based Classification Training Model for Unstructured Fashion Image Retrieval using Preprocessing with MASK R-CNN (비정형 패션 이미지 검색을 위한 MASK R-CNN 선형처리 기반 CNN 분류 학습모델 구현)

  • Seunga, Cho;Hayoung, Lee;Hyelim, Jang;Kyuri, Kim;Hyeon-Ji, Lee;Bong-Ki, Son;Jaeho, Lee
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.6
    • /
    • pp.13-23
    • /
    • 2022
  • In this paper, we propose a detailed component image classification algorithm by fashion item for unstructured data retrieval in the fashion field. Due to the COVID-19 environment, AI-based online shopping malls are increasing recently. However, there is a limit to accurate unstructured data search with existing keyword search and personalized style recommendations based on user surfing behavior. In this study, pre-processing using Mask R-CNN was conducted using images crawled from online shopping sites and then classified components for each fashion item through CNN. We obtain the accuaracy for collar of the shirt's as 93.28%, the pattern of the shirt as 98.10%, the 3 classese fit of the jeans as 91.73%, And, we further obtained one for the 4 classes fit of jeans as 81.59% and the color of the jeans as 93.91%. At the results for the decorated items, we also obtained the accuract of the washing of the jeans as 91.20% and the demage of jeans accuaracy as 92.96%.

Differences of Teachers and Students' Perceptions on Teaching Skills (교사의 수업전문성에 관한 교사와 학생의 인식 차이)

  • Lee, Okhwa
    • Korean Educational Research Journal
    • /
    • v.43 no.1
    • /
    • pp.125-152
    • /
    • 2022
  • The purpose of this study is to examine the differences of perceptions of teachers and students regarding teaching skills. For the analysis, data was collected by ICALT(International Comparative Analysis of Learning and Teaching) class observation tool and students survey called My Teacher Questionnaire. a student survey. The data of teachers and students can be compared because as the two tools have seven common domains(Safe and stimulating learning climate, Efficient organization, Clear and structured instructions, Intensive and activating teaching, Adjusting instructions and learner processing to inter-learner differences, Teaching learning strategies, Learner engagement). In 2016, in Daejeon, Chungbuk and Chungnam. trained teachers collected data from 106 classes, and 2,866 students responded the survey. The reliability and validity of the two tools, class observation and MTQ(My Teacher Questionnaire) are proven to be satisfactory for use in Korean schools. Students perception on teaching was high, particularly when students are in lower grades and learning major subjects like English, Korean, and math. The domain of higher teaching skills, male students show higher perceptions while female students reported higher perceptions on lower-level teaching skill domains. To compare the perceptions of teachers and students, the predictive reliability of students engagement against teaching skill domains was used. Teachers showed higher predictive reliability on lower teaching skill domains while students showed higher predictive reliability on higher teaching skill domains. It is recommended for further study to develop a professional development model using a teacher class observation tool and the My Teacher Questionnaire for pre-service teachers and school teachers.

A Study on the Application of Suitable Urban Regeneration Project Types Reflecting the Spatial Characteristics of Urban Declining Areas (도시 쇠퇴지역 공간 특성을 반영한 적합 도시재생 사업유형 적용방안 연구)

  • CHO, Don-Cherl;SHIN, Dong-Bin
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.24 no.4
    • /
    • pp.148-163
    • /
    • 2021
  • The diversification of the New Deal urban regeneration projects, that started in 2017 in accordance with the "Special Act on Urban Regeneration Activation and Support", generated the increased demand for the accuracy of data-driven diagnosis and project type forecast. Thus, this research was conducted to develop an application model able to identify the most appropriate New Deal project type for "eup", "myeon" and "dong" across the country. Data for application model development were collected through Statistical geographic information service(SGIS) and the 'Urban Regeneration Comprehensive Information Open System' of the Urban Regeneration Information System, and data for the analysis model was constructed through data pre-processing. Four models were derived and simulations were performed through polynomial regression analysis and multinomial logistic regression analysis for the application of the appropriate New Deal project type. I verified the applicability and validity of the four models by the comparative analysis of spatial distribution of the previously selected New Deal projects by targeting the sites located in Seoul by each model and the result showed that the DI-54 model had the highest concordance rate.

Design and Implementation of User-Level FileSystem in the Combat Management System

  • Kang, Seok-Hyun;Kim, Keun-Hee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.12
    • /
    • pp.9-16
    • /
    • 2022
  • In this paper, we propose a plan to design and utilize the RDBS(Record Block Data file management System) so that data can be recovered when data files in the Combat Management System are mismatched. The CMS(Combat Management System) manages the same files in multiple IPN(Infomation Processing Node) repositories to support multiplexing. However, mismatches in data files can occur due to equipment maintenance or user immaturity. The existing CMS does not manage the history of changes in data files, and when a mismatch occurs, data file were synchronized based on the latest date. But, It is difficult to say that files with the latest date have the highest reliability, and once the file synchronization has progressed, it cannot be recovered with pre-synchronization data. To solve this problem, data was stored and synchronized in units of record blocks using RDBS proposed in this paper, and the Rsync algorithm was used to reduce the overhead of file synchronization due to units of record blocks. SW applied with RDBS was tested for performance in a simulated environment, and it was confirmed that it could be applied to CMS through normal operation confirmation.

Designing a Blockchain-based Smart Contract for Seafarer Wage Payment (블록체인 기반 선원 임금지불을 위한 스마트 컨트랙트 설계)

  • Yoo, Sang-Lok;Kim, Kwang-Il;Ahn, Jang-Young
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.27 no.7
    • /
    • pp.1038-1043
    • /
    • 2021
  • Guaranteed seafarer wage payment is essential to ensure a stable supply of seafarers. However, disputes over non-payment of wages to seafarers often occur. In this study, an automatic wage payment system was designed using a blockchain-based smart contract to resolve the problem of seafarers' wage arrears. The designed system consists of an information register, a matching processing unit, a review rating management unit, and wage remittance before deploying smart contracts. The matching process was designed to send an automatic notification to seafarers and shipowners if the sum of the weight of the four variables, namely wages, ship type/fishery, position, and license, exceeded a pre-defined threshold. In addition, a review rating management system, based on a combination of mean and median, was presented to serve as a medium to mutually fulfill the normal working conditions. The smart contract automatically fulfills the labor contract between the parties without an intermediary. This system will naturally resolve problems such as fraudulent advance payment to seafarers, embezzlement by unregistered employment agencies, overdue wages, and forgery of seafarers' books. If this system design is commercialized and institutionally activated, it is expected that stable wages will be guaranteed to seafarers, and in turn, the difficulties in human resources supply will be solved. We plan to test it in a local environment for further developing this system.

BEEF MEAT TRACEABILITY. CAN NIRS COULD HELP\ulcorner

  • Cozzolino, D.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1246-1246
    • /
    • 2001
  • The quality of meat is highly variable in many properties. This variability originates from both animal production and meat processing. At the pre-slaughter stage, animal factors such as breed, sex, age contribute to this variability. Environmental factors include feeding, rearing, transport and conditions just before slaughter (Hildrum et al., 1995). Meat can be presented in a variety of forms, each offering different opportunities for adulteration and contamination. This has imposed great pressure on the food manufacturing industry to guarantee the safety of meat. Tissue and muscle speciation of flesh foods, as well as speciation of animal derived by-products fed to all classes of domestic animals, are now perhaps the most important uncertainty which the food industry must resolve to allay consumer concern. Recently, there is a demand for rapid and low cost methods of direct quality measurements in both food and food ingredients (including high performance liquid chromatography (HPLC), thin layer chromatography (TLC), enzymatic and inmunological tests (e.g. ELISA test) and physical tests) to establish their authenticity and hence guarantee the quality of products manufactured for consumers (Holland et al., 1998). The use of Near Infrared Reflectance Spectroscopy (NIRS) for the rapid, precise and non-destructive analysis of a wide range of organic materials has been comprehensively documented (Osborne et at., 1993). Most of the established methods have involved the development of NIRS calibrations for the quantitative prediction of composition in meat (Ben-Gera and Norris, 1968; Lanza, 1983; Clark and Short, 1994). This was a rational strategy to pursue during the initial stages of its application, given the type of equipment available, the state of development of the emerging discipline of chemometrics and the overwhelming commercial interest in solving such problems (Downey, 1994). One of the advantages of NIRS technology is not only to assess chemical structures through the analysis of the molecular bonds in the near infrared spectrum, but also to build an optical model characteristic of the sample which behaves like the “finger print” of the sample. This opens the possibility of using spectra to determine complex attributes of organic structures, which are related to molecular chromophores, organoleptic scores and sensory characteristics (Hildrum et al., 1994, 1995; Park et al., 1998). In addition, the application of statistical packages like principal component or discriminant analysis provides the possibility to understand the optical properties of the sample and make a classification without the chemical information. The objectives of this present work were: (1) to examine two methods of sample presentation to the instrument (intact and minced) and (2) to explore the use of principal component analysis (PCA) and Soft Independent Modelling of class Analogy (SIMCA) to classify muscles by quality attributes. Seventy-eight (n: 78) beef muscles (m. longissimus dorsi) from Hereford breed of cattle were used. The samples were scanned in a NIRS monochromator instrument (NIR Systems 6500, Silver Spring, MD, USA) in reflectance mode (log 1/R). Both intact and minced presentation to the instrument were explored. Qualitative analysis of optical information through PCA and SIMCA analysis showed differences in muscles resulting from two different feeding systems.

  • PDF

Cross-Lingual Style-Based Title Generation Using Multiple Adapters (다중 어댑터를 이용한 교차 언어 및 스타일 기반의 제목 생성)

  • Yo-Han Park;Yong-Seok Choi;Kong Joo Lee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.8
    • /
    • pp.341-354
    • /
    • 2023
  • The title of a document is the brief summarization of the document. Readers can easily understand a document if we provide them with its title in their preferred styles and the languages. In this research, we propose a cross-lingual and style-based title generation model using multiple adapters. To train the model, we need a parallel corpus in several languages with different styles. It is quite difficult to construct this kind of parallel corpus; however, a monolingual title generation corpus of the same style can be built easily. Therefore, we apply a zero-shot strategy to generate a title in a different language and with a different style for an input document. A baseline model is Transformer consisting of an encoder and a decoder, pre-trained by several languages. The model is then equipped with multiple adapters for translation, languages, and styles. After the model learns a translation task from parallel corpus, it learns a title generation task from monolingual title generation corpus. When training the model with a task, we only activate an adapter that corresponds to the task. When generating a cross-lingual and style-based title, we only activate adapters that correspond to a target language and a target style. An experimental result shows that our proposed model is only as good as a pipeline model that first translates into a target language and then generates a title. There have been significant changes in natural language generation due to the emergence of large-scale language models. However, research to improve the performance of natural language generation using limited resources and limited data needs to continue. In this regard, this study seeks to explore the significance of such research.

Abbreviation Disambiguation using Topic Modeling (토픽모델링을 이용한 약어 중의성 해소)

  • Woon-Kyo Lee;Ja-Hee Kim;Junki Yang
    • Journal of the Korea Society for Simulation
    • /
    • v.32 no.1
    • /
    • pp.35-44
    • /
    • 2023
  • In recent, there are many research cases that analyze trends or research trends with text analysis. When collecting documents by searching for keywords in abbreviations for data analysis, it is necessary to disambiguate abbreviations. In many studies, documents are classified by hand-work reading the data one by one to find the data necessary for the study. Most of the studies to disambiguate abbreviations are studies that clarify the meaning of words and use supervised learning. The previous method to disambiguate abbreviation is not suitable for classification studies of documents looking for research data from abbreviation search documents, and related studies are also insufficient. This paper proposes a method of semi-automatically classifying documents collected by abbreviations by going topic modeling with Non-Negative Matrix Factorization, an unsupervised learning method, in the data pre-processing step. To verify the proposed method, papers were collected from academic DB with the abbreviation 'MSA'. The proposed method found 316 papers related to Micro Services Architecture in 1,401 papers. The document classification accuracy of the proposed method was measured at 92.36%. It is expected that the proposed method can reduce the researcher's time and cost due to hand work.

Verification of Ground Subsidence Risk Map Based on Underground Cavity Data Using DNN Technique (DNN 기법을 활용한 지하공동 데이터기반의 지반침하 위험 지도 작성)

  • Han Eung Kim;Chang Hun Kim;Tae Geon Kim;Jeong Jun Park
    • Journal of the Society of Disaster Information
    • /
    • v.19 no.2
    • /
    • pp.334-343
    • /
    • 2023
  • Purpose: In this study, the cavity data found through ground cavity exploration was combined with underground facilities to derive a correlation, and the ground subsidence prediction map was verified based on the AI algorithm. Method: The study was conducted in three stages. The stage of data investigation and big data collection related to risk assessment. Data pre-processing steps for AI analysis. And it is the step of verifying the ground subsidence risk prediction map using the AI algorithm. Result: By analyzing the ground subsidence risk prediction map prepared, it was possible to confirm the distribution of risk grades in three stages of emergency, priority, and general for Busanjin-gu and Saha-gu. In addition, by arranging the predicted ground subsidence risk ratings for each section of the road route, it was confirmed that 3 out of 61 sections in Busanjin-gu and 7 out of 68 sections in Sahagu included roads with emergency ratings. Conclusion: Based on the verified ground subsidence risk prediction map, it is possible to provide citizens with a safe road environment by setting the exploration section according to the risk level and conducting investigation.

Development of Integrated Management System Based on GIS on Soft Ground (GIS 기법을 이용한 연약 지반 시공 관리 시스템의 개발)

  • Chun, Sung-Ho;Woo, Sang-Inn;Chung, Choong-Ki;Choi, In-Gul
    • Journal of the Korean Geotechnical Society
    • /
    • v.23 no.7
    • /
    • pp.37-46
    • /
    • 2007
  • In the practice of preloading method for soft ground improvement, field engineers need information of ground properties, construction works and field monitoring on ground behaviors of the site. So, integrating all these informations into one database can provide more efficient way for managing and utilizing the data for construction management. In this study, integrated system for construction management of ground improvement sites under preloading is developed. The developed system consists of database (DB) and application program. The database contains all collected data in a construction site and processed data in the system with their geographic information. All informations in the database are standardized from the result of data characterization. Application program performs various functions on managing and utilizing information in the database; pre- and post- data processing with graphic visualization of output, spatial data interpolation, and prediction of ground behavior using field measuring data. And by providing integrating informations and predictions over entire project area with comprehensible visual displays, the applicability and effectiveness of the developed system for construction management were confirmed.