• Title/Summary/Keyword: Analysis on Labeling

Search Result 328, Processing Time 0.035 seconds

A Comparative Research on End-to-End Clinical Entity and Relation Extraction using Deep Neural Networks: Pipeline vs. Joint Models (심층 신경망을 활용한 진료 기록 문헌에서의 종단형 개체명 및 관계 추출 비교 연구 - 파이프라인 모델과 결합 모델을 중심으로 -)

  • Sung-Pil Choi
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.1
    • /
    • pp.93-114
    • /
    • 2023
  • Information extraction can facilitate the intensive analysis of documents by providing semantic triples which consist of named entities and their relations recognized in the texts. However, most of the research so far has been carried out separately for named entity recognition and relation extraction as individual studies, and as a result, the effective performance evaluation of the entire information extraction systems was not performed properly. This paper introduces two models of end-to-end information extraction that can extract various entity names in clinical records and their relationships in the form of semantic triples, namely pipeline and joint models and compares their performances in depth. The pipeline model consists of an entity recognition sub-system based on bidirectional GRU-CRFs and a relation extraction module using multiple encoding scheme, whereas the joint model was implemented with a single bidirectional GRU-CRFs equipped with multi-head labeling method. In the experiments using i2b2/VA 2010, the performance of the pipeline model was 5.5% (F-measure) higher. In addition, through a comparative experiment with existing state-of-the-art systems using large-scale neural language models and manually constructed features, the objective performance level of the end-to-end models implemented in this paper could be identified properly.

A Study on High-Speed Extraction of Bar Code Region for Parcel Automatic Identification (소포 자동식별을 위한 바코드 관심영역 고속 추출에 관한 연구)

  • Park, Moon-Sung;Kim, Jin-Suk;Kim, Hye-Kyu;Jung, Hoe-Kyung
    • The KIPS Transactions:PartD
    • /
    • v.9D no.5
    • /
    • pp.915-924
    • /
    • 2002
  • Conventional Systems for parcel sorting consist of two sequences as loading the parcel into conveyor belt system and post-code input. Using bar code information, the parcels to be recorded and managed are recognized. This paper describes a 32 $\times$ 32 sized mini-block inspection to extract bar code Region of Interest (ROI) from the line Charged Coupled Device (CCD) camera capturing image of moving parcel at 2m/sec speed. Firstly, the Min-Max distribution of the mini-block has been applied to discard the background of parcel and region of conveying belts from the image. Secondly, the diagonal inspection has been used for the extraction of letters and bar code region. Five horizontal line scanning detects the number of edges and sizes and ROI has been acquired from the detection. The wrong detected area has been deleted by the comparison of group size from labeling processes. To correct excluded bar code region in mini-block processes and for analysis of bar code information, the extracted ROI 8 boundary points and decline distribution have been used with central axis line adjustment. The ROI extraction and central axis creation have become enable within 60~80msec, and the accuracy has been accomplished over 99.44 percentage.

The Error Pattern Analysis of the HMM-Based Automatic Phoneme Segmentation (HMM기반 자동음소분할기의 음소분할 오류 유형 분석)

  • Kim Min-Je;Lee Jung-Chul;Kim Jong-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.5
    • /
    • pp.213-221
    • /
    • 2006
  • Phone segmentation of speech waveform is especially important for concatenative text to speech synthesis which uses segmented corpora for the construction of synthetic units. because the quality of synthesized speech depends critically on the accuracy of the segmentation. In the beginning. the phone segmentation was manually performed. but it brings the huge effort and the large time delay. HMM-based approaches adopted from automatic speech recognition are most widely used for automatic segmentation in speech synthesis, providing a consistent and accurate phone labeling scheme. Even the HMM-based approach has been successful, it may locate a phone boundary at a different position than expected. In this paper. we categorized adjacent phoneme pairs and analyzed the mismatches between hand-labeled transcriptions and HMM-based labels. Then we described the dominant error patterns that must be improved for the speech synthesis. For the experiment. hand labeled standard Korean speech DB from ETRI was used as a reference DB. Time difference larger than 20ms between hand-labeled phoneme boundary and auto-aligned boundary is treated as an automatic segmentation error. Our experimental results from female speaker revealed that plosive-vowel, affricate-vowel and vowel-liquid pairs showed high accuracies, 99%, 99.5% and 99% respectively. But stop-nasal, stop-liquid and nasal-liquid pairs showed very low accuracies, 45%, 50% and 55%. And these from male speaker revealed similar tendency.

Status of Supplier Selection Status and the Practical Use of Purchase Specifications for Self-operated School Foodservices in the Seoul Area (서울 지역 직영 학교 급식의 공급 업체 선정 및 식재료 규격서 사용 실태 조사)

  • Ryu, Kyung
    • The Korean Journal of Food And Nutrition
    • /
    • v.20 no.2
    • /
    • pp.226-239
    • /
    • 2007
  • The purpose of this study was to identify the problems related to the purchasing processes of school foodservices that should be corrected for the food service safety, by examining the purchasing processes and the status of supplier selection. A questionnaire was given to 300 dietitians working at self-operated food services. Ninety-eight responses, excluding incomplete answers, were used for the statistical analysis. The survey consisted of three parts: the general characteristics of the school foodservice and dietitian, purchasing processes and supplier selection, and the purchase specifications. We found that 84% of the contract was made by informal purchasing, and the contract period was 6 months or one year. For supplier selection, problems related to the document screening systems were the superficiality of the content(45.7%) and the absence or lack of clarity of the appraisal criteria(34.8%). The important factors for the facility and equipment standards of suppliers were included unclear evaluation methods for content(41.1%) and inappropriate appraisal lists(21.1%), while unclear evaluation methods for content(41.9%) and absence or lack of clarity of the appraisal criteria(20.4%) were the problems pertaining to the supplier evaluation checklist. When using the Food Labeling Standards to select suppliers, confirmation of the sell-by date and the storage method had the highest score at 3.85 out of 5. For supplier selection, only 25% of the contract was made by using the purchase specifications. The levels of satisfaction of with Kimchi and rice cakes suppliers were significantly different according to employment type and educational background, respectively. Depending on working experiences, satisfaction was significantly different for the use of document screening, as a standard for the selection and management of suppliers, and for the facility and equipment standards of suppliers, The use of purchase specifications was different by employment type, while the use of purchase specifications for contracts was different by working experience. These results imply that the specialization of suppliers is necessary to unsure food safety. Therefore, the objective methods to evaluate the suppliers should be developed by the government, and appropriate education programs for dietitians should be prepared to enhance the utilization of purchase specifications.

Biodistribution of 99mTc Labeled Integrin Antagonist

  • Jang, Beom-Su;Park, Seung-Hee;Shin, In Soo;Maeng, Jin-Soo;Paik, Chang H.
    • Toxicological Research
    • /
    • v.29 no.1
    • /
    • pp.21-25
    • /
    • 2013
  • The selective targeting of an integrin ${\alpha}_v{\beta}_3$ receptor using radioligands may enable the assessment of angiogenesis and integrin ${\alpha}_v{\beta}_3$ receptor status in tumors. The aim of this research was to label a peptidomimetic integrin ${\alpha}_v{\beta}_3$ antagonist (PIA) with $^{99m}Tc(CO)_3$ and to test its receptor targeting properties in nude mice bearing receptor-positive tumors. PIA was reacted with tris-succinimidyl aminotriacetate (TSAT) (20 mM) as a PIA per TSAT. The product, PIA-aminodiacetic acid (ADA), was radiolabeled with $[^{99m}Tc(CO)_3(H_2O)_3]^{+1}$, and purified sequentially on a Sep-Pak C-18 cartridge followed by a Sep-Pak QMA anion exchange cartridge. Using gradient C-18 reverse-phase HPLC, the radiochemical purity of $^{99m}Tc(CO)_3$-ADA-PIA (retention time, 10.5 min) was confirmed to be > 95%. Biodistribution analysis was performed in nude mice (n = 5 per time point) bearing receptor-positive M21 human melanoma xenografts. The mice were administered $^{99m}Tc(CO)_3$-ADA-PIA intravenously. The animals were euthanized at 0.33, 1, and 2 hr after injection for the biodistribution study. A separate group of mice were also co-injected with 200 ${\mu}g$ of PIA and euthanized at 1 hr to quantify tumor uptake. $^{99m}Tc(CO)_3$-ADA-PIA was stable in phosphate buffer for 21 hr, but at 3 and 6 hr, 7.9 and 11.5% of the radioactivity was lost as histidine, respectively. In tumor bearing mice, $^{99m}Tc(CO)_3$-ADA-PIA accumulated rapidly in a receptor-positive tumor with a peak uptake at 20 min, and rapid clearance from blood occurring primarily through the hepatobiliary system. At 20 min, the tumor-to-blood ratio was 1.8. At 1 hr, the tumor uptake was 0.47% injected dose (ID)/g, but decreased to 0.12% ID/g when co-injected with an excess amount of PIA, indicating that accumulation was receptor mediated. These results demonstrate successful $^{99m}TC$ labeling of a peptidomimetic integrin antagonist that accumulated in a tumor via receptor-specific binding. However, tumor uptake was very low because of low blood concentrations that likely resulted from rapid uptake of the agent into the hepatobiliary system. This study suggests that for $^{99m}Tc(CO)_3$-ADA-PIA to be useful as a tumor detection agent, it will be necessary to improve receptor binding affinity and increase the hydrophilicity of the product to minimize rapid hepatobiliary uptake.

Comparison of the Mathematics Educational Values between Pre-service and In-service Elementary School Teachers (수학교육적 가치에 대한 예비 초등교사와 현직 초등교사의 인식 비교)

  • Yim, MinJae;Cho, SooYun;Pang, JeongSuk
    • Communications of Mathematical Education
    • /
    • v.34 no.3
    • /
    • pp.277-297
    • /
    • 2020
  • The purpose of this study was to identify and compare the mathematics educational values of pre-service and in-service elementary school teachers. For this purpose, we implemented a questionnaire investigating mathematics educational values and used principal component analysis which resulted in six components. These components were named as fun, problem-solving, representation, computation, ability, and explanation through systematic labeling processes. Both pre-service and in-service elementary school teachers considered problem-solving the most important and there was no statistical difference between the teacher groups. They also considered fun the least important and in-service elementary school teachers regarded it more important than pre-service counterparts did. All value components except explanation were regarded as important by in-service elementary school teachers, fourth-year pre-service teachers, and first-year pre-service teachers in order. The result of noticeable differences between pre-service and in-service elementary school teachers implies that actual teaching experience may affect teachers' mathematics educational values more than teacher preparation programs. Based on these findings, we need to discuss what should be regarded as important and worthwhile in teacher preparation programs to establish mathematics educational values for pre-service teachers. We also need to confirm whether the mathematics educational values by in-service elementary school teachers may be in line with what has been pursued in the national mathematics curriculum.

Evaluation of nutrient and food intake status, and dietary quality in Korean adults according to nutrition label utilization: Based on 2010-2011 Korean National Health and Nutrition Examination Survey (성인 남녀에서 영양표시 활용 정도에 따른 영양섭취 및 식사의 질 평가: 2010~2011 국민건강영양조사 자료를 이용하여)

  • Bae, Yun-Jung
    • Journal of Nutrition and Health
    • /
    • v.47 no.3
    • /
    • pp.193-205
    • /
    • 2014
  • Purpose: This study was conducted in order to investigate nutrient and food intake status and dietary quality in Korean adults according to nutrition label utilization. Methods: We analyzed data from the combined 2010-2011 KNHANES (Korean National Health and Nutrition Examination Survey). The analysis included 8190 adults aged 19 to 64 years. In this study, according to nutrition label utilization, we classified the subjects according to the "non-utilization of nutrition label (NUNL)" group (male, n = 2716, female, n = 3147), "identification of nutrition label (INL)" group (male, n = 143, female, n = 330), and "Utilization of nutrition label (UNL)" group (male, n = 363, female, n = 1491). Nutrient and food group intake, NAR (nutrient adequacy ratio), MAR (mean adequacy ratio), and INQ (index of nutritional quality) were analyzed using data from the 24-recall method. Results: Results of this study showed that subjects in the NUNL group were significantly more likely to drink alcohol compared with the other two groups. The NUNL group showed a significantly higher frequency of consuming instant noodles, Soju (male), and carbonated drink (female) than the UNL group, whereas the NUNL group showed a significantly lower frequency of consuming milk, soymilk, and yogurt than the UNL group. In addition, regarding diet quality (NAR and INQ), significantly lower vitamin $B_2$, vitamin C, and calcium was observed in the NUNL group compared with the UNL group. For both male and female, significantly higher MAR was observed in the UNL group than in the NUNL group. The NUNL group showed significantly lower consumption of milk compared to the UNL group. Conclusion: Good dietary practice such as referring to nutrition labels and its influence can affect the quality of nutritional intake and selection of food, while it can also provide basic data for specific nutrition education regarding use of nutrition labeling.

Variation of Primary Productivity and Phytoplankton Community in the Weirs of Mid and Downstream of the Nakdong River during Fall and Early Winter: Application of Phytoplankton Pigments and CHEMTAX (추계-동계 낙동강 중 하류 보 구간 일차생산력 및 식물플랑크톤 군집조성 변화: 식물플랑크톤 색소와 CHEMTAX 활용)

  • Choi, Jisoo;Min, Jun Oh;Choi, Bohyung;Kang, Jae Joong;Choi, Kwangsoon;Lee, Sang Heon;Shin, Kyung Hoon
    • Korean Journal of Ecology and Environment
    • /
    • v.52 no.2
    • /
    • pp.81-93
    • /
    • 2019
  • Phytoplankton is one of the important primary producers providing organic matter through photosynthesis in aquatic environments. In order to determine a temporal and spatial variation in primary productivity after weir construction in the Nakdong River, we investigated carbon uptake rates using in-situ $^{13}C$ labeling experiments and identified algal communities contributing to primary productivity using HPLC-CHEMTAX analysis from October to December, 2017. The primary productivity gradually decreased from fall to early winter season ($249{\sim}933mgC\;m^{-2}d^{-1}$ in October, $64{\sim}536mgC\;m^{-2}d^{-1}$ in November and $60{\sim}274mgC\;m^{-2}d^{-1}$ in December, respectively). This is attributed to the temporally declining light intensity and the decreasing biomass and physiological activity of phytoplankton in winter. The contribution of diatoms to the phytoplankton community in the Nakdong River was approximately 63% at all the sampling sites and seasons, while the contribution of cryptophytes increased from 9% in October to 32% in November and December. The temporal changes in the primary productivity and the dominant phytoplankton species in the mid and downstream weirs of the Nakdong River was investigated for the first time, after construction of the weirs, and major environmental factors controlling the temporal variation in primary productivity and phytoplankton communities were identified in this study. We suggest that seasonal field investigations will provide further information on the major environmental factors which affect the annual variation of primary productivity and phytoplankton communities.

Analyzing the discriminative characteristic of cover letters using text mining focused on Air Force applicants (텍스트 마이닝을 이용한 공군 부사관 지원자 자기소개서의 차별적 특성 분석)

  • Kwon, Hyeok;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.75-94
    • /
    • 2021
  • The low birth rate and shortened military service period are causing concerns about selecting excellent military officers. The Republic of Korea entered a low birth rate society in 1984 and an aged society in 2018 respectively, and is expected to be in a super-aged society in 2025. In addition, the troop-oriented military is changed as a state-of-the-art weapons-oriented military, and the reduction of the military service period was implemented in 2018 to ease the burden of military service for young people and play a role in the society early. Some observe that the application rate for military officers is falling due to a decrease of manpower resources and a preference for shortened mandatory military service over military officers. This requires further consideration of the policy of securing excellent military officers. Most of the related studies have used social scientists' methodologies, but this study applies the methodology of text mining suitable for large-scale documents analysis. This study extracts words of discriminative characteristics from the Republic of Korea Air Force Non-Commissioned Officer Applicant cover letters and analyzes the polarity of pass and fail. It consists of three steps in total. First, the application is divided into general and technical fields, and the words characterized in the cover letter are ordered according to the difference in the frequency ratio of each field. The greater the difference in the proportion of each application field, the field character is defined as 'more discriminative'. Based on this, we extract the top 50 words representing discriminative characteristics in general fields and the top 50 words representing discriminative characteristics in technology fields. Second, the number of appropriate topics in the overall cover letter is calculated through the LDA. It uses perplexity score and coherence score. Based on the appropriate number of topics, we then use LDA to generate topic and probability, and estimate which topic words of discriminative characteristic belong to. Subsequently, the keyword indicators of questions used to set the labeling candidate index, and the most appropriate index indicator is set as the label for the topic when considering the topic-specific word distribution. Third, using L-LDA, which sets the cover letter and label as pass and fail, we generate topics and probabilities for each field of pass and fail labels. Furthermore, we extract only words of discriminative characteristics that give labeled topics among generated topics and probabilities by pass and fail labels. Next, we extract the difference between the probability on the pass label and the probability on the fail label by word of the labeled discriminative characteristic. A positive figure can be seen as having the polarity of pass, and a negative figure can be seen as having the polarity of fail. This study is the first research to reflect the characteristics of cover letters of Republic of Korea Air Force non-commissioned officer applicants, not in the private sector. Moreover, these methodologies can apply text mining techniques for multiple documents, rather survey or interview methods, to reduce analysis time and increase reliability for the entire population. For this reason, the methodology proposed in the study is also applicable to other forms of multiple documents in the field of military personnel. This study shows that L-LDA is more suitable than LDA to extract discriminative characteristics of Republic of Korea Air Force Noncommissioned cover letters. Furthermore, this study proposes a methodology that uses a combination of LDA and L-LDA. Therefore, through the analysis of the results of the acquisition of non-commissioned Republic of Korea Air Force officers, we would like to provide information available for acquisition and promotional policies and propose a methodology available for research in the field of military manpower acquisition.

Automated Analyses of Ground-Penetrating Radar Images to Determine Spatial Distribution of Buried Cultural Heritage (매장 문화재 공간 분포 결정을 위한 지하투과레이더 영상 분석 자동화 기법 탐색)

  • Kwon, Moonhee;Kim, Seung-Sep
    • Economic and Environmental Geology
    • /
    • v.55 no.5
    • /
    • pp.551-561
    • /
    • 2022
  • Geophysical exploration methods are very useful for generating high-resolution images of underground structures, and such methods can be applied to investigation of buried cultural properties and for determining their exact locations. In this study, image feature extraction and image segmentation methods were applied to automatically distinguish the structures of buried relics from the high-resolution ground-penetrating radar (GPR) images obtained at the center of Silla Kingdom, Gyeongju, South Korea. The major purpose for image feature extraction analyses is identifying the circular features from building remains and the linear features from ancient roads and fences. Feature extraction is implemented by applying the Canny edge detection and Hough transform algorithms. We applied the Hough transforms to the edge image resulted from the Canny algorithm in order to determine the locations the target features. However, the Hough transform requires different parameter settings for each survey sector. As for image segmentation, we applied the connected element labeling algorithm and object-based image analysis using Orfeo Toolbox (OTB) in QGIS. The connected components labeled image shows the signals associated with the target buried relics are effectively connected and labeled. However, we often find multiple labels are assigned to a single structure on the given GPR data. Object-based image analysis was conducted by using a Large-Scale Mean-Shift (LSMS) image segmentation. In this analysis, a vector layer containing pixel values for each segmented polygon was estimated first and then used to build a train-validation dataset by assigning the polygons to one class associated with the buried relics and another class for the background field. With the Random Forest Classifier, we find that the polygons on the LSMS image segmentation layer can be successfully classified into the polygons of the buried relics and those of the background. Thus, we propose that these automatic classification methods applied to the GPR images of buried cultural heritage in this study can be useful to obtain consistent analyses results for planning excavation processes.