• Title/Summary/Keyword: Evaluation Scheme

Search Result 1,521, Processing Time 0.028 seconds

A Comparative Research on End-to-End Clinical Entity and Relation Extraction using Deep Neural Networks: Pipeline vs. Joint Models (심층 신경망을 활용한 진료 기록 문헌에서의 종단형 개체명 및 관계 추출 비교 연구 - 파이프라인 모델과 결합 모델을 중심으로 -)

  • Sung-Pil Choi
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.1
    • /
    • pp.93-114
    • /
    • 2023
  • Information extraction can facilitate the intensive analysis of documents by providing semantic triples which consist of named entities and their relations recognized in the texts. However, most of the research so far has been carried out separately for named entity recognition and relation extraction as individual studies, and as a result, the effective performance evaluation of the entire information extraction systems was not performed properly. This paper introduces two models of end-to-end information extraction that can extract various entity names in clinical records and their relationships in the form of semantic triples, namely pipeline and joint models and compares their performances in depth. The pipeline model consists of an entity recognition sub-system based on bidirectional GRU-CRFs and a relation extraction module using multiple encoding scheme, whereas the joint model was implemented with a single bidirectional GRU-CRFs equipped with multi-head labeling method. In the experiments using i2b2/VA 2010, the performance of the pipeline model was 5.5% (F-measure) higher. In addition, through a comparative experiment with existing state-of-the-art systems using large-scale neural language models and manually constructed features, the objective performance level of the end-to-end models implemented in this paper could be identified properly.

Development of river discharge estimation scheme using Monte Carlo simulation and 1D numerical analysis model (Monte Carlo 모의 및 수치해석 모형을 활용한 하천 유량 추정기법의 개발)

  • Kang, Hansol;An, Hyunuk;Kim, Yeonsu;Hur, Youngteck;Noh, Joonwoo
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.4
    • /
    • pp.279-289
    • /
    • 2022
  • Since the frequency of heavy rainfall is increasing due to climate change, water levels in the river exceed past historical records. The rating-curve is to convert water level into flow dicscharge from the regression analysis of the water level and corresponding flow discharges. However, the rating-curve involves many uncertainties because of the limited data especially when observed water level exceed past historical water levels. In order to compensate for insufficient data and increase the accuracy of flow discharge data, this study estimates the flow discharge in the river computed mathematically using Monte Carlo simulation based on a 1D hydrodynamic numerical model. Based on the existing rating curve, a random combination of coefficients constituting the rating-curve creates a number of virtual rating curve. From the computed results of the hydrodynamic model, it is possible to estimate flow discharge which reproduces best fit to the observed water level. Based on the statistical evaluation of these samples, a method for mathematically estimating the water level and flow discharge of all cross sections is porposed. The proposed methodology is applied to the junction of Yochoen Stream in the Seomjin River. As a result, it is confirmed that the water level reproducibility was greatly improved. Also, the water level and flow discharge can be calculated mathematically when the proposed method is applied.

Detection of Delay Attack in IoT Automation System (IoT 자동화 시스템의 지연 공격 탐지)

  • Youngduk Kim;Wonsuk Choi;Dong hoon Lee
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.33 no.5
    • /
    • pp.787-799
    • /
    • 2023
  • As IoT devices are widely used at home, IoT automation system that is integrate IoT devices for users' demand are gaining populrity. There is automation rule in IoT automation system that is collecting event and command action. But attacker delay the packet and make time that real state is inconsistent with state recongnized by the system. During the time, the system does not work correctly by predefined automation rule. There is proposed some detection method for delay attack, they have limitations for application to IoT systems that are sensitive to traffic volume and battery consumption. This paper proposes a practical packet delay attack detection technique that can be applied to IoT systems. The proposal scheme in this paper can recognize that, for example, when a sensor transmits an message, an broadcast packet notifying the transmission of a message is sent to the Server recognized that event has occurred. For evaluation purposes, an IoT system implemented using Raspberry Pi was configured, and it was demonstrated that the system can detect packet delay attacks within an average of 2.2 sec. The experimental results showed a power consumption Overhead of an average of 2.5 mA per second and a traffic Overhead of 15%. We demonstrate that our method can detect delay attack efficiently compared to preciously proposed method.

A study on the Physical, Mental and Social Factors Influencing the Health Status of Aged Women in Korea (여성노인의 건강상태와 신체적.심리적.사회적 요소들과의 관계연구)

  • Ro, Seung-Ok
    • Women's Health Nursing
    • /
    • v.2 no.1
    • /
    • pp.53-67
    • /
    • 1996
  • A total health state evaluation of Korean female elderlies was made by using the questionary scheme measuring the physical, mental and social functions of the elderlies, in order to investigate the critical factors for the health maintenance of female elderlies and to develop their preventive nursing program. A total of 280 subjects over 65 years old living in Seoul and the suburban area were selected and interviewed during the period of September and October in 1995. The materials collected were analyzed statistically by using SAS data processing program, and the results and recommendations are summarized as follows. 1. The physical health state of Korean elderly women was evaluated to be satisfactory by showing an average score of 3.722 in 5.0 full-score scale. But this score was lower than those evaluated for the elderlies combined both sexes(4.054). The mental health state of the subjects was also evaluated as high scoring 3.484, possibly due to the fact that 78% of the subjects lived together with their children's family. On the other hand, the social health state of the subjects was relatively low scoring 2.585, mainly due to that 80% of them was widows which was resulted by the 6-7 years longer life-expectancy of Korean women. 2. A significant differences in the physical health state scores between different age groups was observed, indicating the rapid ageing process occurring in this age group. The family structure was appeared to be an important factor influencing the physical health state of the female elderlies ; the physical health score of the women with her husband only was higher than that of those living with children's families, and the lowest score was obtained from those living alone. 3. The age was the most important factor determining the mental health state of the subjects, while the religion, educational status, marriage state and family structure did not significantly influenced the mental health state of the aged women. 4. The social health state of the subject was deeply influenced by the marriage state and family structure, showing significantly lower scores with widowers compared to the married couples. Those living with their married spouse only obtained the highest social health score, while those living along showed the lowest score. The parent and grandparentship of those living with their children and the religion, especially Catholic and Protestant, had positive influence on the social health state of the aged women. 5. The mental health state of aged women showed significant correlation with the factors determining the physical health, except for digestive system related ability and sexual ability and the highest extra home ability. 6. The mental health state of aged women showed significant correlation with the factors determining social health, especially with the parent and grandparentship and the family relative's role. From these results, the following recommendations are made. 1. Since the physical, mental and social health states of aged people are deeply influenced by the sex and the average values of the both sex can create misleading figures, the health evaluation of the elderlies should be made separately by sex. 2. Since the health state of aged women is highly influenced by their family structure, the spouse's role and living with married couple only should be emphasized in respect of preventive health care. 3. The social activity programs and grandparentship teaching programs should be prepared in the nursing care program for aged people.

  • PDF

Performance Evaluation of Advance Warning System for Transporting Hazardous Materials (위험물 운송을 위한 조기경보시스뎀 성능평가)

  • Oh Sei-Chang;Cho Yong-Sung
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.4 no.1 s.6
    • /
    • pp.15-29
    • /
    • 2005
  • Truck Shipment Safety Information, which is a part of the development of NERIS is divided into Optimal Route Guidance System and Emergency Response System. This research is for establishing an advance warning system, which aims for preventing damages(fire, explosion, gas-escape etc.) and detecting incidents that are able to happen during transporting hazardous materials in advance through monitoring the position of moving vehicles and the state of hazardous materials in real-time. This research is peformed to confirm the practical possibility of application of the advance warning system that monitors whether the hazardous materials transport vehicles move the allowed routes, finds the time and the location of incidents of the vehicles promptly and develops the emergency system that is able to respond to the incidents as well by using the technologies of CPS, CDMA and CIS with testing the ability of performance. As the results of the test, communication accuracies are 99$\%$ in freeway, 96$\%$ in arterial, 97$\%$ in hilly sections, 99$\%$ in normal sections, 96$\%$ in local sections, 99$\%$ in urban sections and 98$\%$ in tunnels. According to those results, the system has been recorded a high success rate of communication that enough to apply to the real site. However, the weak point appeared through the testing is that the system has a limitation of communication that is caused in the rural areas and certain areas where are fewer antennas that make communication possible between on-board unit and management server. Consequently, for the practical use of this system, it is essential to develop the exclusive en-board unit for the vehicles and find the method that supplements the receiving limitation of the GPS coordinates inside tunnels. Additionally, this system can be used to regulate illegal acts automatically such as illegal negligence of hazardous materials. And the system can be applied to the study about an application scheme as a guideline for transporting hazardous materials because there is no certain management system and act of toxic substances in Korea.

  • PDF

Current Wheat Quality Criteria and Inspection Systems of Major Wheat Producing Countries (밀 품질평가 현황과 검사제도)

  • 이춘기;남중현;강문석;구본철;김재철;박광근;박문웅;김용호
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.47
    • /
    • pp.63-94
    • /
    • 2002
  • On the purpose to suggest an advanced scheme in assessing the domestic wheat quality, this paper reviewed the inspection systems of wheat in major wheat producing countries as well as the quality criteria which are being used in wheat grading and classification. Most wheat producing countries are adopting both classifications of class and grade to provide an objective evaluation and an official certification to their wheat. There are two main purposes in the wheat classification. The first objectives of classification is to match the wheat with market requirements to maximize market opportunities and returns to growers. The second is to ensure that payments to glowers aye made on the basis of the quality and condition of the grain delivered. Wheat classes has been assigned based on the combination of cultivation area, seed-coat color, kernel and varietal characteristics that are distinctive. Most reputable wheat marketers also employ a similar approach, whereby varieties of a particular type are grouped together, designed by seed coat colour, grain hardness, physical dough properties, and sometimes more precise specification such as starch quality, all of which are genetically inherited characteristics. This classification in simplistic terms is the categorization of a wheat variety into a commercial type or style of wheat that is recognizable for its end use capabilities. All varieties registered in a class are required to have a similar end-use performance that the shipment be consistent in processing quality, cargo to cargo and year to year, Grain inspectors have historically determined wheat classes according to visual kernel characteristics associated with traditional wheat varieties. As well, any new wheat variety must not conflict with the visual distinguishability rule that is used to separate wheats of different classes. Some varieties may possess characteristics of two or more classes. Therefore, knowledge of distinct varietal characteristics is necessary in making class determinations. The grading system sets maximum tolerance levels for a range of characteristics that ensure functionality and freedom from deleterious factors. Tests for the grading of wheat include such factors as plumpness, soundness, cleanliness, purity of type and general condition. Plumpness is measured by test weight. Soundness is indicated by the absence or presence of musty, sour or commercially objectionable foreign odors and by the percentage of damaged kernels that ave present in the wheat. Cleanliness is measured by determining the presence of foreign material after dockage has been removed. Purity of class is measured by classification of wheats in the test sample and by limitation for admixtures of different classes of wheat. Moisture does not influence the numerical grade. However, it is determined on all shipments and reported on the official certificate. U.S. wheat is divided into eight classes based on color, kernel Hardness and varietal characteristics. The classes are Durum, Hard Red Spring, Hard Red Winter, Soft Red Winter, Hard White, soft White, Unclassed and Mixed. Among them, Hard Red Spring wheat, Durum wheat, and Soft White wheat are further divided into three subclasses, respectively. Each class or subclass is divided into five U.S. numerical grades and U.S. Sample grade. Special grades are provided to emphasize special qualities or conditions affecting the value of wheat and are added to and made a part of the grade designation. Canadian wheat is also divided into fourteen classes based on cultivation area, color, kernel hardness and varietal characteristics. The classes have 2-5 numerical grades, a feed grade and sample grades depending on class and grading tolerance. The Canadian grading system is based mainly on visual evaluation, and it works based on the kernel visual distinguishability concept. The Australian wheat is classified based on geographical and quality differentiation. The wheat grown in Australia is predominantly white grained. There are commonly up to 20 different segregations of wheat in a given season. Each variety grown is assigned a category and a growing areas. The state governments in Australia, in cooperation with the Australian Wheat Board(AWB), issue receival standards and dockage schedules annually that list grade specifications and tolerances for Australian wheat. AWB is managing "Golden Rewards" which is designed to provide pricing accuracy and market signals for Australia's grain growers. Continuous payment scales for protein content from 6 to 16% and screenings levels from 0 to 10% based on varietal classification are presented by the Golden Rewards, and the active payment scales and prices can change with market movements.movements.

A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns (인터넷 검색기록 분석을 통한 쇼핑의도 포함 키워드 자동 추출 기법)

  • Kim, Mingyu;Kim, Namgyu;Jung, Inhwan
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.123-136
    • /
    • 2014
  • Recently, online shopping has further developed as the use of the Internet and a variety of smart mobile devices becomes more prevalent. The increase in the scale of such shopping has led to the creation of many Internet shopping malls. Consequently, there is a tendency for increasingly fierce competition among online retailers, and as a result, many Internet shopping malls are making significant attempts to attract online users to their sites. One such attempt is keyword marketing, whereby a retail site pays a fee to expose its link to potential customers when they insert a specific keyword on an Internet portal site. The price related to each keyword is generally estimated by the keyword's frequency of appearance. However, it is widely accepted that the price of keywords cannot be based solely on their frequency because many keywords may appear frequently but have little relationship to shopping. This implies that it is unreasonable for an online shopping mall to spend a great deal on some keywords simply because people frequently use them. Therefore, from the perspective of shopping malls, a specialized process is required to extract meaningful keywords. Further, the demand for automating this extraction process is increasing because of the drive to improve online sales performance. In this study, we propose a methodology that can automatically extract only shopping-related keywords from the entire set of search keywords used on portal sites. We define a shopping-related keyword as a keyword that is used directly before shopping behaviors. In other words, only search keywords that direct the search results page to shopping-related pages are extracted from among the entire set of search keywords. A comparison is then made between the extracted keywords' rankings and the rankings of the entire set of search keywords. Two types of data are used in our study's experiment: web browsing history from July 1, 2012 to June 30, 2013, and site information. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The original sample dataset contains 150 million transaction logs. First, portal sites are selected, and search keywords in those sites are extracted. Search keywords can be easily extracted by simple parsing. The extracted keywords are ranked according to their frequency. The experiment uses approximately 3.9 million search results from Korea's largest search portal site. As a result, a total of 344,822 search keywords were extracted. Next, by using web browsing history and site information, the shopping-related keywords were taken from the entire set of search keywords. As a result, we obtained 4,709 shopping-related keywords. For performance evaluation, we compared the hit ratios of all the search keywords with the shopping-related keywords. To achieve this, we extracted 80,298 search keywords from several Internet shopping malls and then chose the top 1,000 keywords as a set of true shopping keywords. We measured precision, recall, and F-scores of the entire amount of keywords and the shopping-related keywords. The F-Score was formulated by calculating the harmonic mean of precision and recall. The precision, recall, and F-score of shopping-related keywords derived by the proposed methodology were revealed to be higher than those of the entire number of keywords. This study proposes a scheme that is able to obtain shopping-related keywords in a relatively simple manner. We could easily extract shopping-related keywords simply by examining transactions whose next visit is a shopping mall. The resultant shopping-related keyword set is expected to be a useful asset for many shopping malls that participate in keyword marketing. Moreover, the proposed methodology can be easily applied to the construction of special area-related keywords as well as shopping-related ones.

Clinical Utility of Turbo Contrase-Enhanced MR Angiography for the Major Branches of the Aortic Arch (대동맥궁 주요 분지들의 고속 조영증강 자기공명혈관조영술의 임상적 유용성)

  • Su Ok Seong
    • Investigative Magnetic Resonance Imaging
    • /
    • v.2 no.1
    • /
    • pp.96-103
    • /
    • 1998
  • Purpose : To assess the clinical utility of turbo contrast-enhanced magnetic resonance angiography(CE MRA) in the evaluation of the aortic arch and its major branches and to compare the image quality of CE MRA among different coils used. Materials and Methods : Turbo three-phase dynamic CE MRA encompassing aortic arch and its major branches was prospectively performed after manual bolus IV injection of contrast material in 29 patients with suspected cerebrovascular diseases at 1.0T MR unit. the raw data were obtained with 3-D FISH sequence (TR 5.4ms, TE 2.3ms, flip angle 30, slab thickness 80nm, effective slice thickness 4.0mm, matrix size $100{\times}256$, FOV 280mm). Total data acquisition time was 4. to 60 seconds. We subjectively evaluated the imge quality with three-rating scheme : "good" for unequivocal normal finding, "fair" for relatively satisfactory quality to diagnose 'normal' despite intravascular low signal, and "poor" for equivocal diagnosis or non-visualization of the origin or segment of the vessels due to low signal or artifacts which needs catheter angiography. At the level of the carotid bifurcation, it was compared with conventional 2D-TOF MRA image. Overall image quality was also compared visually and quantitatively by measuring signal-to-noise ratios (SNRs) of the ascending aorta, the innominate artery and both common carotid arteries among the three different coils used(CP body array(n=12), CP neck array(n=9), and head-and-neck(n=8). Results : Demonstration of the aortic arch and its major branches was rated as "good" in 55% (16/29) and "fair" in 34%(10/29). At the level of the carotid bifurcation, image quality of turbo CE MRA was same as or better than conventional 2D-TOF MRA in 65% (17/26). Overall image quality and SNR were significantlygreater with CP body array coil than with CP neck array or head-and-neck coil. Conclusions : Turbo CE MRA can be used as a screening exam in the evaluation of the major branches of the aortic arch from their origin to the skull base. Overall imagequality appears to be better with CP body array coil than with CP neck array coil or head-and-neck coil.

  • PDF

A Study on the Design of the Grid-Cell Assessment System for the Optimal Location of Offshore Wind Farms (해상풍력발전단지의 최적 위치 선정을 위한 Grid-cell 평가 시스템 개념 설계)

  • Lee, Bo-Kyeong;Cho, Ik-Soon;Kim, Dae-Hae
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.24 no.7
    • /
    • pp.848-857
    • /
    • 2018
  • Recently, around the world, active development of new renewable energy sources including solar power, waves, and fuel cells, etc. has taken place. Particularly, floating offshore wind farms have been developed for saving costs through large scale production, using high-quality wind power and minimizing noise damage in the ocean area. The development of floating wind farms requires an evaluation of the Maritime Safety Audit Scheme under the Maritime Safety Act in Korea. Floating wind farms shall be assessed by applying the line and area concept for systematic development, management and utilization of specified sea water. The development of appropriate evaluation methods and standards is also required. In this study, proper standards for marine traffic surveys and assessments were established and a systemic treatment was studied for assessing marine spatial area. First, a marine traffic data collector using AIS or radar was designed to conduct marine traffic surveys. In addition, assessment methods were proposed such as historical tracks, traffic density and marine traffic pattern analysis applying the line and area concept. Marine traffic density can be evaluated by spatial and temporal means, with an adjusted grid-cell scale. Marine traffic pattern analysis was proposed for assessing ship movement patterns for transit or work in sea areas. Finally, conceptual design of a Marine Traffic and Safety Assessment Solution (MaTSAS) was competed that can be analyzed automatically to collect and assess the marine traffic data. It could be possible to minimize inaccurate estimation due to human errors such as data omission or misprints through automated and systematic collection, analysis and retrieval of marine traffic data. This study could provides reliable assessment results, reflecting the line and area concept, according to sea area usage.

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.141-166
    • /
    • 2019
  • Recently, channels like social media and SNS create enormous amount of data. In all kinds of data, portions of unstructured data which represented as text data has increased geometrically. But there are some difficulties to check all text data, so it is important to access those data rapidly and grasp key points of text. Due to needs of efficient understanding, many studies about text summarization for handling and using tremendous amounts of text data have been proposed. Especially, a lot of summarization methods using machine learning and artificial intelligence algorithms have been proposed lately to generate summary objectively and effectively which called "automatic summarization". However almost text summarization methods proposed up to date construct summary focused on frequency of contents in original documents. Those summaries have a limitation for contain small-weight subjects that mentioned less in original text. If summaries include contents with only major subject, bias occurs and it causes loss of information so that it is hard to ascertain every subject documents have. To avoid those bias, it is possible to summarize in point of balance between topics document have so all subject in document can be ascertained, but still unbalance of distribution between those subjects remains. To retain balance of subjects in summary, it is necessary to consider proportion of every subject documents originally have and also allocate the portion of subjects equally so that even sentences of minor subjects can be included in summary sufficiently. In this study, we propose "subject-balanced" text summarization method that procure balance between all subjects and minimize omission of low-frequency subjects. For subject-balanced summary, we use two concept of summary evaluation metrics "completeness" and "succinctness". Completeness is the feature that summary should include contents of original documents fully and succinctness means summary has minimum duplication with contents in itself. Proposed method has 3-phases for summarization. First phase is constructing subject term dictionaries. Topic modeling is used for calculating topic-term weight which indicates degrees that each terms are related to each topic. From derived weight, it is possible to figure out highly related terms for every topic and subjects of documents can be found from various topic composed similar meaning terms. And then, few terms are selected which represent subject well. In this method, it is called "seed terms". However, those terms are too small to explain each subject enough, so sufficient similar terms with seed terms are needed for well-constructed subject dictionary. Word2Vec is used for word expansion, finds similar terms with seed terms. Word vectors are created after Word2Vec modeling, and from those vectors, similarity between all terms can be derived by using cosine-similarity. Higher cosine similarity between two terms calculated, higher relationship between two terms defined. So terms that have high similarity values with seed terms for each subjects are selected and filtering those expanded terms subject dictionary is finally constructed. Next phase is allocating subjects to every sentences which original documents have. To grasp contents of all sentences first, frequency analysis is conducted with specific terms that subject dictionaries compose. TF-IDF weight of each subjects are calculated after frequency analysis, and it is possible to figure out how much sentences are explaining about each subjects. However, TF-IDF weight has limitation that the weight can be increased infinitely, so by normalizing TF-IDF weights for every subject sentences have, all values are changed to 0 to 1 values. Then allocating subject for every sentences with maximum TF-IDF weight between all subjects, sentence group are constructed for each subjects finally. Last phase is summary generation parts. Sen2Vec is used to figure out similarity between subject-sentences, and similarity matrix can be formed. By repetitive sentences selecting, it is possible to generate summary that include contents of original documents fully and minimize duplication in summary itself. For evaluation of proposed method, 50,000 reviews of TripAdvisor are used for constructing subject dictionaries and 23,087 reviews are used for generating summary. Also comparison between proposed method summary and frequency-based summary is performed and as a result, it is verified that summary from proposed method can retain balance of all subject more which documents originally have.