• Title/Summary/Keyword: Free Text

Search Result 183, Processing Time 0.022 seconds

Automatic Text Categorization using the Importance of Sentences (문장 중요도를 이용한 자동 문서 범주화)

  • Ko, Young-Joong;Park, Jin-Woo;Seo, Jung-Yun
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.6
    • /
    • pp.417-424
    • /
    • 2002
  • Automatic text categorization is a problem of assigning predefined categories to free text documents. In order to classify text documents, we have to extract good features from them. In previous researches, a text document is commonly represented by the frequency of each feature. But there is a difference between important and unimportant sentences in a text document. It has an effect on the importance of features in a text document. In this paper, we measure the importance of sentences in a text document using text summarizing techniques. A text document is represented by features with different weights according to the importance of each sentence. To verify the new method, we constructed Korean news group data set and experiment our method using it. We found that our new method gale a significant improvement over a basis system for our data sets.

Extending TextAE for annotation of non-contiguous entities

  • Lever, Jake;Altman, Russ;Kim, Jin-Dong
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.15.1-15.6
    • /
    • 2020
  • Named entity recognition tools are used to identify mentions of biomedical entities in free text and are essential components of high-quality information retrieval and extraction systems. Without good entity recognition, methods will mislabel searched text and will miss important information or identify spurious text that will frustrate users. Most tools do not capture non-contiguous entities which are separate spans of text that together refer to an entity, e.g., the entity "type 1 diabetes" in the phrase "type 1 and type 2 diabetes." This type is commonly found in biomedical texts, especially in lists, where multiple biomedical entities are named in shortened form to avoid repeating words. Most text annotation systems, that enable users to view and edit entity annotations, do not support non-contiguous entities. Therefore, experts cannot even visualize non-contiguous entities, let alone annotate them to build valuable datasets for machine learning methods. To combat this problem and as part of the BLAH6 hackathon, we extended the TextAE platform to allow visualization and annotation of non-contiguous entities. This enables users to add new subspans to existing entities by selecting additional text. We integrate this new functionality with TextAE's existing editing functionality to allow easy changes to entity annotation and editing of relation annotations involving non-contiguous entities, with importing and exporting to the PubAnnotation format. Finally, we roughly quantify the problem across the entire accessible biomedical literature to highlight that there are a substantial number of non-contiguous entities that appear in lists that would be missed by most text mining systems.

Investigation of Terminology Coverage in Radiology Reporting Templates and Free-text Reports

  • Hong, Yi;Zhang, Jin
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.5 no.1
    • /
    • pp.5-14
    • /
    • 2015
  • The Radiological Society of North America (RSNA) is improving reporting practices by developing an online library of clear and consistent report templates. To compare term occurrences in free-text radiology reports and RSNA reporting templates, the Wilcoxon signed-rank test method was applied to investigate how much of the content of conventional narrative reports is covered by the terms included in the RSNA reporting templates. The results show that the RSNA reporting templates cover most terms that appear in actual radiology reports. The Wilcoxon test may be helpful in evaluatingexisting templates and guiding the enhancement of reporting templates.

Method of The Interface Terminology Mapping based Free Text Medical Data (텍스트기반 임상데이터의 인터페이스 용어 매핑 방법)

  • Yoo, Done Sik;Bae, Inho
    • Journal of the Semiconductor & Display Technology
    • /
    • v.13 no.1
    • /
    • pp.97-99
    • /
    • 2014
  • Since 2010, issues for data sharing and data exchanging in hospital information systems have been emerged. In order to solve the issues, standards should be applied to develop the systems and there should be no ambiguities between terminologies in the systems. In this paper, the terminology mapping system for narrative clinical records was implemented. The term mapping precision was 83.4%. This system could help to upgrade the text based clinical system and it would be expected to support for high quality clinical services.

Reconstructive Trends in Post-Ablation Patients with Esophagus and Hypopharynx Defect

  • Ki, Sae Hwi;Choi, Jong Hwan;Sim, Seung Hyun
    • Archives of Craniofacial Surgery
    • /
    • v.16 no.3
    • /
    • pp.105-113
    • /
    • 2015
  • The main challenge in pharyngoesophageal reconstruction is the restoration of swallow and speech functions. The aim of this paper is to review the reconstructive options and associated complications for patients with head and neck cancer. A literature review was performed for pharynoesophagus reconstruction after ablative surgery of head and neck cancer for studies published between January 1980 to July 2015 and listed in the PubMed database. Search queries were made using a combination of 'esophagus' and 'free flap', 'microsurgical', or 'free tissue transfer'. The search query resulted in 123 studies, of which 33 studies were full text publications that met inclusion criteria. Further review into the reference of these 33 studies resulted in 15 additional studies to be included. The pharyngoesophagus reconstruction should be individualized for each patient and clinical context. Fasciocutaneous free flap and pedicled flap are effective for partial phayngoesophageal defect. Fasciocutaneous free flap and jejunal free flap are effective for circumferential defect. Pedicled flaps remain a safe option in the context of high surgical risk patients, presence of fistula. Among free flaps, anterolateral thigh free flap and jejunal free flap were associated with superior outcomes, when compared with radial forearm free flap. Speech function is reported to be better for the fasciocutaneous free flap than for the jejunal free flap.

Text Big Data Analysis and Summary for Free Semester Operational Plan Document (자유학기제 운영계획서에 대한 텍스트 빅데이터 분석 및 요약)

  • Lee, Suan;Park, Beomjun;Kim, Minkyu;Shin, Hye Sook;Kim, Jinho
    • The Journal of Korean Association of Computer Education
    • /
    • v.22 no.3
    • /
    • pp.135-146
    • /
    • 2019
  • Big data analysis is actively used for collecting and analyzing direct information on related topics in each field of society. Applying big data analysis technology in education field is increasingly interested in Korea, because applying this technology helps to identify the effectiveness of education methods and policies and applying them for policy formulation. In this paper, we propose our approach of utilizing big data analysis technology in education field. We focus on free semester program, one of the current core education policies, and we analyze the main points of interests and differences in the free semester through analysis and visualization of texts that are written on the operation reports prepared by each school. We compare regional differences in key characteristics and interests based on the free semester operation reports from middle schools particularly at Seoul and Gangwon-do regions. In conclusion, applying and utilizing big data analysis technology according to the needs and requirements of education field is a great significance.

The Android-based Bluetooth Device Application Design and Implementation (안드로이드 기반의 블루투스 디바이스 응용 설계 및 구현)

  • Cho, Hyo-Sung;Lee, Hyuk-Joon
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.11 no.1
    • /
    • pp.72-85
    • /
    • 2012
  • Today, although most bluetooth hands-free devices within a vehicle provide telephone service functions such as voice communication, caller id display and SMS message display and so on, they do not provide a function that displays Internet-based text data. We need to develop a scheme that displays the internet-based text data including existing hands-free function because the request for using the Internet service is increasing within a vehicle recently. The proposed bluetooth device application includes advanced function such as SNS message arrival notification, the message display function and we chose Android as the implementation mobile platform giving consideration to the fact that most SNS applications operate on Android and the platform is easily embedded into small embedded device. Smartphone or tablet PC connected with the proposed bluetooth device is an Android-based device and we designed a form of Android app for the function implementation of the devices. When the audio-text gateway app receives SNS text data, it extracts title and sender information from the message header information in a form of text data and sends them via ACL (Asynchronous Connection-Oriented) link to the bluetooth device showing the data on the screen. Android-based bluetooth devices are not possible to play voice through speaker because the bluetooth hands-free or headset profile ported within Android platform normally only includes audio gateway's function. The proposed bluetooth device application, therefore, applies the streaming scheme that sends data via ACL link instead of the way that sending them via SCO (Synchronous Connection-Oriented) link.

A Study on the Development of E-book Contents for Fashion Online Entrepreneurship Education (패션온라인창업 교육을 위한 전자책 콘텐츠 개발에 대한 연구)

  • Hwa-Yeon Jeong;Eun-Hee Hong
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.26 no.1
    • /
    • pp.33-44
    • /
    • 2024
  • This study developed e-book content in order to use e-books as a tool to provide more efficient classes to learners who are familiar with smart devices and online spaces. E-book contents were produced using Sigil-0.9.10. The development process is as follows. Before e-book development, it is necessary to prepare manuscript files, image files to be inserted, fonts to be used, and e-book covers. After inserting the book cover images, it is necessary to register the table of contents using the title tag and register the free fonts. Also, a style must be created for text or images used in the main text connected to a file containing the entire text. Then, after separating the entire text file into separate files according to each chapter, the text is completed in turn. E-books were produced focusing on hyperlink functions so that educational content and various example images could be accessed. Currently, there is a lack of research on e-books as textbooks in universities within the fashion design major. In the future, if e-book contents are developed according to the characteristics of courses and the level of learners, they can be used as effective teaching tools.