• Title/Summary/Keyword: parsing

Search Result 507, Processing Time 0.028 seconds

Chunking Korean and an Application (한국어 낱말 묶기와 그 응용)

  • Un Koaunghi;Hong Jungha;You Seok-Hoon;Lee Kiyong;Choe Jae-Woong
    • Language and Information
    • /
    • v.9 no.2
    • /
    • pp.49-68
    • /
    • 2005
  • Application of chunking to English and some other European languages has shown that it is a viable parsing mechanism for natural languages. Although a small number of attempts have been made to apply chunking to the analysis of the Korean language, it still is not clear enough what criteria there are to identify appropriate units of chunking, and how efficient and valid the chunking algorithms would be when applied to some authentic Korean texts. The purpose of this research is to provide an alternative set of algorithms for chunking Korean, and to implement them, and to test them against some English-Korean parallel corpora, which is English and Korean bibles matched sentence by sentence. It is shown in the paper that aligning related texts and identifying matched phrases between the two languages can be achieved through appropriate chunking and matching algorithms defined on the morphologically-tagged parallel corpus. Chunking and matching processes are based on the content words rather than the function words, and the matching itself is done in terms of the transfer dictionary. The implementation is done in C and XML, and can be accessed through the Internet.

  • PDF

A Study on multi-translation system for e-business collaboration (e-비즈니스 협업에 적합한 다중변환 시스템 연구)

  • Ahn, Kyeong-Rim;Chung, Jin-Wook
    • Journal of Internet Computing and Services
    • /
    • v.7 no.6
    • /
    • pp.123-130
    • /
    • 2006
  • The transaction was happened within a single business entity or a single marketplace at the stage of e-business. It becomes to grow to complex form. Expecially, the need for business collaboration between business entities or marketplaces has being on the rise as the core topic. The format translation between documents is very important factor according to various the exchanged document formats. In this paper, we define ebXML as the basic format of exchanged document according to object-oriented business transaction. Also we design the multi-format translation system to support the translation of various document formats. The proposed system in this paper, is designed with model-driven method and it is possible to construct with various structure as for system environment. The proposed translation system is designed to use the proposed system as adding the corresponding parsing module even though any format of document. Also, we increase the reusability of data as using the common data set. In this paper, we prove the superiority of the proposed system to compare the performance with the legacy system for various format translation.

  • PDF

A Study on Processing XML Documents (XML 문서 처리에 관한 연구)

  • Kim, Tae Gwon
    • Journal of KIISE
    • /
    • v.43 no.4
    • /
    • pp.489-496
    • /
    • 2016
  • XML can effectively express structured or semi-structured data as well as relational databases. XQuery is a query language for retrieving information for such an XML document. In this paper, an XQuery composer is designed and implemented, with an API provided for XQuery processors, and a proper processor is registered. This composer shows query results immediately processed by the processor. As this composer contains a parser for XQuery, it can compose XQuery effectively using a diverse dialog box designed for XQuery grammar. A dialog box is affiliated with a clause region, which is a region that algebra operates from the parsing tree. It can compose path expressions for an XML document easily as it shows an element tree from DTD graphically. Path expressions are composed automatically by marking elements in the structural hierarchy and by specifying the predicate of an element partially.

The Gaurantee of Real-Time Vital Sign Information Service Message of Patient Monitoring System in Distributed Network Systems (분산 네트워크 시스템에서 환자 모니터링 시스템의 실시간 생체정보 서비스 메시지 보장)

  • Lim, Se-Jung;Kim, Gwang-Jun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.4 no.2
    • /
    • pp.162-167
    • /
    • 2009
  • In this paper, we present a patient real-time vital sign information transmission system to effectively support developing real-time communication service by using a real-time object model named TMO (Time-Triggered Message-Triggered Object). Also, we describes the application environment as the PMS(Patient Monitoing System) to guarantee real-time service message with TMO structure in distributed network systems. We have to design to obtain useful vital sign information, which is generated at parsing data receiver modulor of HIS with TMO structure, that is offered by the central monitor of PMS. Vital sign informations of central monitor is composed of the raw data of several bedsite patient monitors. We are willing to maintain vital sign information of real time and continuity that is generated from the bedsite patient monitor. In the real time simulation techniques based on TMO object modeling, we have observed several advantages to the TMO structuring scheme. TMO object modeling has a strong traceability between requirement specification and design.

  • PDF

Design and Implementation of Real-Time Monitoring System for PLGR Work (PLGR 작업을 위한 실시간 모니터링 시스템의 설계 및 구현)

  • Lee, Tae-Oh;Jeong, Seong-Hoon;Yim, Jae-Hong
    • Journal of Navigation and Port Research
    • /
    • v.27 no.1
    • /
    • pp.87-95
    • /
    • 2003
  • Submarine optical fiber cable construction consists of marine survey, PLGR(Pre Lay Grapnel Run), shore-end-work, laying the submarine optical cable. This PLGR is work to ease the cable lay safely in seabed, improve the performance of Plough and ROV (Remotely-Operated Vehicle) laying work, and protect laying equipment. This paper presents the design and implementation of real-time monitoring system for PLGR work in submarine optical fiber cable construction enterprise. In this paper, we designe overall real-time monitoring system. For this purpose, the modules such as serial multiport communication module, real-time processing module, environment configuration module, real-time graph and a printout modules are designed and implemented. For the validity evaluation of this paper, serial multi port communication module, data parsing, realtime graph output are implemented and tested.

A Study on Smartwatch review data of SNS and sentiment analytical using opinion mining (스마트워치 SNS 리뷰 데이터와 오피니언 마이닝을 통한 감성 분석 처리에 대한 연구)

  • Shin, Donghyun;Choi, YongLak
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.10a
    • /
    • pp.1047-1050
    • /
    • 2015
  • Wearable device, along with IoT(Internet of Things), is considered the core of upcoming generation's convergence technology. Companies are intensely competing one another for prior occupation in the smartwatch market. Consumers that use smartwatch express their preferences by sharing their opinions through SNS(Social Networking Service). Through this study, emotions dictionary is built, which consists of attributes and emotional words related to smartwatch. Based on the emotions dictionary, SNS data has been categorized according to the attributes through opinion data model. Afterwards, overall polarity and attribute polarity of collected data are distinguished through natural language parsing, followed by an analysis of smartwatch reviews. This study will contribute to determination of which attributes of smartwatch to be improved, to arise consumer's interest for individual smartwatch.

  • PDF

An Area-efficient Design of SHA-256 Hash Processor for IoT Security (IoT 보안을 위한 SHA-256 해시 프로세서의 면적 효율적인 설계)

  • Lee, Sang-Hyun;Shin, Kyung-Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.1
    • /
    • pp.109-116
    • /
    • 2018
  • This paper describes an area-efficient design of SHA-256 hash function that is widely used in various security protocols including digital signature, authentication code, key generation. The SHA-256 hash processor includes a padder block for padding and parsing input message, so that it can operate without software for preprocessing. Round function was designed with a 16-bit data-path that processed 64 round computations in 128 clock cycles, resulting in an optimized area per throughput (APT) performance as well as small area implementation. The SHA-256 hash processor was verified by FPGA implementation using Virtex5 device, and it was estimated that the throughput was 337 Mbps at maximum clock frequency of 116 MHz. The synthesis for ASIC implementation using a $0.18-{\mu}m$ CMOS cell library shows that it has 13,251 gate equivalents (GEs) and it can operate up to 200 MHz clock frequency.

A General-Purpose Service Information Processing System for Integrated Data Broadcasting Environment (통합 데이터 방송 환경을 위한 범용 서비스 인포메이션 처리 시스템)

  • Jeon, Je-Min;Choi, Hyeon-Seok;Kim, Jung-Sun
    • The KIPS Transactions:PartC
    • /
    • v.16C no.1
    • /
    • pp.101-108
    • /
    • 2009
  • The data broadcasting service, which is growing remarkably today, provides viewers with useful information as well as high quality video and audio. Service information is a kind of additional data that contains a wide range of information such as channel list and/or program title. Each service information is transmitted in the form of a table. And most standard committees have specified their own table list used for carrying the service information. Consequently, It causes incompatibility among services that each broadcast operators produce because the tables that they use differ from each other. In this paper, we propose a general-purpose service information processing system for an integrated data broadcasting middleware that is compatible with heterogenous broadcasting environments. The system is able to change its target table list dynamically without any code modification. Futhermore, we also adopted a thread pool model for efficient parsing and event dispatching.

Smart Outlet System for Single-person Household based on IoT (Internet of Things) (사물 인터넷 기반의 1인 가구를 위한 스마트 콘센트 시스템)

  • Kim, Hye-Suk;Park, Byeong-Ju;Cho, Young-Ju
    • Journal of Digital Contents Society
    • /
    • v.18 no.5
    • /
    • pp.895-904
    • /
    • 2017
  • In this paper, we propose an IoT based smart outlet system for one-person household with electric safety, electric energy consumption management and transmission of messages to the reserved contacts by connecting the emergency network. The proposed smart outlet system is implemented by parsing packet data and protocol between server, module, application and WiFi AP receiver. The WiFi AP built for communication can be used not only for the outlet but also for the gas barrier, the door lock and the like. In addition, the proposed method can provide the remote monitoring function by receiving the amount of power of the receptacle received through the AC current sensor (SCT-013) in real time. The smart outlet system is expected to be capable of automatically shutting off the power in case of emergency through automatic power use analysis in addition to the standby power cutoff function.

Web Data Collection and Utilization using Content Syndication (콘텐츠 신디케이션을 이용한 웹 데이터 수집 및 활용)

  • Hwang, Sanghyun;Kim, Heewan
    • Journal of Service Research and Studies
    • /
    • v.5 no.2
    • /
    • pp.83-92
    • /
    • 2015
  • Many data on the web are present, put out by processing in the content in order to provide services by collecting the necessary data is not easy. One of the reasons is because there is no way to provide a standardized data. Therefore, it can be seen as a part or all of the contents of the site, the content distribution to be available for other services is very important. A syndication format that allows you to use a representative of some or all of the site's content for other services such as RSS and there are Atom, OPML-based XML. Throughout the links provided in this syndication format is called feed address. With a feed address to collect data faster than the conventional HTML parsing and data provider is the advantage of being able to easily provide the data to the outside. In this study, we feed the data obtained by collecting by implementing the web address based on the data acquisition system to propose a method for processing and utilizing the data as a background.