• Title/Summary/Keyword: OPEN API

Data Model Study for National Research Data Commons Service (국가연구데이터커먼즈 서비스를 위한 데이터모델 연구)

  • Cho, Minhee;Lee, Mikyoung;Song, Sa-kwang;Yim, Hyung-Jun
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.436-438
    • /
    • 2022
  • National Research Data Commons aims to maximize the use of research data by building a system in which the analysis resources used for data analysis, such as computing infrastructure, software, toolkits, APIs, and services, are arranged together with research data so that they can be used jointly. Systems for sharing and using publications and research data in the R&D process are well known. However, environments in which data can be shared and used together with tightly coupled software and computing infrastructure are scarce, and there is no management system for them. In this study, a data model is designed to systematically manage information on the digital research resources required in the data-oriented R&D process. The model will be used to register and manage digital research resource information in the National Research Data Commons Service.

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.49-67
    • /
    • 2015
  • The recent emergence of big data and social media has brought about a data "big bang." Social networking services are used by people around the world and have become a major communication tool for all ages. Over the last decade, as online social networking sites have become increasingly popular, companies have tended to focus on advanced social media analysis for their marketing strategies. Beyond such analysis, companies are mainly concerned about the propagation of negative opinions on social networking sites such as Facebook and Twitter, as well as on e-commerce sites. Online word of mouth (WOM), such as product ratings, product reviews, and product recommendations, is very influential, and negative opinions have a significant impact on product sales. This trend has drawn researchers' attention to natural language processing tasks such as sentiment analysis. Sentiment analysis, also referred to as opinion mining, is the process of identifying the polarity of subjective information, and it has been applied in various research and practical fields. However, Korean (Hangul) poses obstacles for natural language processing because it is an agglutinative language with rich morphology. As a result, Korean natural language processing resources such as sentiment lexicons are lacking, which significantly limits researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon using collective intelligence and provides an API (Application Programming Interface) service that opens and shares the sentiment lexicon data with the public (www.openhangul.com). For pre-processing, we created a Korean lexicon database of over 517,178 words and classified them into sentiment and non-sentiment words. To classify them, we first identified stop words, which are likely to play a negative role in sentiment analysis, and excluded them from sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, and adverbs, because they can carry positive, neutral, or negative sentiment. Non-sentiment words, on the other hand, are interjections, determiners, numerals, postpositions, and so on, because they generally carry no sentiment. To build a reliable sentiment lexicon, we adopted collective intelligence as a model for crowdsourcing. In addition, the concept of folksonomy was applied in the classification process to support collective intelligence. To compensate for an inherent weakness of folksonomy, we adopted majority rule by building a voting system. Participants, as voters, were offered three options (positive, negative, and neutral), and the voting was conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been cast by college students in Korea, and we keep the voting system open by maintaining the project as a perpetual study. Moreover, any change in a word's sentiment score can be an important observation because it allows us to track temporal changes in Korean as a natural language. Lastly, our study offers a RESTful, JSON-based API service through a web platform to make the lexicon easier to use for researchers, companies, and developers. Our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon serves as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using the Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining it with collective intelligence, and we expect it to give a new direction and a new start to the development of Korean natural language processing.
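
The majority-rule voting described in this abstract can be illustrated with a minimal sketch. The abstract does not say how ties or sparse ballots are handled, so the tie rule here (leave the word undecided), the function names, and the sample ballots are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter

# The three voting options named in the abstract.
LABELS = ("positive", "neutral", "negative")


def majority_label(votes):
    """Return the label with the most votes, or None on an empty ballot or a tie."""
    counts = Counter(v for v in votes if v in LABELS)
    if not counts:
        return None
    ranked = counts.most_common()
    # Assumption: a tie for first place leaves the word undecided.
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def build_lexicon(ballots):
    """Map each word to its majority label, skipping undecided words."""
    lexicon = {}
    for word, votes in ballots.items():
        label = majority_label(votes)
        if label is not None:
            lexicon[word] = label
    return lexicon


# Hypothetical ballots; real votes would come from the voting system.
ballots = {
    "좋다": ["positive", "positive", "neutral"],
    "나쁘다": ["negative", "negative", "negative"],
    "그냥": ["neutral", "positive"],  # tie, stays undecided
}
lexicon = build_lexicon(ballots)
```

Because the ballots stay open in a perpetual study, a lexicon built this way could simply be recomputed as new votes arrive, which is one way to observe the temporal changes the abstract mentions.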

Design of Web based Simulation Provenance Data Sharing Service (웹 기반 시뮬레이션 이력출처 데이터 공유 서비스 설계)

  • Jung, Youngjin;Nam, Dukyun;Yu, Jinseung;Lee, JongSuk Ruth;Cho, Kumwon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.5
    • /
    • pp.1128-1134
    • /
    • 2014
  • With the progress of computing technology and the spread of networks, web-based simulation services are actively used to computationally analyze various real-world phenomena. However, it is hard for users of these services to share data and information, because most web-based simulation services do not open or share simulation processing information and results. In this paper, we design a simulation provenance data sharing service on EDISON_CFD (EDucation-research Integration Simulation On the Net for Computational Fluid Dynamics) to share the calculated simulation performance information. To store and share the simulation processing information, we define the simulation processing steps as "Problem → Plan, Design → Mesh → Simulation performance → Result → Report." Users can learn how a problem is solved through computer simulation by searching the simulation performance information with the Search/Share API of the store. In addition, this opened simulation information can reduce the computing resources wasted on processing identical simulation jobs.
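
As a rough illustration of the six processing steps in this abstract, the sketch below stores one record per simulation job and searches it by keyword. The step names come from the abstract; the class, the in-order rule, the `search` helper, and the sample data are hypothetical and do not reflect the actual EDISON_CFD Search/Share API.

```python
from dataclasses import dataclass, field

# Step order as quoted in the abstract.
STEPS = ("Problem", "Plan, Design", "Mesh",
         "Simulation performance", "Result", "Report")


@dataclass
class ProvenanceRecord:
    """Provenance of one simulation job (hypothetical structure)."""
    job_id: str
    steps: dict = field(default_factory=dict)

    def record(self, step, info):
        """Store info for the next step; steps must arrive in STEPS order."""
        if len(self.steps) >= len(STEPS):
            raise ValueError("all steps already recorded")
        expected = STEPS[len(self.steps)]
        if step != expected:
            raise ValueError(f"expected step {expected!r}, got {step!r}")
        self.steps[step] = info


def search(records, keyword):
    """Return ids of jobs whose recorded step info mentions the keyword."""
    kw = keyword.lower()
    return [r.job_id for r in records
            if any(kw in str(v).lower() for v in r.steps.values())]


# Hypothetical example jobs.
job1 = ProvenanceRecord("job-1")
job1.record("Problem", "Flow around an airfoil")
job1.record("Plan, Design", "2D steady-state run")
job2 = ProvenanceRecord("job-2")
job2.record("Problem", "Pipe flow")
matches = search([job1, job2], "airfoil")
```

Searching shared records before submitting a job is one way identical simulations could be detected and skipped, in line with the resource saving the abstract describes.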

Design and Implementation of UCC Metadata Manager for Social Collaborative Service (소셜 협업 서비스를 위한 UCC 메타데이터 매니저 설계 및 구현)

  • Oh, Jung-Min;Song, Ju-Hong;Moon, Nam-Mee
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.48 no.3
    • /
    • pp.1-10
    • /
    • 2011
  • A social network service is defined as an online or communication service based on social relations among people, applying the concept of a social network. A social collaborative service, which is a kind of social network service, is characterized by the new value of modified or recreated content produced through a collaborative creation process among the members of a group. It has notable merits such as sharing and collaboration. At the same time, however, it has latent problems such as the reuse or copying of content that members are not allowed to use. UCC, a typical example of recreated or modified content, has been found to raise copyright issues at both the creation and publishing steps. So far, there are few appropriate methods other than CCL for resolving this matter. In this paper, we therefore define the problem and implement a UCC metadata manager that controls metadata reflecting the features of UCC. We derive reference metadata elements to identify the original content used in the re-creation process. We then define the R.Metadata Loader module based on the use case. Finally, the proposed UCC metadata manager provides information on referenced content and lets us identify the relationships between referenced contents. To implement the prototype, we use Kaltura, an open-source CMS, and obtain functional extensibility for the metadata manager by using an open API.

Characterization of Cellulase and Xylanase from Bacillus subtilis NC1 Isolated from Environmental Soil and Determination of Its Genes (Bacillus subtilis NC1 유래 cellulase와 xylanase의 특성 규명 및 효소 유전자의 규명)

  • Park, Chang-Su;Kang, Dae-Ook;Choi, Nack-Shick
    • Journal of Life Science
    • /
    • v.22 no.7
    • /
    • pp.912-919
    • /
    • 2012
  • A Bacillus sp. strain producing cellulase and xylanase was isolated from environmental soil on LB agar plates containing carboxymethylcellulose (CM-cellulose) and beechwood xylan, stained with trypan blue, as the respective substrates. Based on the 16S rRNA gene sequence and the API 50 CHL test, the strain was identified as B. subtilis and named B. subtilis NC1. The cellulase and xylanase from B. subtilis NC1 exhibited their highest activities with CM-cellulose and beechwood xylan as substrates, respectively, and both enzymes showed maximum activity at pH 5.0 and 50°C. We cloned and sequenced the genes for cellulase and xylanase from the genomic DNA of B. subtilis NC1 by the shotgun cloning method. The cloned cellulase and xylanase genes consisted of a 1,500 bp open reading frame (ORF) encoding a 499 amino acid protein with a calculated molecular mass of 55,251 Da and a 1,269 bp ORF encoding a 422 amino acid protein with a calculated molecular mass of 47,423 Da, respectively. The deduced amino acid sequences of the cellulase and xylanase genes showed high identity with glycosyl hydrolase (GH) families 5 and 30, respectively.

Highly Reliability Network Technology for Transmitting a Disaster Information (재해정보 전송을 위한 고신뢰성 네트워크 기술)

  • Kim, Kyung-Jun;Kim, Dongju;Jang, Dae-Jin;Oh, Eun-Ho;Kim, Jin-Man
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.3
    • /
    • pp.115-124
    • /
    • 2015
  • In this paper, we analyze previous QoS (Quality of Service) and QoE (Quality of Experience) methods and propose a highly reliable network system framework, together with its service forwarding method, that can provide seamless N-Screen services for disseminating disaster information. Measuring the service satisfaction of content consumers, i.e., QoE, is becoming an important factor in disseminating disaster information through N-Screen services, because previous multi-device N-Screen services focused only on information transmission. The proposed system is composed of a disaster information processing framework that accepts users' service requirements, push service modules that minimize the number of packets generated when carrying out the push service, and a push service controller that maximizes QoE measures. To provide a seamless N-Screen service on diverse screens, such as smartphones, PCs, and large screens, we also provide Open API (Application Programming Interface) functions. Based on these results, we expect to be able to evaluate QoS and QoE in the seamless N-Screen service.

Discovering Interdisciplinary Convergence Technologies Using Content Analysis Technique Based on Topic Modeling (토픽 모델링 기반 내용 분석을 통한 학제 간 융합기술 도출 방법)

  • Jeong, Do-Heon;Joo, Hwang-Soo
    • Journal of the Korean Society for Information Management
    • /
    • v.35 no.3
    • /
    • pp.77-100
    • /
    • 2018
  • The objective of this study is to present a process for discovering interdisciplinary convergence technologies using text mining of big data. For convergence research on biotechnology (BT) and information and communications technology (ICT), the following processes were performed: (1) collecting sufficient metadata of research articles based on a BT terminology list; (2) generating the intellectual structure of emerging technologies using a Pathfinder network scaling algorithm; and (3) analyzing contents with topic modeling. Three further steps were used to derive items of BT-ICT convergence technology: (4) expanding the BT terminology list into superordinate technology concepts to obtain ICT-related information from BT; (5) automatically collecting metadata of research articles in the two fields using an OpenAPI service; and (6) analyzing the contents of BT-ICT topic models. Our study yields the following findings. First, a terminology list can be an important knowledge base for discovering convergence technologies. Second, analyzing a large quantity of literature requires text mining, which facilitates the analysis by reducing the dimension of the data. The methodology we suggest for processing and analyzing data is efficient for discovering technologies with a high possibility of interdisciplinary convergence.

Artificial Intelligence Algorithms, Model-Based Social Data Collection and Content Exploration (소셜데이터 분석 및 인공지능 알고리즘 기반 범죄 수사 기법 연구)

  • An, Dong-Uk;Leem, Choon Seong
    • The Journal of Bigdata
    • /
    • v.4 no.2
    • /
    • pp.23-34
    • /
    • 2019
  • Recently, crimes that exploit digital platforms have been increasing continuously: about 140,000 cases occurred in 2015 and about 150,000 in 2016. Traditional investigation techniques are therefore considered limited in handling such online crimes. The manual online searches and cognitive investigation methods that investigators widely use today are not enough to cope proactively with rapidly changing civil crimes. In addition, the fact that content is posted to unspecified social media users makes investigations more difficult. Among web content collection methods, this study suggests site-based collection and the Open API, considering the characteristics of the online media where the infringement crimes occur. Because illegal content is published and deleted quickly, and new words and variants are generated rapidly and in many forms, it is difficult to recognize them quickly with morphological analysis based on manually registered dictionaries. To solve this problem, we propose applying a tokenizing method based on WPM (Word Piece Model) to the existing dictionary-based morphological analysis, as a data preprocessing method for quickly recognizing and responding to illegal content posted in online infringement crimes. In the data analysis, the optimal precision is verified through a vote-based ensemble method that uses supervised classification models for the investigation of illegal content. This study applies a sorting algorithm model centered on illegal multilevel business cases to proactively recognize crimes that harm the public economy, and presents an empirical study on dealing effectively with social data collection and content investigation.
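
The abstract names WPM (Word Piece Model) tokenization without giving details. As a hedged illustration, the sketch below shows the greedy longest-match-first segmentation commonly used by WordPiece-style tokenizers at inference time. The toy vocabulary, the `##` continuation prefix, and the `[UNK]` token are assumptions borrowed from that common convention, not from the paper.

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    """Split one word into subword pieces by greedy longest-match-first lookup."""
    tokens = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        # Try the longest remaining substring first, then shrink from the right.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # marks a word-internal piece
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            # No piece matches at this position: the whole word is unknown.
            return [unk]
        tokens.append(piece)
        start = end
    return tokens


# Hypothetical toy vocabulary; a real one is learned from a corpus.
vocab = {"un", "##aff", "##able", "afford"}
pieces = wordpiece_tokenize("unaffable", vocab)
```

Because unseen words are decomposed into known pieces rather than rejected, this kind of tokenizer does not depend on a manually registered dictionary entry for every new word or variant, which is the limitation the abstract raises.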

A Study on User's Requirement Analysis for Improvement of OASIS (한의학술논문검색시스템 기능개선을 위한 사용자 요구 분석에 관한 연구)

  • Han, Jeong-Min;Bae, Sun-Hee;Song, Mi-Young
    • Journal of Information Management
    • /
    • v.40 no.3
    • /
    • pp.79-97
    • /
    • 2009
  • With the recent development of many search engines and web technologies, semantic search technology has appeared, which goes beyond the previous keyword search services by giving relevant meaning to keywords. In step with these advances in search engines, OASIS, offered by KIOM, also needs to be enhanced. To this end, KIOM surveyed members who had used OASIS, examining demographic and sociological factors such as their position, status, and career, the convenience of OASIS, and the value of the papers offered in OASIS. The importance of each area of oriental medicine was also examined to set a new direction for improving OASIS. The user survey showed that OASIS should promptly introduce, beyond simple keyword search, both an automatic search system that can find the meaning of Chinese character-centered keywords and an authority system that can distinguish homonyms. We also concluded that it is necessary to link citation index information on references with the laboratory information of the agencies concerned, and to interconnect with major web sites around the world by using Open API. OASIS is the only domestic web site offering papers that cover oriental medicine. Therefore, if the requirements of oriental medical circles for the site are analyzed sufficiently and the problems of its information search system are addressed, OASIS is expected to play a critical role in the development of oriental medicine.

A Study on the Diffusion of Emergency Situation Information in Association with Beacon Positioning Technology and Administrative Address (Beacon 위치측위 기술과 행정주소를 연계한 재난재해 상황 전파 연구)

  • Mo, Eunsu;Lee, Jeakwang
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.5 no.9
    • /
    • pp.211-216
    • /
    • 2016
  • Casualties caused worldwide by earthquakes, floods, fires, and other disasters have been increasing, so many researchers are actively conducting technical studies to secure the golden time. In this paper, when a disaster occurs, IoT technologies are used to secure the golden time: the system first finds users in the accident area and transmits the message to them. When that job is finished, it gradually finds users in the surrounding areas and transmits the message to them. For national emergency information, the OPEN API of the Korea Meteorological Administration was used. To collect detailed information on a relevant area in real time, this study established a system that integrates crowd sensing technology with BLE (Bluetooth Low Energy) beacon technology. Until now, base station-based CBS has been used. In this study, however, a DB was designed and mapped to integrate beacon-based user positioning with the national administrative address system in order to estimate local users. In the experiment, the accuracy and speed of the information diffusion algorithm were measured as the number of users increased in steps of one thousand. Finally, it was confirmed that the notification message was sent in less than one second with 20,000 users.
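
The "accident area first, then surrounding areas" ordering in this abstract can be sketched as grouping users into notification waves. The paper maps beacon positions to administrative addresses; this sketch swaps in plain planar coordinates and fixed-width distance rings as a simplifying assumption, and the function name, ring width, and sample users are hypothetical.

```python
import math


def notification_waves(users, center, ring_m=100.0):
    """Group user ids into waves by distance ring from center, nearest ring first."""
    waves = {}
    for uid, (x, y) in users.items():
        # Ring 0 is the accident area; higher rings are progressively farther out.
        ring = int(math.hypot(x - center[0], y - center[1]) // ring_m)
        waves.setdefault(ring, []).append(uid)
    return [sorted(waves[r]) for r in sorted(waves)]


# Hypothetical user positions in meters relative to an arbitrary origin.
users = {
    "a": (10.0, 0.0),
    "b": (150.0, 0.0),
    "c": (50.0, 50.0),
    "d": (0.0, 250.0),
}
waves = notification_waves(users, center=(0.0, 0.0))
```

Each inner list would be messaged before the next one starts, so users closest to the incident are notified first.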