• 제목/요약/키워드: Statistics & Information DB

Search Result 53, Processing Time 0.025 seconds

Bayesian Analysis for Neural Network Models

  • Chung, Younshik;Jung, Jinhyouk;Kim, Chansoo
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.155-166
    • /
    • 2002
  • Neural networks have been studied as a popular tool for classification and they are very flexible. Also, they are used for many applications of pattern classification and pattern recognition. This paper focuses on Bayesian approach to feed-forward neural networks with single hidden layer of units with logistic activation. In this model, we are interested in deciding the number of nodes of neural network model with p input units, one hidden layer with m hidden nodes and one output unit in Bayesian setup for fixed m. Here, we use the latent variable into the prior of the coefficient regression, and we introduce the 'sequential step' which is based on the idea of the data augmentation by Tanner and Wong(1787). The MCMC method(Gibbs sampler and Metropolish algorithm) can be used to overcome the complicated Bayesian computation. Finally, a proposed method is applied to a simulated data.

Analysis of Impact Between Data Analysis Performance and Database

  • Kyoungju Min;Jeongyun Cho;Manho Jung;Hyangbae Lee
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.3
    • /
    • pp.244-251
    • /
    • 2023
  • Engineering or humanities data are stored in databases and are often used for search services. While the latest deep-learning technologies, such like BART and BERT, are utilized for data analysis, humanities data still rely on traditional databases. Representative analysis methods include n-gram and lexical statistical extraction. However, when using a database, performance limitation is often imposed on the result calculations. This study presents an experimental process using MariaDB on a PC, which is easily accessible in a laboratory, to analyze the impact of the database on data analysis performance. The findings highlight the fact that the database becomes a bottleneck when analyzing large-scale text data, particularly over hundreds of thousands of records. To address this issue, a method was proposed to provide real-time humanities data analysis web services by leveraging the open source database, with a focus on the Seungjeongwon-Ilgy, one of the largest datasets in the humanities fields.

Study on Improving Oriental Medicine Statistical System for Multidimensional Statistical Data

  • Yea, Sang-Jun;Kim, Chul;Kim, Jin-Hyun;Jang, Hyun-Chul;Kim, Sang-Kyun;Song, Mi-Young
    • International Journal of Contents
    • /
    • v.7 no.3
    • /
    • pp.13-18
    • /
    • 2011
  • Oriental medicine statistics are essential in research planning, research evaluation, and policy decision based on objective data. However, integrated administration of such statistics is not presently possible in the oriental medicine field, which has been slow in incorporating information communication technology. In an effort to address this problem, the Korea Institute of Oriental Medicine (KIOM) developed an oriental medicine statistical system in 2009, and the system has been offered in the traditional medicine information portal of OASIS. However, according to a 2010 survey targeting OASIS users, those surveys reported that needs for a system where various statistical data can be extracted via an interactive approach to multidimensional data. As a result of an analysis of the functions of the existing system, it was found that it is necessary to array and arithmetically analyze Stats Value, Drill Up & Drill Down, and Pivot. To this end, the existing DB schema should be redesigned. Based on our analysis result, we redesigned the database into a structure that is applicable to the reverse pivot algorithm. We used J2EE/JSP and a Flex framework to design and develop an oriental medicine statistical system that can provide multidimensional statistical data. Considering that the improved oriental medicine statistical system is planned to be offered by OASIS of KIOM, utilization and value of oriental medicine statistical data are expected to be enhanced.

Integration of Internet Shopping Mall and Auction and E-mail marketing by Statistics of Database (쇼핑몰과 경매의 통합 및 DB 통계에 의한 E-mail 마케팅 구현)

  • Park, Hae-Ran;Kim, Hyo-Rim;Lee, Sung-Yong;Choi, Young-Bok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10b
    • /
    • pp.1489-1492
    • /
    • 2000
  • 요즘 전자상거래의 활성화로 전 세계적으로 인터넷 쇼핑몰과 경매 사이트를 운영하는 곳이 많이 있다. 하지만 사용하던 물건을 팔고 새로운 상품을 구매하려고 한다면 일반적으로 중고 물건을 파는 사이트를 찾아서 그곳에서 물건을 팔고 다시 다른 인터넷 쇼핑몰에서 물건을 사야하는 번거러움이 있다. 그리고 쇼핑몰 사이트의 관리자 입장에서는 판매부진상품이나 이원상품 등을 관리하기 어렵다. 또 기존에 구축되어 있는 맡은 쇼핑몰과 경매사이트의 데이터베이스의 활용도를 보면 저장된 상품을 보여주고. 판매가 되면 삭제되는 역할에 국한된 경우가 많다. 본 논문에서는 전자상거래의 사용자가 인터넷을 보다 간편하게 이용하고 사용자가 등록한 중고물품, 쇼핑몰의 판매부진상품, 이월상품의 경매로 인한 구매자의 참여를 위해 쇼핑몰과 경매 사이트를 통합하여 운영하고, 지금까지의 공통적이고 일반적인 내용의 E-mail 마케팅을 데이터 베이스 통계분석에 의해 차별화 되고 집중적인 E-mail 마케팅으로 구현한다.

  • PDF

A Study on Image Recommendation System based on Speech Emotion Information

  • Kim, Tae Yeun;Bae, Sang Hyun
    • Journal of Integrative Natural Science
    • /
    • v.11 no.3
    • /
    • pp.131-138
    • /
    • 2018
  • In this paper, we have implemented speeches that utilized the emotion information of the user's speech and image matching and recommendation system. To classify the user's emotional information of speech, the emotional information of speech about the user's speech is extracted and classified using the PLP algorithm. After classification, an emotional DB of speech is constructed. Moreover, emotional color and emotional vocabulary through factor analysis are matched to one space in order to classify emotional information of image. And a standardized image recommendation system based on the matching of each keyword with the BM-GA algorithm for the data of the emotional information of speech and emotional information of image according to the more appropriate emotional information of speech of the user. As a result of the performance evaluation, recognition rate of standardized vocabulary in four stages according to speech was 80.48% on average and system user satisfaction was 82.4%. Therefore, it is expected that the classification of images according to the user's speech information will be helpful for the study of emotional exchange between the user and the computer.

Medical CRM Frame Design for Medical Institution (의료기관 전문 의료용 CRM 프레임 설계)

  • Kim, Gui-Jung
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.12
    • /
    • pp.20-27
    • /
    • 2008
  • Hospitals today use independent systems for each department and job such as Hospital Information Sytem(HIS), Picture Archiving Communications System(PACS), Ordering Communication System(OCS), Electronic Medical Record(EMR), Enterprise Resource Planning(ERP), etc and each system employs its own DB. So, it is impossible to integrate information within the institution and difficult to keep transparency and consistency of data. I in this study offered a data integration environment through flexible management linked with other systems, and by doing that, designed a medical CRM frame which offers the optimum service the customer wants at the optimum time. I designed 4 of medical CRM frame: customer relationship management, public relations/marketing, service management, and statistics/analysis by the customer relationship management process standardization and aimed to offer tailored mobile contents according to customer's characters and health situation on the basis of customer's data by securing mobile medical contents for personalized medical information service.

Development for establishing Big Data-based alley commercial area (빅데이터 기반 골목상권 영역설정 방법론 개발)

  • Hwang, Dong-Hyun;Ko, Kyeong-Seok;Park, Sang-June;Kim, Wan-Su
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.11 no.6
    • /
    • pp.784-792
    • /
    • 2018
  • In this study, we designed the area except the development market and the traditional market, where large scale shops were concentrated by realizing the real estate center of the alley commercial area. In addition, we have developed an area setting method for the alley area where reliability and rationality can be ensured by utilizing the actual data such as the business statistics, the survey data of the business, and the store business DB, which are managed by the local government or the state. The alley commercial areas were classified into five groups according to density. It is thought that users can distinguish the commercial areas from dense commercial areas to the commercial areas in order to utilize various commercial areas.

A Study on the Construction of the Database Structure for the Korea In-depth Accident Study (한국형 교통사고 심층조사 DB 체계 구축에 대한 연구)

  • Kim, Siwoo;Lee, Jaewan;Youn, Younghan
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.22 no.2
    • /
    • pp.29-36
    • /
    • 2014
  • The accident statistics use the data from police accident reports and statistics. Although the official accident statistics are useful, they provide very limited information about how accidents occur, the cause of the accident and the injury mechanisms. This limitations could be overcome by carrying out the in-depth accident study and analysing investigations, collecting more detailed information. Meanwhile a net of in-depth investigation teams are operating worldwide, such as NASS (National Accident Sampling System) and CIREN (Crash Injury Research and Engineering Network) in US, OTS (On the spot investigation) in UK. In this study, the database structure and variables of Korea in-depth accidents investigation system would be proposed through considering the database structure of GIDAS (Germany In-Depth Accidents Study). GIDAS is one of the best system on the in-depth accident study system in the world. GIDAS was established in 1999 as a cooperation project between Federal Highway Research Institute of Germany (BASt) and research association on automotive engineering of German Car Industry(FAT). The iGLAD (Initiative for the Global Harmonization of Accident Data) was also considered to introduce into the database variables of Korea in-depth accident study. Current police reports were compared with GIDAS and iGLAD. To collaborate of the Worldwide in-depth accident data, this paper proposed that the database of Korea in-depth accident study would be introduced the structure of GIDAS and the core variables of iGLAD. Harmonization of the structures and core variables of Korea in-depth accident study will be better than the making unique ones. The database structure and core variables of KIDAS(Korea In-Depth Accident Study) introduced of GIDAS and iGLAD.

Analyzing Site Characteristics and Suitability for Wind Farm Facilities in Forest Lands (산지 내 풍력발전단지 입지 특성 및 적합성 분석)

  • Kwon, Soon-Duk;Joo, Woo-Yeong;Kim, Won-Kyung;Kim, Jong-Ho;Kim, Eun-Hee
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.17 no.4
    • /
    • pp.86-100
    • /
    • 2014
  • The purposes of this study are to provide a guideline for the suitability of wind farm facilities in forest lands and to suggest improvement plans of policies and systems to minimize the damage of forest lands. First, we implemented a literature review and field surveys to examine and select factors for the suitability of wind farm facilities in forest lands. Spatial database for selected location factors of wind farm facilities in forest lands was constructed to develop the suitability model for locating wind farm facilities focusing on Gangwon-do. Data used in this study include wind power resource, legal mountainous preserved area, forest roads, developed areas, forest class, and other spatial data. In order to find specific-sized potential areas for a certain number of wind farm turbines, we used block statistics and focal statistics methods. As a result, the areas for potential wind farm locations were 1,261ha from a block statistics method and 1,411ha from a focal statistics method. Based on the outputs of this research, it is required to make an urgent solution for the prevention of forest disaster and to prepare reduction measures for the destruction of ridge landscape.

Development of Web-Based Supporting Tool (VESTAP) for Climate Change Vulnerability Assesment in Lower and Municipal-Level Local Governments (기초 및 광역지자체 기후변화 취약성 평가를 위한 웹기반 지원 도구(VESTAP) 개발)

  • OH, Kwan-Young;LEE, Moung-Jin;HAN, Do-Eun
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.1
    • /
    • pp.1-11
    • /
    • 2016
  • Climate change is the issue that attracts the most attention in the field of environment, as well as the most challenging task faced by the human race. There are various ways to resolve this issue. South Korea has established the primary and secondary national climate change adaptation plans at the national level, and is making it compulsory for each local government (lower and municipal-level) to establish climate change adaptation plans. Climate change vulnerability assessment plays an essential role in establishing climate change adaptation action plans. However, vulnerability assessment has a difficulty performing individual assessments since the results are produced through complex calculations of multiple impact factors. Accordingly, this study developed a web-based supporting tool(VESTAP) for climate change vulnerability assesment that can be used by lower and municipal-level local governments. The VESTAP consists of impact DB and vulnerability assessment and display tool. The index DB includes total 455 impacts of future climate data simulated with RCP (Representative Concentration Pathways) 4.5 and 8.5, atmospheric environment data, other humanities and social statistics, and metadata. The display tool has maximized convenience by providing various analytical functions such as spatial distribution, bias and schematization of each vulnerability assessment result. A pilot test of health vulnerability assessment by particulate matters in Sejong Metropolitan Autonomous City was performed using the VESTAP, and Bukang-myeon showed the highest vulnerability. By using the developed tool, each local government is expected to be able to establish climate change adaptation action plans more easily and conveniently based on scientific evidence.