• Title/Summary/Keyword: Cluster-based Language Models

Search Result 4, Processing Time 0.018 seconds

Language Modeling Approaches to Information Retrieval

  • Banerjee, Protima;Han, Hyo-Il
    • Journal of Computing Science and Engineering
    • /
    • v.3 no.3
    • /
    • pp.143-164
    • /
    • 2009
  • This article surveys recent research in the area of language modeling (sometimes called statistical language modeling) approaches to information retrieval. Language modeling is a formal probabilistic retrieval framework with roots in speech recognition and natural language processing. The underlying assumption of language modeling is that human language generation is a random process; the goal is to model that process via a generative statistical model. In this article, we discuss current research in the application of language modeling to information retrieval, the role of semantics in the language modeling framework, cluster-based language models, use of language modeling for XML retrieval and future trends.

Ratio Estimation of Indirect Cost Sector about Defense Companies by Statistic Technique (통계 기법에 의한 방산업체의 간접원가부문 비율 추정)

  • Lim, Hyeoncheol;Kim, Suhwan
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.40 no.4
    • /
    • pp.246-252
    • /
    • 2017
  • In the defense acquisition, a company's goal is to maximize profits, and the government's goal is to allocate budgets efficiently. Each year, the government estimates the ratio of indirect cost sector to defense companies, and estimates the ratio to be applied when calculating cost of the defense articles next year. The defense industry environment is changing rapidly, due to the increasing trend of defense acquisition budgets, the advancement of weapon systems, the effects of the 4th industrial revolution, and so on. As a result, the cost structure of defense companies is being diversifying. The purpose of this study is to find an alternative that can enhance the rationality of the current methodology for estimating the ratio of indirect cost sector of defense companies. To do this, we conducted data analysis using the R language on the cost data of defense companies over the past six years in the Defense Integrated Cost System. First, cluster analysis was conducted on the cost characteristics of defense companies. Then, we conducted a regression analysis of the relationship between direct and indirect costs for each cluster to see how much it reflects the cost structure of defense companies in direct labor cost-based indirect cost rate estimates. Lastly a new ratio prediction model based on regularized regression analysis was developed, applied to each cluster, and analyzed to compare performance with existing prediction models. According to the results of the study, it is necessary to estimate the indirect cost ratio based on the cost character group of defense companies, and the direct labor cost based indirect cost ratio estimation partially reflects the cost structure of defense companies. In addition, the current indirect cost ratio prediction method has a larger error than the new model.

A Research on the Paradigm of Interaction Based on Attributes (인터렉션 속성에 기초한 인터렉션 범식화 연구)

  • Shan, Shu Ya;Pan, Young Hwan
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.5
    • /
    • pp.127-138
    • /
    • 2021
  • The aim of this study is to demonstrate interaction as a describable field and tries to understand interaction from the perspective of attributes, thus building a theoretical to help interactive designer understand this field by common rule, rather than waste huge time and labor cost on iteration. Since the concept of interaction language has been brought out in 2000, there are varies of related academical studies, but all with defect such as proposed theoretical models are built on a non-uniform scale, or the analyzing perspective are mainly based on researcher's personal experience and being too unobjective. The value of this study is the clustered resource of research which mainly based on academical review. It collected 21 papers researched on interaction paradigm or interaction attributes published since 2000, extracting 19 interaction attribute models which contains 174 interaction attributes. Furthermore, these 174 attributes were re-clustered based on a more unified standard scale, and the two theoretical models summarized from it are respectively focuses on interaction control and interaction experience, both of which covered 6 independent attributes. The propose of this theoretical models and the analyzation of the cluster static will contribute on further revealing of the importance of interaction attribute, or the attention interaction attribute has been paid on. Also, in this regard, the interactive designer could reasonably allocate their energy during design process, and the future potential on various direction of interaction design could be discussed.

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.