• Title/Abstract/Keywords: Re-Rank Model

Search results: 12 items, processing time 0.021 seconds

Performance Improvement Methods of a Spoken Chatting System Using SVM

  • 안혁주;이성희;송영길;김학수
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • Vol. 4, No. 6
    • /
    • pp.261-268
    • /
    • 2015
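  • In a spoken chatting system, a user's spoken query is converted into a text query by an automatic speech recognizer (ASR). If the top-ranked ASR result is wrong, the error propagates directly into the chatting system. To improve the top-1 precision of the ASR, this paper proposes a post-processing model that uses RankSVM to re-rank the n-best ASR results. Training a chatting system also requires a large volume of chatting sentences, and if new sentences are not added to the training data frequently, the system's responses quickly become stale. To address this, the paper proposes a data collection model that uses an SVM to automatically select chatting sentences from TV and movie scripts. In experiments, the proposed post-processing model outperformed the model without post-processing by 4.4% in precision and 6.4% in recall. The proposed data collection model achieved a high precision of 98.95% and a recall of 57.14%.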
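
The re-ranking step described in this abstract can be illustrated with a minimal sketch. It is not the authors' model: the two features per hypothesis (an ASR confidence and a language-model score) are hypothetical, and a plain subgradient loop on the pairwise hinge loss stands in for a full RankSVM solver.

```python
import random

def pairwise_differences(groups):
    """Build preference pairs: feature difference (better minus worse)."""
    pairs = []
    for candidates in groups:  # candidates: list of (features, relevance)
        for xi, yi in candidates:
            for xj, yj in candidates:
                if yi > yj:
                    pairs.append([a - b for a, b in zip(xi, xj)])
    return pairs

def train_rank_svm(groups, epochs=200, lr=0.05, reg=0.01, seed=0):
    """Minimize L2-regularized pairwise hinge loss by stochastic subgradient steps."""
    rng = random.Random(seed)
    pairs = pairwise_differences(groups)
    w = [0.0] * len(pairs[0])
    for _ in range(epochs):
        rng.shuffle(pairs)
        for d in pairs:
            margin = sum(wi * di for wi, di in zip(w, d))
            w = [wi * (1.0 - lr * reg) for wi in w]  # regularization shrinkage
            if margin < 1.0:  # pair violates the margin: push w toward d
                w = [wi + lr * di for wi, di in zip(w, d)]
    return w

def rerank(w, hypotheses):
    """Sort (text, features) hypotheses by descending learned score."""
    return sorted(hypotheses, key=lambda h: -sum(wi * xi for wi, xi in zip(w, h[1])))

# Toy n-best lists; features are [asr_confidence, lm_score] (assumed, not from the paper).
training_groups = [
    [([0.9, 0.2], 0), ([0.7, 0.8], 1), ([0.5, 0.1], 0)],
    [([0.8, 0.3], 0), ([0.6, 0.9], 1)],
    [([0.9, 0.9], 1), ([0.8, 0.2], 0)],
]
weights = train_rank_svm(training_groups)
n_best = [("recognize speech", [0.9, 0.1]), ("wreck a nice beach", [0.6, 0.9])]
reranked = rerank(weights, n_best)
```

In this toy data the correct hypothesis always has the higher second feature, so the learned weights favor it even when the recognizer's own confidence ranks it lower.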

Color Space Exploration and Fusion for Person Re-identification

  • 남영호;김민기
    • 한국멀티미디어학회논문지
    • /
    • Vol. 19, No. 10
    • /
    • pp.1782-1791
    • /
    • 2016
  • Various color spaces such as RGB, HSV, and log-chromaticity have been used in the field of person re-identification. However, few studies have examined which color space is suitable for re-identification. This paper reviews the color invariance of color spaces under the diagonal model and explores the suitability of each color space for person re-identification. It also proposes a person re-identification method based on a histogram refinement technique and several fusion strategies for color spaces. Two public datasets (ALOI and ImageLab) were used to test color space suitability, and the ImageLab dataset was used to evaluate the feasibility of the proposed method. Experimental results show that RGB and HSV are more suitable for the re-identification problem than other color spaces such as normalized RGB and log-chromaticity. The cumulative recognition rates up to the third rank under RGB and HSV were 79.3% and 83.6%, respectively. Furthermore, the fusion strategy using the max score improved performance by 16% or more. These results show that the proposed method is more effective than some other methods that use a single color space in person re-identification.
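
As a rough illustration of max-score fusion and the cumulative recognition rate mentioned in this abstract, the sketch below matches toy 4-bin histograms by histogram intersection. The histograms, the choice of intersection as the similarity, and all function names are assumptions for illustration; the paper's histogram refinement step is not reproduced.

```python
def histogram_intersection(h1, h2):
    """Similarity of two normalized histograms: sum of bin-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))

def fuse_max(scores_by_space):
    """Combine {space: {gallery_id: score}} by keeping each id's best score."""
    ids = set()
    for scores in scores_by_space.values():
        ids.update(scores)
    return {g: max(s.get(g, 0.0) for s in scores_by_space.values()) for g in ids}

def rank_of(true_id, fused):
    """1-based rank of the true identity; ties are broken by id for determinism."""
    ordered = sorted(fused, key=lambda g: (-fused[g], g))
    return ordered.index(true_id) + 1

def cumulative_rate(ranks, k):
    """Fraction of probes whose true match appears within the top k."""
    return sum(r <= k for r in ranks) / len(ranks)

# Toy probe and gallery histograms in two color spaces (made-up values).
probe = {"rgb": [0.5, 0.3, 0.1, 0.1], "hsv": [0.1, 0.2, 0.3, 0.4]}
gallery = {
    "p1": {"rgb": [0.4, 0.3, 0.2, 0.1], "hsv": [0.1, 0.2, 0.3, 0.4]},
    "p2": {"rgb": [0.3, 0.3, 0.2, 0.2], "hsv": [0.4, 0.3, 0.2, 0.1]},
    "p3": {"rgb": [0.1, 0.1, 0.3, 0.5], "hsv": [0.25, 0.25, 0.25, 0.25]},
}
scores = {
    space: {g: histogram_intersection(probe[space], hists[space])
            for g, hists in gallery.items()}
    for space in ("rgb", "hsv")
}
fused = fuse_max(scores)
```

Max fusion lets whichever color space matches best decide the score, which helps when an illumination change degrades one space more than the other.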

Identifying Influential People Based on Interaction Strength

  • Zia, Muhammad Azam;Zhang, Zhongbao;Chen, Liutong;Ahmad, Haseeb;Su, Sen
    • Journal of Information Processing Systems
    • /
    • Vol. 13, No. 4
    • /
    • pp.987-999
    • /
    • 2017
  • Extracting influential people within their respective domains has recently attracted the attention of the research community. This study introduces an interaction strength metric for retrieving the most influential users in an online social network. Interaction strength is measured by three factors: re-tweet strength, commencing intensity, and mentioning density. We design a novel algorithm called IPRank that considers communications from the perspectives of both followers and followees in order to mine and rank the most influential people based on the proposed interaction strength metric. We conducted extensive experiments to evaluate the strength and rank of each user in a micro-blog network. The comparative analysis shows that IPRank ranked people with high interaction strength highly, whereas the prior algorithm placed some people of low influence at high ranks. The proposed model uncovers influential people because it includes a novel interaction strength metric, which improves results significantly compared with the prior algorithm.
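
The abstract does not give the IPRank formula, so the following is only a generic sketch of ranking users over interaction-weighted edges. It assumes a weighted sum of the three named factors and a PageRank-style iteration; the weights, the toy counts, and the function names are all hypothetical and should not be read as the paper's algorithm.

```python
def interaction_strength(retweet, commencing, mention, weights=(1.0, 1.0, 1.0)):
    """Hypothetical combination: weighted sum of the three factors."""
    a, b, c = weights
    return a * retweet + b * commencing + c * mention

def weighted_rank(edges, damping=0.85, iterations=100):
    """PageRank-style scores; edges maps (source, target) to interaction strength."""
    nodes = sorted({n for edge in edges for n in edge})
    out_total = {n: 0.0 for n in nodes}
    for (src, _), strength in edges.items():
        out_total[src] += strength
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        # Mass held by nodes with no outgoing interaction is spread evenly.
        dangling = sum(rank[n] for n in nodes if out_total[n] == 0.0)
        base = (1.0 - damping + damping * dangling) / len(nodes)
        new_rank = {n: base for n in nodes}
        for (src, dst), strength in edges.items():
            new_rank[dst] += damping * rank[src] * strength / out_total[src]
        rank = new_rank
    return rank

# Toy network: an edge (u, v) means u interacted with v's posts (made-up counts).
edges = {
    ("bob", "alice"): interaction_strength(5, 2, 1),
    ("carol", "alice"): interaction_strength(3, 1, 0),
    ("alice", "bob"): interaction_strength(1, 0, 0),
    ("carol", "bob"): interaction_strength(1, 0, 0),
}
ranks = weighted_rank(edges)
most_influential = max(ranks, key=ranks.get)
```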

A Comparison Study of the Test for Right Censored and Grouped Data

  • Park, Hyo-Il
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 22, No. 4
    • /
    • pp.313-320
    • /
    • 2015
  • In this research, we compare the efficiency of two test procedures, proposed by Prentice and Gloeckler (1978) and by Park and Hong (2009), for grouped data with possibly right-censored observations. Both test statistics were derived using the likelihood ratio principle, but under different semi-parametric models. We review the two statistics and their asymptotic normality, and obtain empirical powers through a simulation study. The simulation study considers two types of models: the location translation model and the scale model. We discuss some interesting features related to grouped data and obtain null distribution functions with a re-sampling method. Finally, we indicate topics for future research.
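
Obtaining a null distribution by re-sampling can be sketched with a two-sample permutation test. This is a generic illustration under stated assumptions: the statistic is a simple difference in mean group index rather than either likelihood-ratio statistic from the paper, censoring is ignored, and the grouped observations are made up.

```python
import random

def mean_difference(a, b):
    """Placeholder statistic: difference in sample means."""
    return sum(a) / len(a) - sum(b) / len(b)

def permutation_p_value(sample_a, sample_b, statistic, n_resamples=2000, seed=0):
    """Two-sided p-value from the permutation null distribution of `statistic`."""
    rng = random.Random(seed)
    observed = abs(statistic(sample_a, sample_b))
    pooled = list(sample_a) + list(sample_b)
    n_a = len(sample_a)
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)  # relabel observations under the null hypothesis
        if abs(statistic(pooled[:n_a], pooled[n_a:])) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_resamples + 1)

# Grouped data: each value is the index of the interval an observation fell into.
group_a = [1, 1, 2, 2, 2, 3, 3, 1, 2, 2]
group_b = [4, 5, 5, 4, 6, 5, 4, 6, 5, 5]
p_separated = permutation_p_value(group_a, group_b, mean_difference)
p_identical = permutation_p_value(group_a, group_a, mean_difference)
```

Any test statistic with the same two-argument signature can be passed in place of `mean_difference`.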

Factors Affecting Re-smoking in Male Workers

  • 양진훈;하희숙;임지선;강윤식;이덕희;천병렬;감신
    • Journal of Preventive Medicine and Public Health
    • /
    • Vol. 38, No. 2
    • /
    • pp.208-214
    • /
    • 2005
  • Objectives: This study was performed to examine the factors affecting re-smoking in male workers. Methods: A self-administered questionnaire survey was conducted during April 2003 to examine the smoking status of 1,154 employees of a company that launched a smoking cessation campaign in 1998. Five hundred and eighty-seven persons who had stopped smoking for at least one week were selected as the final study subjects. This study collected data on smoking cessation success or failure over 6 months and examined the factors affecting re-smoking within this period. The Health Belief Model was used as the theoretical basis. Results: The re-smoking rate of the 587 study subjects who had stopped smoking for at least one week was 44.8% within the 6-month period. In a simple analysis, re-smoking rates were higher in workers who were younger, worked day and night shifts, were blue collar, held a low rank, were making their second attempt at smoking cessation, or had a shorter job duration (p<0.05). Of the cues to action variables in the Health Belief Model, re-smoking was significantly related to the perceived susceptibility factor, the economic advantages of smoking cessation among the perceived benefits factor, the degree of the cessation trial's barrier among the perceived barriers factor, smoking symptom experience, recognition of the degree of harmfulness of environmental tobacco smoke, and the existence of chronic disease due to smoking (p<0.05). In the multiple logistic regression analysis for re-smoking, the significant variables were age, perceived susceptibility to disease, economic advantages of smoking cessation, the perceived barrier to smoking cessation, recognition of the degree of harmfulness of environmental tobacco smoke, the existence of chronic disease due to smoking, and the number of attempts at smoking cessation (p<0.05). Conclusion: The results of this study suggest that an effective workplace smoking ban policy requires health education that improves knowledge of the adverse health effects of smoking and the harmfulness of environmental tobacco smoke, as well as measures to reduce the barriers to smoking cessation.

Assessment of Atmospheric Corrosivity at Jeju Island

  • 김귀식;양경조;허철구;송정화
    • 한국해양공학회지
    • /
    • Vol. 19, No. 5
    • /
    • pp.50-57
    • /
    • 2005
  • This study investigated the corrosivity of carbon steel, Cu, Zn, and Al over one year, from Sept. 2003 to Aug. 2004. The ISO 9223-ISO 9226 model, which represents the relation between metal corrosion and environmental parameters, was used to evaluate atmospheric corrosion. The environmental parameters for these evaluations are time of wetness (TOW), $SO_2$, and chloride. Corrosion rates for the four metals, exposed indoors and outdoors, were measured at five locations on Jeju Island: Gosan, Seogwipo, Seongsan, Chuna hill, and Jeju city. The $SO_2$ class of Jeju Island is P0, a clean area. TOW classes of T3 and T4 indicate that Jeju has the characteristics of a tropical area. Chloride classes within 3 km of the coast are S2 and S3, showing the features of a coastal area, while Chuna hill is class S1, showing the features of woodland. At each site, the corrosion class measured outdoors was higher than that measured indoors. Gosan had the highest class, C5, and the other sites were ranked C3 or C4.

The Study on the Comparative Analysis of the Aquaculture Production Efficiency Regarding Methods and Species

  • 박철형
    • 수산경영론집
    • /
    • Vol. 43, No. 2
    • /
    • pp.79-94
    • /
    • 2012
  • The purpose of this study is to investigate the production efficiencies of Korean aquaculture with respect to species and methods using Data Envelopment Analysis (DEA). For analytical purposes, the study selected 8 fish species in each of three methods: sea cage culture, aquarium basin, and enclosed aquaculture. First, the study estimated the technical, pure technical, and scale efficiencies of the total of 24 aquaculture fishes based on traditional DEA under the assumptions of both CRS and VRS. Two fishes were identified as efficient DMUs under the CCR model, and 6 fishes under the BCC model. Second, we tested whether production efficiencies differed among the three aquaculture methods. We found no evidence of differences in efficiency using a rank sum test based on traditional DEA. However, using Bilateral-DEA, which can explicitly account for the heterogeneity of the 3 production methods, we found that pure technical efficiency in sea cage culture was lower than in the others at the 1% level of significance, and that pure technical efficiency in enclosed aquaculture was also lower than in the others at the 5% level of significance. Finally, the study obtained 95% confidence intervals of the efficiency scores for the 24 fishes using the smoothed bootstrap method, with re-sampling combined with kernel density estimation and a reflection method. At the same time, we estimated bias-corrected efficiency scores, whereas the traditionally estimated efficiency scores suffer from bias because the linear program is solved against a deterministic production frontier. Hence, we could distinguish the differences in production efficiencies of the 8 fishes with respect to the 3 methods of aquaculture.

Estimation of the genetic milk yield parameters of Holstein cattle under heat stress in South Korea

  • Lee, SeokHyun;Do, ChangHee;Choy, YunHo;Dang, ChangGwon;Mahboob, Alam;Cho, Kwanghyun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 32, No. 3
    • /
    • pp.334-340
    • /
    • 2019
  • Objective: The objective of this study was to investigate the genetic components of daily milk yield and to re-rank bulls in South Korea by estimated breeding value (EBV) under heat stress using the temperature-humidity index (THI). Methods: This study was conducted using 125,312 monthly test-day records, collected from January 2000 to February 2017 for 19,889 Holstein cows from 647 farms in South Korea. Milk production data were collected from two agencies, the Dairy Cattle Genetic Improvement Center and the Korea Animal Improvement Association, and meteorological data were obtained from 41 regional weather stations using the Automated Surface Observing System (ASOS) installed throughout South Korea. A random regression model using the THI was applied to estimate genetic parameters of heat tolerance based on the test-day records. The model included herd-year-season, calving age, and days-in-milk as fixed effects, as well as heat tolerance as an additive genetic effect, permanent environmental effect, and direct additive and permanent environmental effect. Results: Below the THI threshold (${\leq}72$; no heat stress), the variance in heat tolerance was zero. However, the heat tolerance variance began to increase as THI exceeded the threshold. The covariance between the genetic additive effect and the heat tolerance effect was -0.33. Heritability estimates of milk yield ranged from 0.111 to 0.176 (average: 0.128). Heritability decreased slightly as THI increased, and began to increase at a THI of 79. The predicted bull EBV ranking varied with THI. Conclusion: We conclude that genetic evaluation using the THI function could be useful for selecting bulls for heat tolerance in South Korea.
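
How a bull's ranking can change with THI may be sketched with a threshold heat-load function. The threshold of 72 comes from the abstract; the linear form above the threshold, the two-component breeding value, and the bull values are illustrative assumptions rather than the paper's fitted random regression model.

```python
THI_THRESHOLD = 72.0  # from the abstract: no heat stress at THI <= 72

def heat_load(thi, threshold=THI_THRESHOLD):
    """Zero below the threshold, then linear in the excess THI (assumed form)."""
    return max(0.0, thi - threshold)

def ebv_at(thi, general_ebv, heat_ebv):
    """Breeding value at a given THI: general merit plus heat-tolerance term."""
    return general_ebv + heat_load(thi) * heat_ebv

def rank_bulls(bulls, thi):
    """Order bull names by descending EBV at the given THI."""
    return sorted(bulls, key=lambda name: -ebv_at(thi, *bulls[name]))

# Hypothetical bulls: (general EBV, heat-tolerance EBV per THI unit above threshold).
bulls = {"A": (2.0, -0.20), "B": (1.5, -0.05)}
ranking_cool = rank_bulls(bulls, thi=70)
ranking_hot = rank_bulls(bulls, thi=80)
```

In this toy case bull A leads without heat stress but loses more yield per THI unit, so bull B overtakes it at THI 80. That is the kind of re-ranking a negative covariance between the general and heat-tolerance effects would produce.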

Performance Improvement by Cluster Analysis in Korean-English and Japanese-English Cross-Language Information Retrieval

  • 이경순
    • 정보처리학회논문지B
    • /
    • Vol. 11B, No. 2
    • /
    • pp.233-240
    • /
    • 2004
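  • This paper proposes a method that implicitly resolves ambiguity through incremental clustering in cross-language information retrieval (CLIR). The aim is to examine whether document clusters can act as document context and resolve ambiguity when query translation has greatly increased it. The proposed method translates Korean or Japanese queries into English using a dictionary and retrieves documents for the translated English query with a vector space retrieval model or a probabilistic retrieval model. It then dynamically builds incremental clusters in the rank order of the retrieved documents and re-ranks the documents by reflecting this cluster information in the query. In experiments on the TREC test collection, for queries without disambiguation, the proposed method improved performance in Korean-English CLIR by 39.41% with the vector space model and by 36.79% with the probabilistic model. In Japanese-English CLIR, it improved performance by 17.59% and 30.46%, respectively. Compared with relevance feedback, it improved performance by 12.30% with the probabilistic model when no disambiguation was applied. These results indicate that cluster analysis helps resolve query ambiguity and thereby contributes to retrieval performance.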
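
The cluster-then-re-rank idea in this abstract can be sketched as follows. The abstract does not specify the similarity threshold, the centroid update, or the score combination, so the sketch assumes single-pass clustering with cosine similarity and a linear mix of document and cluster scores; the documents and parameter values are toy examples.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def incremental_clusters(ranked_docs, threshold=0.5):
    """Single pass in rank order: join the closest cluster or start a new one."""
    clusters = []  # each cluster: {"centroid": Counter, "members": [doc ids]}
    for doc_id, vec in ranked_docs:
        best = max(clusters, key=lambda c: cosine(vec, c["centroid"]), default=None)
        if best is not None and cosine(vec, best["centroid"]) >= threshold:
            best["centroid"].update(vec)  # summed counts act as the centroid
            best["members"].append(doc_id)
        else:
            clusters.append({"centroid": Counter(vec), "members": [doc_id]})
    return clusters

def rerank_with_clusters(query, ranked_docs, clusters, alpha=0.5):
    """Mix each document's own score with its cluster's score against the query."""
    vectors = dict(ranked_docs)
    cluster_of = {d: c for c in clusters for d in c["members"]}
    def score(doc_id):
        own = cosine(query, vectors[doc_id])
        ctx = cosine(query, cluster_of[doc_id]["centroid"])
        return (1.0 - alpha) * own + alpha * ctx
    return sorted(vectors, key=lambda d: -score(d))

# An ambiguous translated query and four documents in initial retrieval order.
query = Counter(["bank", "loan"])
ranked_docs = [
    ("d1", Counter(["bank", "money", "loan"])),
    ("d2", Counter(["river", "bank", "water"])),
    ("d3", Counter(["money", "loan", "interest"])),
    ("d4", Counter(["loan", "interest", "rate"])),
]
clusters = incremental_clusters(ranked_docs)
reranked = rerank_with_clusters(query, ranked_docs, clusters)
```

Here the finance documents form one cluster that supports the intended sense of "bank", so the river document drops from second place to last.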

Hypertext Retrieval System Using XLinks

  • 김은정;배종민
    • 정보처리학회논문지D
    • /
    • Vol. 8D, No. 5
    • /
    • pp.483-494
    • /
    • 2001
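  • Typical hypertext retrieval models ignore the relationships between documents and the semantics of links, treating every document as an independent entity. However, a hypertext retrieval system can improve retrieval performance by using link information. Existing link-based hypertext retrieval models ignore link information during document indexing and use it only to readjust document priorities within the result set. The drawback is that link information is applied only to the documents in the result set. This paper instead uses link information during indexing. In the indexing process, link information is used to define the weights of terms in a document and the weights of the document's inLinks, and these weights are used in an extended RSV formula for ranking documents. The experiments report recall and precision by link semantics, along with a comparison and analysis against existing link-based retrieval models.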
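
Since the abstract does not state the extended RSV formula, the sketch below only illustrates the general idea of folding inLink information into the index rather than into post-retrieval re-ranking. The link types, their weights, and the additive propagation of source-document term weights are hypothetical choices, not the paper's definitions.

```python
# Hypothetical weights per link semantic type.
LINK_TYPE_WEIGHT = {"citation": 0.5, "navigation": 0.1}

def index_with_inlinks(term_weights, inlinks):
    """Build an index whose term weights include contributions from inLinks.

    term_weights: {doc: {term: weight}}
    inlinks: {doc: [(source_doc, link_type), ...]}
    """
    index = {}
    for doc, weights in term_weights.items():
        combined = dict(weights)
        for source, link_type in inlinks.get(doc, []):
            factor = LINK_TYPE_WEIGHT.get(link_type, 0.0)
            for term, weight in term_weights[source].items():
                combined[term] = combined.get(term, 0.0) + factor * weight
        index[doc] = combined
    return index

def rsv(query_weights, doc_weights):
    """Retrieval status value as an inner product of query and document weights."""
    return sum(qw * doc_weights.get(t, 0.0) for t, qw in query_weights.items())

# Toy collection: document "c" is cited by document "b".
term_weights = {
    "a": {"xml": 1.0},
    "b": {"xlink": 1.0, "xml": 0.5},
    "c": {"xlink": 1.0},
}
inlinks = {"c": [("b", "citation")]}
index = index_with_inlinks(term_weights, inlinks)
query = {"xml": 1.0, "xlink": 1.0}
ranking = sorted(index, key=lambda d: -rsv(query, index[d]))
```

Because the link contribution is stored in the index, document "c" can match the query term "xml" even though the term never appears in its own text.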
