• Title/Summary/Keyword: 중복분석

Search Result 1,435, Processing Time 0.032 seconds

Exploration on Possibility of the Disciplinary Convergence of the User Studies and the Research in Practice (이용자연구와 실용연구 분야의 학제적 융합 가능성 도출 연구)

  • Lee, Jee Yeon;Kam, Miah
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.1
    • /
    • pp.129-155
    • /
    • 2018
  • This research aims to discover various aspects of the user studies and the research in practice and also to propose collaboration methods by empirical analysis of the data. To determine the application applicability of the user studies in other subject areas, the degree of keyword overlap between the user studies and the User Experience (UX), one of the research in practice discipline, was measured. The quantitative information science methods including simple frequency analysis were applied to more than ten thousand published papers to generate the network mapping and ranking as well as comparative analysis by time. The analysis result showed that there were slightly lesser overlap between the user studies and the UX in the domestically published articles than the international ones. It also revealed that there is a relationship between the actual occurrences of collaboration and the keyword overlap. The temporal analysis showed that there is increasingly more keyword overlap between two disciplines and thus it is possible to predict the active convergence in the future.

A Study on the "Kor-T", a Modified Tapered h-index, by Applying the Ranking According to the Number of Citations of Journals in Evaluating Korean Journals (학술지의 피인용횟수 순위를 적용한 tapered h-지수의 변형지표 "Kor-hT"에 관한 연구)

  • Ko, Young Man;Cho, Soo-Ryun;Park, Ji Young
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.4
    • /
    • pp.111-131
    • /
    • 2013
  • This study describes the meaning of and the formula for Kor-$h_T$, which is a modified index built on the tapered h-index by applying 'the ranking according to the number of citations of journals'. This study evaluated the de-duplication rate of index values of Kor-$h_T$ and analyzed the change in the correlation between the index values and evaluation elements using the Korea Citation Index data from 2008 to 2010. Kor-$h_T$ is compared with h-index, tapered h-index, and IF. As a result, Kor-$h_T$ appeared to be superior to other indexes on de-duplication rate. It is also shown that there is a very strong positive correlation between the evaluation elements, the number of citations and the number of articles of journals, and the index values of Kor-$h_T$.

Study of Efficient Algorithm for Deduplication of Complex Structure (복잡한 구조의 데이터 중복제거를 위한 효율적인 알고리즘 연구)

  • Lee, Hyeopgeon;Kim, Young-Woon;Kim, Ki-Young
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.14 no.1
    • /
    • pp.29-36
    • /
    • 2021
  • The amount of data generated has been growing exponentially, and the complexity of data has been increasing owing to the advancement of information technology (IT). Big data analysts and engineers have therefore been actively conducting research to minimize the analysis targets for faster processing and analysis of big data. Hadoop, which is widely used as a big data platform, provides various processing and analysis functions, including minimization of analysis targets through Hive, which is a subproject of Hadoop. However, Hive uses a vast amount of memory for data deduplication because it is implemented without considering the complexity of data. Therefore, an efficient algorithm has been proposed for data deduplication of complex structures. The performance evaluation results demonstrated that the proposed algorithm reduces the memory usage and data deduplication time by approximately 79% and 0.677%, respectively, compared to Hive. In the future, performance evaluation based on a large number of data nodes is required for a realistic verification of the proposed algorithm.

The analyses of duplicated contents of 'Consumer Life' area in Technology & Home Economics and other subject textbooks for middle and high school students (중·고등학교 기술·가정 교과서와 타 교과 교과서의 '소비생활' 영역 중복 내용 분석)

  • Lee, Jung Yoon;Yu, Nan Sook
    • Journal of Korean Home Economics Education Association
    • /
    • v.27 no.4
    • /
    • pp.121-140
    • /
    • 2015
  • The purposes of this study were to analyze the duplicated contents of 'Consumer life' area of Technology & Home Economics and other subject textbooks for the middle and high school students. It focused on textbooks compiled following the 2009 revised curriculum. To achieve the purposes of this study, "Technology & Home Economics I II", "Social studies I II", and "Ethics I II"textbooks for middle school and "Technology & Home Economics", "Social studies", and "Life & Ethics" textbooks for high school were analyzed based on the criteria for analyses of 'Consumer life' area. The results were as follows. First, the analysis of duplicated contents in Technology & Home Economics and other subjects (Ethics, Social studies) for middle school revealed that Technology & Home Economics textbook had the most proportion of 'Consumer Life' area, followed by Social studies and Ethics. The duplicated content elements in Technology & Home Economics, Ethics, and Social studies textbooks for middle school were 'consumer decision making', 'consumer information', 'economic impact of consumption', 'food life and sustainability', and 'consumption and sustainability'. Secondly, as a result of the content analysis of textbooks for high school Technology & Home Economics, Social studies, and Life & Ethics according to the criteria of analysis, it was found that Technology & Home Economics textbook had the most proportion of 'Consumer Life' area, followed by Life & Ethics and Social studies. The "content elements" 'food life management and consumption environment', 'desire of consumption', 'economic impact of consumption', 'changing factors and characteristics of consumer culture', and 'consumption and sustainability' were commonly found in all three textbooks. In this way, the 'Consumer life' area of Technology & Home Economics is thought to play a central role in teaching the 'Consumer Life' area because of its strength that contains detailed contents about consumer life for adolescent consumers who will apply it to everyday life. Based on the result of this research, it is needed to consider articulation of 'Consumer life' area of secondary schools for the future curriculum development of Technology & Home Economics to reduce the duplicated contents and to help the adolescents develop the ability to solve consumption problems they may encounter in real life and grow up to be rational adult consumers.

A System for Measuring the Similarity and Redundancy of R&D Project (R&D 과제의 유사도 및 중복도 측정 시스템에 관한 연구)

  • Choi, Kook-Hyun;Kang, Yong-Suk;Kim, Jong-Hee;Shin, Yong-Tae;Kim, Jong-Bae
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.329-331
    • /
    • 2014
  • The analysis of the similarities and redundancies among R&D projects is important for the efficient investment of government budgets. When government R&D projects are planned, the redundancies of research tasks are examined by institutions specializing in research management, relevant offices and departments, and the government to prevent redundant funding. However, as existing similarity analyses depend on methods wherein new task proposals and existing R&D project proposals are compared and looked up based on keywords. This results in vulnerability wherein similarity cannot be accurately measured in the event of partial modifications of the task name or technical substitutions. This study aims to use patent information as characteristics by which R&D project documents can be identified. The patent data used is based on materials officially published by the government's R&D patent trend survey project (http://ipas.rndip.re.kr). The study aims to propose a method by which patent information can be used to analyze the similarity and redundancy among R&D projects when new projects are entered. For this purpose, a similarity measurement model based on set theory and probability theory is presented. The presented measurement model is implemented into an actual system to identify redundant documents, and calculate and show their similarity.

  • PDF

Optimization Using Partial Redundancy Elimination in SSA Form (SSA Form에서 부분 중복 제거를 이용한 최적화)

  • Kim, Ki-Tae;Yoo, Weon-Hee
    • The KIPS Transactions:PartD
    • /
    • v.14D no.2
    • /
    • pp.217-224
    • /
    • 2007
  • In order to determine the value and type statically. CTOC uses the SSA Form which separates the variable according to assignment. The SSA Form is widely being used as the intermediate expression of the compiler for data flow analysis as well as code optimization. However, the conventional SSA Form is more associated with variables rather than expressions. Accordingly, the redundant expressions are eliminated to optimize expressions of the SSA From. This paper defines the partial redundant expression to obtain a more optimized code and also implements the technique for eliminating such expressions.

Korean Morphological Analysis Sharing Partial Analyses (부분 분석 결과를 공유하는 한국어 형태소 분석)

  • 이상호
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.75-79
    • /
    • 1994
  • 한국어 어절의 모든 가능한 형태소 분석 결과는 형태소 격자 구조로 대응된다. 즉, 형태소 분석과정은 형태소 격자 구조를 만드는 과정과 동일하다고 말할 수 있다. 기존의 방법들은 여러개의 가능한 분석 결과에 중복되는 형태소들을 그대로 저장하여 자료 관리의 비효율성이 있었다. 본 논문에서 설명하는 형태소 분석기는 형태소 분석의 중간 결과를 공유하여, 자료의 중복 저장을 피했고, 모든 가능한 형태소 분석 결과를 형태소 격자 구조의 가능한 모든 경로로 대응하였다. 한편, 형태소 배열 규칙은 품사 태깅된 말뭉치로부터 자동으로 추출되었다. 또한, 사전도 품사 태깅된 말뭉치로부터 자동으로 구축되었으며, 굴절된 형태소는 등록되지 않는다. 그러나 불규칙 및 축약 현상에 관한 정보는 수동으로 추가되었다. 불규칙 및 축약 현상의 발생 가능 위치는 한글 자소 패턴에 의해서 찾아지고, 이들 현상의 처리는 절차적인 방법에 의해 해결되었다.

  • PDF

Reliability analysis of multi-state parallel system with a multi-functional standby component (다기능 대기부품을 갖는 다중상태 병렬시스템의 신뢰도 분석)

  • Kim, Dong-Hyeon;Lee, Suk-Hoon;Lim, Jae-Hak
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.20 no.4
    • /
    • pp.75-87
    • /
    • 2015
  • A redundant structure typically consists of primary component and standby component taking over the function of the primary component when the primary component fails. In this research, we consider a redundant structure in which a standby component can take over the function of more than one primary component when primary components fail. And we assume that the system has multi-state according to the states of components while all components have two states. This system is called as the multi-state redundant system with a multi-functional standby component. This type of redundant structure is frequently adapted by the system such as an aircraft in which the weight is an important design factor. In this paper, we propose new reliability model for this multi-state redundant system with a multi-functional standby component in order for evaluating the reliability of the system. Under the assumption that all components have constant failure rate, we evaluate the reliability of the system by applying Markov analysis method. And we investigate the effect of the multi-functional standby component by comparing reliabilities of the parallel system with multi-functional standby component and a simple parallel system and a parallel system with redundant structure.

The Difference of Characters between Housing Poverty Types - Subcriterion Criteria of Substandard Housing, Unaffordable Housing and Double Housing Poverty (유형별 주거빈곤가구의 차이 - 최저주거기준 하위기준미달, 주거비 과부담, 중복주거빈곤가구)

  • Lim, Se hee;Park, Kyung ha
    • 한국사회정책
    • /
    • v.24 no.4
    • /
    • pp.31-62
    • /
    • 2017
  • This study intends to identify the difference of socio-economic characters and housing welfare needs between housing poverty types and to know the independent effects of variables on the housing poverty types. It was revealed that the double housing poverty household, housing below facility standard, unaffordable housing with low income, housing below structure performance environment standard, housing below area standard and housing below room standard should be supported one by one. And the variables related with the housing poverty types are different Suggestions were made for housing welfare policy for the double housing poverty, the control for rental housing market, the policy considering income level for unaffordable housing, the housing policy for the disable household.

Log Management Using Backward Log Analysis in Client-Server Database System (클라이언트/서버 데이터베이스 시스템에서 역방향 로그 분석을 이용한 로그 관리)

  • 이찬섭;박용문;고병오;최의인
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.11B
    • /
    • pp.1928-1938
    • /
    • 2000
  • 기존 데이터베이스 시스템에서 사용되는 회복 기법들은 시스템 파손 시 빠른 회복을 지원하기 위해서 물리적 로깅(physical logging)을 사용한다. 그러나 이런 기법들을 클라이언트/서버 환경에 그대로 적용할 경우에는 여러 가지 문제점이 발생된다. 물리적 기법의 경우에는 로그 분석 시 before-image와 after-image의 중복이 발견된다는 문제점이 있으며, 기존의 대부분 회복 기법들은 시스템 파손 시 전방향(forward)으로 로그를 분석함으로써 불필요한 회복 동작이 존재할 수 있다. 또한 시스템 회복 시 로그 접근 횟수의 증가로 인해 회복 속도가 늦어지는 문제점이 있다. 이 논문에서는 이런 문제점을 해결하고 클라이언트/서버 환경에 적합한 회복 기법을 제안하기 위해 중복된 before-image를 제거하고 재수행 전용 로그 레코드(redo-only log record)만을 로그에 기록함으로써 로깅 오버헤드를 감소시키면서 로그 분석 시간을 감소시킨 역방향 로그 분석 기법을 제안하였다. 또한 로그 분석 시 유지해야 하는 자료구조의 오버헤드를 최소화했다. 마지막으로 제안된 기법과 기존의 기법을 비교 분석하였다.

  • PDF