• 제목/요약/키워드: Incremental Data

검색결과 462건 처리시간 0.023초

A New Incremental Learning Algorithm with Probabilistic Weights Using Extended Data Expression

  • Yang, Kwangmo;Kolesnikova, Anastasiya;Lee, Won Don
    • Journal of information and communication convergence engineering
    • /
    • 제11권4호
    • /
    • pp.258-267
    • /
    • 2013
  • New incremental learning algorithm using extended data expression, based on probabilistic compounding, is presented in this paper. Incremental learning algorithm generates an ensemble of weak classifiers and compounds these classifiers to a strong classifier, using a weighted majority voting, to improve classification performance. We introduce new probabilistic weighted majority voting founded on extended data expression. In this case class distribution of the output is used to compound classifiers. UChoo, a decision tree classifier for extended data expression, is used as a base classifier, as it allows obtaining extended output expression that defines class distribution of the output. Extended data expression and UChoo classifier are powerful techniques in classification and rule refinement problem. In this paper extended data expression is applied to obtain probabilistic results with probabilistic majority voting. To show performance advantages, new algorithm is compared with Learn++, an incremental ensemble-based algorithm.

점진적 스레드 트리를 구성하기 위한 파싱 알고리즘 (A Parsing Algorithm for Constructing Incremental Threaded Tree)

  • 이대식
    • 인터넷정보학회논문지
    • /
    • 제7권4호
    • /
    • pp.91-99
    • /
    • 2006
  • 점진적 파싱 기법은 프로그램의 점진적 구성을 허용하는 언어기반 환경의 중요한 부분이며, 프로그램의 변경된 부분에 대해서만 구문분석을 다시 함으로써 시스템의 성능을 향상 시킨다. 기존의 점진적 파싱은 파싱 정보를 저장하기 위해 스택 자료구조를 사용한다. 본 논문에서는 스택 자료구조를 사용하지 않고 노드 주소로 스레드를 추가하여 스레드 트리 구성 알고리즘을 제안한다. 또한 구성된 스레드 트리를 사용하여 5단계의 점진적 파싱 과정으로 나누어 점진적 스레드 트리 구성 알고리즘을 제안한다.

  • PDF

Distributed Incremental Approximate Frequent Itemset Mining Using MapReduce

  • Mohsin Shaikh;Irfan Ali Tunio;Syed Muhammad Shehram Shah;Fareesa Khan Sohu;Abdul Aziz;Ahmad Ali
    • International Journal of Computer Science & Network Security
    • /
    • 제23권5호
    • /
    • pp.207-211
    • /
    • 2023
  • Traditional methods for datamining typically assume that the data is small, centralized, memory resident and static. But this assumption is no longer acceptable, because datasets are growing very fast hence becoming huge from time to time. There is fast growing need to manage data with efficient mining algorithms. In such a scenario it is inevitable to carry out data mining in a distributed environment and Frequent Itemset Mining (FIM) is no exception. Thus, the need of an efficient incremental mining algorithm arises. We propose the Distributed Incremental Approximate Frequent Itemset Mining (DIAFIM) which is an incremental FIM algorithm and works on the distributed parallel MapReduce environment. The key contribution of this research is devising an incremental mining algorithm that works on the distributed parallel MapReduce environment.

Multi-channel Incremental Data Converters

  • Bae, Sung-Hwan;Lee, Chang-Ki;Kim, Dae-Ik
    • 한국전자통신학회논문지
    • /
    • 제4권1호
    • /
    • pp.33-36
    • /
    • 2009
  • Incremental converters provide a solution for such measurement applications, as they retain most of the advantages of conventional ${\Delta}{\Sigma}$ converters, and yet they are capable of offset-free and accurate conversion. Most of the previous research on incremental converters was for single-channel and dc signal applications, where they can perform extremely accurate data conversion with more than 20-bit resolution. In this paper, a design technique for implementing multi-channel incremental data converters to convert narrow bandwidth ac signals is discussed. It incorporates the operation principle, topology, and digital decimation filter design. The theoretical results are verified by simulation results.

  • PDF

Incremental Information Content of Cash Flow and Earnings in the Iranian Capital Market

  • Asgari, Leila;Salehi, Mahdi;Mohammadi, Ali
    • 산경연구논집
    • /
    • 제5권1호
    • /
    • pp.5-9
    • /
    • 2014
  • Purpose - This study aims to examine the incremental information content of cash flw from operations and earnings in the Iranian capital market. Design, methodology, and approach - Based on market-based accounting research, this study uses statistical associations between accounting data (earnings and cash flw) and stock returns to assess/measure the incremental information content (value relevance) of cash flw and earnings. A multivariate regression model based on panel data is used to examine the incremental information content of earnings and cash flow from operations. Results - The results show that both earnings and cash flow from operations have incremental information content beyond each other. These results are consistent with the findings of recent studies. Overall, the fidings of this study support the usefulness of cash flw information in addition to earnings, in fim valuation by investors in the Iranian market. Conclusions - The study makes the following contributions to the Iranian literature on incremental information content of cash flw and earnings. First, this study employs actual cash flw data derived from cash flw statements. Second, this study employs a large sample size for a more recent period.

Markov 그라픽 데이타에 대한 incremental-runlength의 확률분포 (Incremental-runlength distribution for Markov graphic data source)

  • 김재균
    • 전기의세계
    • /
    • 제29권6호
    • /
    • pp.389-392
    • /
    • 1980
  • For Markov graphic source, it is well known that the conditional runlength coding for the runs of correct prediction is optimum for data compression. However, because of the simplicity in counting and the stronger concentration in distrubution, the incremental run is possibly a better parameter for coding than the run itself for some cases. It is shown that the incremental-runlength is also geometrically distributed as the runlength itself. The distribution is explicitly described with the basic parameters defined for a Markov model.

  • PDF

점증적인 맵 갱신을 지원하는 DB 기반 내비게이션의 성능 향상을 위한 데이터 단편화 방지 기법 (Data Fragmentation Protection Technique for the Performance Enhancement of DB-Based Navigation Supporting Incremental Map Update)

  • 김용호;김재광;진성일
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제9권3호
    • /
    • pp.77-82
    • /
    • 2020
  • 차량에 탑재된 내비게이션의 대부분은 복잡한 구조의 PSF(Physical Storage Format) 파일 기반으로 개발되어 점증적 맵 갱신을 지원하기 어렵다. 이를 해결하기 위한 차세대 내비게이션 방법의 하나로서 DB 기반의 내비게이션 기술이 주목받고 있다. 점진적 맵 갱신을 지원하는 DB 기반 내비게이션 구현에 있어 지속적인 맵 데이터 갱신으로 인한 데이터 단편화현상으로 데이터 접근 비용이 증가할 수 있어 검색 성능의 저하가 발생할 수 있다. 본 논문에서는 점증적 맵 갱신을 지원하는 DB 기반 내비게이션의 성능 향상 방법의 하나로 데이터 단편화 방지 기법을 제시하고 실제 구현을 통하여 성능 향상 효과가 있음을 검증하였다.

미리 순서가 매겨진 학습 데이타를 이용한 효과적인 증가학습 (Efficient Incremental Learning using the Preordered Training Data)

  • 이선영;방승양
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제27권2호
    • /
    • pp.97-107
    • /
    • 2000
  • 증가학습은 점진적으로 학습 데이타를 늘려가며 신경망을 학습시킴으로써 일반적으로 학습시간을 단축시킬 뿐만 아니라 신경망의 일반화 성능을 향상시킨다. 그러나, 기존의 증가학습은 학습 데이타를 선정하는 과정에서 데이타의 중요도를 반복적으로 평가한다. 본 논문에서는 분류 문제의 경우 학습이 시작되기 전에 데이타의 중요도를 한 번만 평가한다. 제안된 방법에서는 분류 문제의 경우 클래스 경계에 가까운 데이타일수록 그 데이타의 중요도가 높다고 보고 이러한 데이타를 선택하는 방법을 제시한다. 두가지 합성 데이타와 실세계 데이타의 실험을 통해 제안된 방법이 기존의 방법보다 학습 시간을 단축시키며 일반화 성능을 향상시킴을 보인다.

  • PDF

AN EFFICIENT ALGORITHM FOR SLIDING WINDOW BASED INCREMENTAL PRINCIPAL COMPONENTS ANALYSIS

  • Lee, Geunseop
    • 대한수학회지
    • /
    • 제57권2호
    • /
    • pp.401-414
    • /
    • 2020
  • It is computationally expensive to compute principal components from scratch at every update or downdate when new data arrive and existing data are truncated from the data matrix frequently. To overcome this limitations, incremental principal component analysis is considered. Specifically, we present a sliding window based efficient incremental principal component computation from a covariance matrix which comprises of two procedures; simultaneous update and downdate of principal components, followed by the rank-one matrix update. Additionally we track the accurate decomposition error and the adaptive numerical rank. Experiments show that the proposed algorithm enables a faster execution speed and no-meaningful decomposition error differences compared to typical incremental principal component analysis algorithms, thereby maintaining a good approximation for the principal components.

Incremental Multi-classification by Least Squares Support Vector Machine

  • Oh, Kwang-Sik;Shim, Joo-Yong;Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권4호
    • /
    • pp.965-974
    • /
    • 2003
  • In this paper we propose an incremental classification of multi-class data set by LS-SVM. By encoding the output variable in the training data set appropriately, we obtain a new specific output vectors for the training data sets. Then, online LS-SVM is applied on each newly encoded output vectors. Proposed method will enable the computation cost to be reduced and the training to be performed incrementally. With the incremental formulation of an inverse matrix, the current information and new input data are used for building another new inverse matrix for the estimation of the optimal bias and lagrange multipliers. Computational difficulties of large scale matrix inversion can be avoided. Performance of proposed method are shown via numerical studies and compared with artificial neural network.

  • PDF