• 제목/요약/키워드: Information Mining

검색결과 3,350건 처리시간 0.03초

기계학습 기반 비트코인 채굴 난이도 예측 연구 (Machine Learning Based Prediction of Bitcoin Mining Difficulty)

  • 이준원;권태경
    • 정보보호학회논문지
    • /
    • 제29권1호
    • /
    • pp.225-234
    • /
    • 2019
  • 비트코인은 탈중앙화와 분산원장을 특징으로 하는 암호화폐로서 "작업증명"이라는 채굴시스템을 통해 유지된다. 채굴 시스템에서는 블록 생성시간을 일정하게 유지하기 위해 채굴 난이도를 조정하게 되는데, 기존의 채굴 난이도 변경 방식은 미래의 해시파워를 반영할 수 없다는 문제가 있다. 따라서 실제시간과 예정시간 사이에 발생하는 오차로 인해 블록생성과 실세계 시간의 불일치를 가중시키게 되고, 결국 거래 기한을 맞추지 못하거나 코인 호핑 공격에 취약점을 노출시키게 된다. 블록 생성시간을 일정하게 유지시키기 위한 기존 연구도 여전히 오차 문제를 갖는다. 본 연구에서는 이러한 오차를 줄이기 위한 기계학습 기반 채굴 난이도 예측 방안을 제시한다. 이전 해시파워를 학습하여 미래의 해시파워를 예측하고 예측한 값을 이용하여 채굴 난이도를 조정한다. 우리의 실험 결과는 이와 같은 경우 기존 채굴 난이도 조정방식보다 오차율을 약 36% 더 줄일 수 있음을 보여준다.

A Robust and Device-Free Daily Activities Recognition System using Wi-Fi Signals

  • Ding, Enjie;Zhang, Yue;Xin, Yun;Zhang, Lei;Huo, Yu;Liu, Yafeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권6호
    • /
    • pp.2377-2397
    • /
    • 2020
  • Human activity recognition is widely used in smart homes, health care and indoor monitor. Traditional approaches all need hardware installation or wearable sensors, which incurs additional costs and imposes many restrictions on usage. Therefore, this paper presents a novel device-free activities recognition system based on the advanced wireless technologies. The fine-grained information channel state information (CSI) in the wireless channel is employed as the indicator of human activities. To improve accuracy, both amplitude and phase information of CSI are extracted and shaped into feature vectors for activities recognition. In addition, we discuss the classification accuracy of different features and select the most stable features for feature matrix. Our experimental evaluation in two laboratories of different size demonstrates that the proposed scheme can achieve an average accuracy over 95% and 90% in different scenarios.

데이터마이닝을 이용한 설문조사 및 분석 (Questionnaire Survey and Analysis Using Data Mining)

  • 박만희;채화성;신완선
    • 산업경영시스템학회지
    • /
    • 제25권5호
    • /
    • pp.46-52
    • /
    • 2002
  • Today's database system needs to collect huge amount of questionnaire that results from development of the information technology by the internet, so it has to be administrable. However, there are many difficulties concerned with finding analytic data or useful information in the high capacity-database. Data mining can solve these problems and utilize the database. Questionnaire analysis that uses data mining has drawn relevant patterns that did not look or was tended to overlook before. These patterns can be applied by a new business rule. The purpose of this research is to analyze the questionnaire results and to present the result that can help to make decision easily with data mining. Recognition and analysis about these techniques of data mining show suitable type of questionnaire survey. This research focus on the form of present composition and the model of suitable questionnaire to analyze the type of it. Also, the comparison between the actual questionnaire result and the conventional statistical analysis is examined.

Gene Algorithm of Crowd System of Data Mining

  • Park, Jong-Min
    • Journal of information and communication convergence engineering
    • /
    • 제10권1호
    • /
    • pp.40-44
    • /
    • 2012
  • Data mining, which is attracting public attention, is a process of drawing out knowledge from a large mass of data. The key technique in data mining is the ability to maximize the similarity in a group and minimize the similarity between groups. Since grouping in data mining deals with a large mass of data, it lessens the amount of time spent with the source data, and grouping techniques that shrink the quantity of the data form to which the algorithm is subjected are actively used. The current grouping algorithm is highly sensitive to static and reacts to local minima. The number of groups has to be stated depending on the initialization value. In this paper we propose a gene algorithm that automatically decides on the number of grouping algorithms. We will try to find the optimal group of the fittest function, and finally apply it to a data mining problem that deals with a large mass of data.

I-Tree: A Frequent Patterns Mining Approach without Candidate Generation or Support Constraint

  • Tanbeer, Syed Khairuzzaman;Sarkar, Jehad;Jeong, Byeong-Soo;Lee, Young-Koo;Lee, Sung-Young
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2007년도 춘계학술발표대회
    • /
    • pp.31-33
    • /
    • 2007
  • Devising an efficient one-pass frequent pattern mining algorithm has been an issue in data mining research in recent past. Pattern growth algorithms like FP-Growth which are found more efficient than candidate generation and test algorithms still require two database scans. Moreover, FP-growth approach requires rebuilding the base-tree while mining with different support counts. In this paper we propose an item-based tree, called I-Tree that not only efficiently mines frequent patterns with single database scan but also provides multiple mining scopes with multiple support thresholds. The 'build-once-mine-many' property of I-Tree allows it to construct the tree only once and perform mining operation several times with the variation of support count values.

  • PDF

FEROM: Feature Extraction and Refinement for Opinion Mining

  • Jeong, Ha-Na;Shin, Dong-Wook;Choi, Joong-Min
    • ETRI Journal
    • /
    • 제33권5호
    • /
    • pp.720-730
    • /
    • 2011
  • Opinion mining involves the analysis of customer opinions using product reviews and provides meaningful information including the polarity of the opinions. In opinion mining, feature extraction is important since the customers do not normally express their product opinions holistically but separately according to its individual features. However, previous research on feature-based opinion mining has not had good results due to drawbacks, such as selecting a feature considering only syntactical grammar information or treating features with similar meanings as different. To solve these problems, this paper proposes an enhanced feature extraction and refinement method called FEROM that effectively extracts correct features from review data by exploiting both grammatical properties and semantic characteristics of feature words and refines the features by recognizing and merging similar ones. A series of experiments performed on actual online review data demonstrated that FEROM is highly effective at extracting and refining features for analyzing customer review data and eventually contributes to accurate and functional opinion mining.

메타데이터 기반 개인용 미디어 검색/관리 시스템 (A Personal Media Search/Management System based on Metadata)

  • 김현기;허정;서희철;임수종;황이규;장명길
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2006년도 추계학술발표대회
    • /
    • pp.153-156
    • /
    • 2006
  • 최근 개인 컴퓨터에 저장되는 다양한 미디어 정보에 대한 검색요구가 크게 대두되면서, 다양한 데스크톱 검색 시스템이 출현하고 있다. 그러나, 기존 데스크톱 검색 시스템은 파일명, 일부 제한된 메타데이터, 콘텐츠들에 대한 키워드 기반의 검색을 수행하기 때문에 사용자의 요구에 부합하는 결과를 정확하게 제시하는 못하는 문제점이 있다. 본 논문에서는 이와 같은 문제점을 해결하기 위해서 시맨틱웹 기술을 활용하여, 온톨로지에 기반한 메타데이터를 정의하고 이를 기반으로 메타데이터간의 의미적 연관성에 기반한 시맨틱 데스크톱 검색/관리 시스템에 대해 기술한다.

  • PDF

A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases

  • Ahmed, Chowdhury Farhan;Tanbeer, Syed Khairuzzaman;Jeong, Byeong-Soo
    • ETRI Journal
    • /
    • 제32권5호
    • /
    • pp.676-686
    • /
    • 2010
  • Mining sequential patterns is an important research issue in data mining and knowledge discovery with broad applications. However, the existing sequential pattern mining approaches consider only binary frequency values of items in sequences and equal importance/significance values of distinct items. Therefore, they are not applicable to actually represent many real-world scenarios. In this paper, we propose a novel framework for mining high-utility sequential patterns for more real-life applicable information extraction from sequence databases with non-binary frequency values of items in sequences and different importance/significance values for distinct items. Moreover, for mining high-utility sequential patterns, we propose two new algorithms: UtilityLevel is a high-utility sequential pattern mining with a level-wise candidate generation approach, and UtilitySpan is a high-utility sequential pattern mining with a pattern growth approach. Extensive performance analyses show that our algorithms are very efficient and scalable for mining high-utility sequential patterns.

Environmental Consciousness Data Modeling by Association Rules

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권3호
    • /
    • pp.529-538
    • /
    • 2005
  • Data mining is the method to find useful information for large amounts of data in database. It is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are association rules, decision tree, clustering, neural network and so on. Association rule mining searches for interesting relationships among items in a riven large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary quality measures for association rule, support and confidence and lift. We analyze Gyeongnam social indicator survey data using association rule technique for environmental information discovery. We can use to environmental preservation and environmental improvement by association rule outputs.

  • PDF

Comparison of Multiway Discretization Algorithms for Data Mining

  • Kim, Jeong-Suk;Jang, Young-Mi;Na, Jong-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권4호
    • /
    • pp.801-813
    • /
    • 2005
  • The discretization algorithms for continuous data have been actively studied in the area of data mining. These discretizations are very important in data analysis, especially for efficient model selection in data mining. So, in this paper, we introduce the principles of some mutiway discretization algorithms including KEX, 1R and CN4 algorithm and investigate the efficiency of these algorithms through numerical study. For various underlying distribution, we compare these algorithms in view of misclassification rate.

  • PDF