• Title/Summary/Keyword: Top-K mining

Performance Analysis of Top-K High Utility Pattern Mining Methods (상위 K 하이 유틸리티 패턴 마이닝 기법 성능분석)

  • Ryang, Heungmo;Yun, Unil;Kim, Chulhong
    • Journal of Internet Computing and Services, v.16 no.6, pp.89-95, 2015
  • Traditional frequent pattern mining discovers valid patterns whose frequency is no lower than a user-defined minimum threshold. In this framework, a threshold that is too low may extract an enormous number of patterns, which makes result analysis difficult, while one that is too high may yield no valid pattern. Setting an appropriate threshold is not easy because it requires prior knowledge of the domain. Because the framework cannot predict or precisely control mining results from a given threshold, a pattern mining approach that does not depend on domain knowledge became necessary. Top-k frequent pattern mining was proposed to solve this problem: it mines the top-k important patterns without any threshold setting. With this method, users can find patterns ranging from the one with the highest frequency to the one with the k-th highest frequency, regardless of the database. In this paper, we provide background on both frequent and top-k pattern mining. Although top-k frequent pattern mining extracts the top-k significant patterns without a threshold, it cannot consider either item quantities in transactions or the relative importance of items in databases, so it cannot meet the requirements of many real-world applications. That is, in such applications patterns with low frequency can be meaningful, and vice versa. High utility pattern mining was proposed to reflect the characteristics of non-binary databases, but it requires a minimum threshold. Recently, top-k high utility pattern mining has been developed, through which users can mine the desired number of high utility patterns without prior knowledge. In this paper, we analyze two algorithms for top-k high utility pattern mining in detail. We also conduct various experiments with the algorithms on real datasets and, through performance analysis of the experimental results, study points for improvement and development directions for top-k high utility pattern mining.
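The difference between ranking by frequency and ranking by utility, as described in the abstract above, can be illustrated with a small brute-force sketch. This is not either of the two algorithms the paper analyzes (the abstract does not name them); it simply enumerates every itemset, which is only practical for tiny inputs. The transactions, the `profit` table, and the function names are hypothetical.

```python
from itertools import combinations
import heapq

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"b": 1, "c": 3},
    {"a": 1, "b": 1, "c": 1},
]
# Hypothetical unit profit (relative importance) of each item.
profit = {"a": 5, "b": 2, "c": 1}


def itemset_utility(itemset, transactions, profit):
    """Sum quantity * unit profit over every transaction containing the whole itemset."""
    total = 0
    for t in transactions:
        if all(item in t for item in itemset):
            total += sum(t[item] * profit[item] for item in itemset)
    return total


def top_k_high_utility(transactions, profit, k):
    """Return the k itemsets with the highest utility; no minimum threshold is needed."""
    items = sorted(profit)
    scored = []
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, profit)
            if utility > 0:
                scored.append((utility, itemset))
    # Brute force: score everything, then keep the k largest.
    return heapq.nlargest(k, scored)


for utility, itemset in top_k_high_utility(transactions, profit, 3):
    print(itemset, utility)
```

In this toy database the single items `a`, `b`, and `c` each occur in three transactions, so frequency alone cannot separate them, while utility ranks `a` well above the other two because of its higher unit profit.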

Case study of the mining-induced stress and fracture network evolution in longwall top coal caving

  • Li, Cong;Xie, Jing;He, Zhiqiang;Deng, Guangdi;Yang, Bengao;Yang, Mingqing
    • Geomechanics and Engineering, v.22 no.2, pp.133-142, 2020
  • The evolution of the mining-induced fracture network formed during longwall top coal caving (LTCC) has a great influence on gas drainage, roof control, the top coal recovery ratio, and the engineering safety of aquifers. To reveal the evolution of the mining-induced stress and fracture network formed during LTCC, the fracture network in front of the working face was observed by borehole video experiments. A discrete element model was established with the universal discrete element code (UDEC) to explore the local stress distribution. The regression relationship between the fractal dimension of the fracture network and mining stress was established. The results revealed the following: (1) The mining disturbance had the most severe impact on the borehole depth range between approximately 10 m and 25 m. (2) The distribution of fractures was related to the lithology and its integrity. The coal seam contained mainly microfractures, which formed a complex fracture network. The hard rock stratum mainly contained longitudinal cracks and separated fissures. (3) Through numerical simulation, the stress distribution in front of the mining face and the development of fracturing in the overlying rock were obtained. There was a quadratic relationship between the fractal dimension of the fractures and the mining stress. The results obtained herein will provide a reference for engineering projects under similar geological conditions.

Deep Learning Framework with Convolutional Sequential Semantic Embedding for Mining High-Utility Itemsets and Top-N Recommendations

  • Siva S;Shilpa Chaudhari
    • Journal of information and communication convergence engineering, v.22 no.1, pp.44-55, 2024
  • High-utility itemset mining (HUIM) is a dominant technology that enables enterprises to make real-time decisions in areas including supply chain management, customer segmentation, and business analytics. However, classical support value-driven Apriori solutions are limited and unable to meet real-time enterprise demands, especially for large amounts of input data. This study introduces a model for top-N high utility itemset mining in real-time enterprise applications. Unlike traditional Apriori-based solutions, the proposed convolutional sequential embedding metrics-driven cosine-similarity-based multilayer perceptron learning model leverages global and contextual features, including semantic attributes, for enhanced top-N recommendations over sequential transactions. MATLAB-based simulations of the model on diverse datasets demonstrated a precision of 0.5632, a mean absolute error (MAE) of 0.7610, a hit rate (HR)@K of 0.5720, and a normalized discounted cumulative gain (NDCG)@K of 0.4268. The average MAE across different datasets and latent dimensions was 0.608. Additionally, the model achieved cumulative accuracy and precision of 97.94% and 97.04%, respectively, surpassing existing state-of-the-art models. This affirms the robustness and effectiveness of the proposed model in real-time enterprise scenarios.
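For readers unfamiliar with the HR@K and NDCG@K metrics reported above, the following is a minimal sketch of one common way to compute them, assuming a single held-out target item per user. The abstract does not describe the paper's evaluation protocol, so this setup, the function names, and the sample data are assumptions for illustration only.

```python
import math


def hit_rate_at_k(ranked_lists, targets, k):
    """Fraction of users whose held-out target appears in their top-k list."""
    hits = sum(1 for ranked, target in zip(ranked_lists, targets) if target in ranked[:k])
    return hits / len(targets)


def ndcg_at_k(ranked_lists, targets, k):
    """Mean NDCG@k with one relevant item per user (ideal DCG is then 1)."""
    total = 0.0
    for ranked, target in zip(ranked_lists, targets):
        top = ranked[:k]
        if target in top:
            position = top.index(target)  # 0-based rank of the hit
            total += 1.0 / math.log2(position + 2)
    return total / len(targets)


# Hypothetical recommendation lists for three users and their held-out items.
ranked_lists = [["x", "y", "z"], ["y", "z", "x"], ["z", "x", "y"]]
targets = ["y", "x", "q"]

print(hit_rate_at_k(ranked_lists, targets, 3))
print(ndcg_at_k(ranked_lists, targets, 3))
```

Both metrics reward placing the target inside the top K; NDCG additionally discounts hits that appear lower in the list.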

A Study on Environmental Monitoring of Open-cut Mining Ground Using Remote Sensing Technique

  • Tanaka Yoshiki;Tachiiri Kaoru;Gotoh Keinosuke;Hamamoto Ryota
    • Proceedings of the KSRS Conference, 2004.10a, pp.549-552, 2004
  • Since open-cut mining excavates gradually from the top of the mountain, vegetation planting is needed to reduce the negative impact on the surrounding environment. Accordingly, this study aimed to perform environmental monitoring of open-cut mining ground using satellite remote sensing. To grasp the environmental change around the open-cut mining ground, the NDVI (normalized difference vegetation index) was calculated, and the year-to-year change in vegetation activity was analyzed. The results showed lower vegetation activity in the open-cut mining ground than in the surrounding areas and suggested the need for close monitoring by remote sensing techniques.
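The NDVI mentioned in the abstract above is conventionally computed from near-infrared and red reflectance as (NIR - Red) / (NIR + Red). The sketch below shows that calculation and a year-to-year comparison; the reflectance values and the function name are hypothetical and are not taken from the study.

```python
def ndvi(nir, red):
    """Normalized difference vegetation index for one pixel; 0.0 if both bands are zero."""
    denominator = nir + red
    if denominator == 0:
        return 0.0
    return (nir - red) / denominator


# Hypothetical (NIR, Red) reflectance for the same pixel in two different years.
reflectance_by_year = {
    "year 1": (0.50, 0.10),
    "year 2": (0.30, 0.20),
}

ndvi_by_year = {year: ndvi(nir, red) for year, (nir, red) in reflectance_by_year.items()}
change = ndvi_by_year["year 2"] - ndvi_by_year["year 1"]

for year, value in ndvi_by_year.items():
    print(year, round(value, 3))
print("change:", round(change, 3))
```

Values near 1 indicate dense, active vegetation, and values near 0 or below indicate bare ground or water, so a negative change suggests declining vegetation activity at that pixel.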

Numerical Analysis of Deep Seawater Flow Disturbance Characteristics Near the Manganese Nodule Mining Device (망간단괴 집광기 주위 해수 유동교란 수치해석)

  • Lim, Sung-Jin;Chae, Yong-Bae;Jeong, Shin-Taek;Cho, Hong-Yeon;Lee, Sang-Ho
    • Ocean and Polar Research, v.36 no.4, pp.475-485, 2014
  • Seawater flow characteristics around a manganese nodule mining device in deep sea were analyzed through numerical investigation. The mining device influences the seawater flow field with complicated velocity distributions, and they are largely dependent on the seawater flow speed, device moving speed, and injection velocity from the collecting part. The flow velocity and turbulent kinetic energy distributions are compared at several positions from the device rear, side, and top, and it is possible to predict the distance from which the mining device affects the seawater flow field through the variation of turbulent kinetic energy. With the operation of the collecting device the turbulent kinetic energy remarkably increases, and it gradually decreases along the seawater flow direction. Turbulent kinetic energy behind the mining system increases with the seawater flow velocity. The transient behavior of nodule particles, which are not collected, is also predicted. This study will be helpful in creating an optimal design for a manganese nodule collecting device that can operate efficiently and which is eco-friendly.

Recognition of Dog Breeds based on Deep Learning using a Random-Label and Web Image Mining (웹 이미지 마이닝과 랜덤 레이블을 이용한 딥러닝 기반 개 품종 인식)

  • Kang, Min-Seok;Hong, Kwang-Seok
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference, 2018.10a, pp.201-202, 2018
  • In this paper, dog breed images from the existing ImageNet and Oxford-IIIT Pet datasets are combined with dog breed images obtained through web image mining, and a random label is added. The paper introduces a method that recognizes 122 classes of dog breeds and 1 class for images that are not dog breeds. The recognition rate using both the conventional DB and the collected DB improved by 1.5% in Top-1 accuracy compared with the recognition rate using only the existing DB. The recognition rate for non-dog images was 93% with 10,000 random DB entries.

Investigation and Analysis for the Status of Urban Mining Industry in Korea (국내 도시광산산업 현황 조사·분석)

  • Kim, Lyung-Joo;Shin, Ho-Jung;Kang, Hong-Yoon
    • Resources Recycling, v.25 no.5, pp.3-13, 2016
  • Statistics on the urban mining industry are essential for developing the industry systematically and a prerequisite for understanding its related trends. The status of the domestic urban mining industry was therefore investigated through an integrated method that uses both a top-down approach based on national statistics and a bottom-up approach based on field data gathering. Results indicated that the scale of metal resources produced through domestic urban mining was 19.6 trillion won, which corresponds to approximately 22 percent of metal demand in Korea. The number of urban mining firms was 917, and they are mostly located in the metropolitan area and Gyeongsang province. It was also found that about 58 percent of urban mining firms were small businesses with fewer than 10 employees. Compared with the results in 2009, the number of urban mining companies in 2014 generally increased, and the number of rare metal companies grew significantly. This study differs from conventional statistical investigations in that it determines the actual scale of metal resources based on field data.

Analysis of the failure mechanism and support technology for the Dongtan deep coal roadway

  • Chen, Miao;Yang, Sheng-Qi;Zhang, Yuan-Chao;Zang, Chuan-Wei
    • Geomechanics and Engineering, v.11 no.3, pp.401-420, 2016
  • The stability of deep coal roadways with large sections and thick top coal is a typical challenge in many coal mines in China. The Universal Discrete Element Code (UDEC) trigon block approach is adopted to create a numerical model based on a case study at the Dongtan coal mine in China, in order to better understand the failure mechanism and stability control mechanism of this kind of roadway. The failure process of an unsupported roadway is simulated, and the results suggest that the deformation of the roof is more serious than that of the sides and floor, especially in the center of the roof. The release of radial stress is more intense than that of tangential stress, and a large zone of relaxation appears around the roadway. The failure process begins with partial failure at the roadway corners and then propagates deeper into the roof and sides, finally resulting in large deformation of the roadway. A combined support system is proposed based on an analysis of the simulation results. The numerical simulation and field monitoring suggest that this support method is feasible both in theory and in practice, and it can provide a helpful reference for research on failure mechanisms and for scientific support design in deep coal mine engineering.

Online Social Media Review Mining for Living Items with Probabilistic Approach: A Case Study

  • Li, Shuai;Hao, Fei;Kim, Hee-Cheol
    • Smart Media Journal, v.2 no.2, pp.20-27, 2013
  • Social media is at the top of the agenda for many business executives, decision makers, and consultants, who try to identify ways in which companies can make profitable use of applications such as Netflix and Flixster. Social media is playing an increasingly important role as an information source for customers making product choices. With the flourishing of Web 2.0 technology, customer reviews are becoming increasingly useful and important information resources that save people time and energy when purchasing the products they want. This paper proposes a Bayesian probabilistic classification algorithm to mine social media reviews and evaluates it on a real data set using different splits and cross-validation. The experimental results show the robustness and effectiveness of the proposed approach for mining social media reviews.

Launching Simulation of Integrated Mining System for Deep-Seabed Mineral Resources (심해저 광물자원 채광시스템의 설치 거동 해석)

  • Hong, Sup;Kim, Hyung-Woo;Choi, Jong-Su;Yeu, Tae-Kyeong
    • Proceedings of the Korea Committee for Ocean Resources and Engineering Conference, 2006.11a, pp.315-318, 2006
  • This paper concerns the coupled dynamic analysis of a deep-seabed mining system during the launching operation. The dynamic behavior of the mining system, consisting of a lifting pipe, buffer station, flexible conduit, and self-propelled miner, is simulated in the time domain. The launching operation is divided into four critical phases: (1) deployment of miner and flexible conduit, (2) deployment of lifting pipe, flexible conduit and miner, (3) touch-down of miner, (4) final launching. The dynamic responses of the sub-systems (miner, flexible conduit, buffer, and lifting pipe) are analyzed in each launching phase. The characteristics of the sub-system responses vary with the period of forced excitation at the top. It has been shown that the total integrated responses of the sub-systems are strongly affected by the design parameters. In particular, the principal dimensions of the flexible conduit seem to be significant in determining the global response. Based on the simulation results, safe operation conditions are investigated.
