• 제목/요약/키워드: Data bin

검색결과 1,564건 처리시간 0.024초

포아송 분포를 가정한 Wafer 수준 Statistical Bin Limits 결정방법과 표본크기 효과에 대한 평가 (Methods and Sample Size Effect Evaluation for Wafer Level Statistical Bin Limits Determination with Poisson Distributions)

  • 박성민;김영식
    • 산업공학
    • /
    • 제17권1호
    • /
    • pp.1-12
    • /
    • 2004
  • In a modern semiconductor device manufacturing industry, statistical bin limits on wafer level test bin data are used for minimizing value added to defective product as well as protecting end customers from potential quality and reliability excursion. Most wafer level test bin data show skewed distributions. By Monte Carlo simulation, this paper evaluates methods and sample size effect regarding determination of statistical bin limits. In the simulation, it is assumed that wafer level test bin data follow the Poisson distribution. Hence, typical shapes of the data distribution can be specified in terms of the distribution's parameter. This study examines three different methods; 1) percentile based methodology; 2) data transformation; and 3) Poisson model fitting. The mean square error is adopted as a performance measure for each simulation scenario. Then, a case study is presented. Results show that the percentile and transformation based methods give more stable statistical bin limits associated with the real dataset. However, with highly skewed distributions, the transformation based method should be used with caution in determining statistical bin limits. When the data are well fitted to a certain probability distribution, the model fitting approach can be used in the determination. As for the sample size effect, the mean square error seems to reduce exponentially according to the sample size.

A Better Prediction for Higher Education Performance using the Decision Tree

  • Hilal, Anwar;Zamani, Abu Sarwar;Ahmad, Sultan;Rizwanullah, Mohammad
    • International Journal of Computer Science & Network Security
    • /
    • 제21권4호
    • /
    • pp.209-213
    • /
    • 2021
  • Data mining is the application of specific algorithms for extracting patterns from data and KDD is the automated or convenient extraction of patterns representing knowledge implicitly stored or captured in large databases, data warehouses, the Web, other massive information repositories or data streams. Data mining can be used for decision making in educational system. But educational institution does not use any knowledge discovery process approach on these data; this knowledge can be used to increase the quality of education. The problem was happening in the educational management system, but to make education system more flexible and discover knowledge from it huge data, we will use data mining techniques to solve problem.

TAC를 반영한 BIN 데이터 기반의 냉난방 부하 변화에 관한 연구 (Heating and Cooling Load Evaluation Study with TAC Based BIN Data)

  • 이광섭;김유진;민경천;이의준;강은철
    • 설비공학논문집
    • /
    • 제29권9호
    • /
    • pp.463-471
    • /
    • 2017
  • According to the Korea industrial standard of air conditioning systems (KS C 9306), cooling and heating loads for buildings can be calculated by using maximum and minimum temperature in BIN data. Cooling and heating loads can be determined by building set temperature and ambient temperature. Cooling and heating system capacity of buildings can be normally designed according to determined heating and cooling loads. Cooling and heating system capacity can be reduced by updated BIN data, applying TAC (Technical Advisory Committee) values. In this study, updated BIN data have been analyzed using ambient temperature of 19 areas in Korea for the last 10 years (2005~2014) provided by KMA (Korea Meteorological Administration). Building cooling and heating loads have been calculated following TAC based BIN data. As a result, designed system capacity decreased depending on applying TAC. Those were reduced as 7.1% ($100m^2$ building), 8.7% ($1,000m^2$ building) in cooling capacity, 11.7% in heating capacity when TAC 2.5% applied. And also, it is expected system initial and operating cost by decreasing system capacity.

Obesity Level Prediction Based on Data Mining Techniques

  • Alqahtani, Asma;Albuainin, Fatima;Alrayes, Rana;Al muhanna, Noura;Alyahyan, Eyman;Aldahasi, Ezaz
    • International Journal of Computer Science & Network Security
    • /
    • 제21권3호
    • /
    • pp.103-111
    • /
    • 2021
  • Obesity affects individuals of all gender and ages worldwide; consequently, several studies have performed great works to define factors causing it. This study develops an effective method to trace obesity levels based on supervised data mining techniques such as Random Forest and Multi-Layer Perception (MLP), so as to tackle this universal epidemic. Notably, the dataset was from countries like Mexico, Peru, and Colombia in the 14- 61year age group, with varying eating habits and physical conditions. The data includes 2111 instances and 17 attributes labelled using NObesity, which facilitates categorization of data using Overweight Levels l I and II, Insufficient Weight, Normal Weight, as well as Obesity Type I to III. This study found that the highest accuracy was achieved by Random Forest algorithm in comparison to the MLP algorithm, with an overall classification rate of 96.7%.

간역열부하계산용(簡易熱負荷計算用) Bin기상(氣象)데이터 (Development of Bin Weather Data for Simplified Energy Calculations)

  • 김두천;최진희
    • 대한설비공학회지:설비저널
    • /
    • 제17권1호
    • /
    • pp.28-43
    • /
    • 1988
  • The purpose of this research is to produce bin weather data for Seoul from Standard Weather Data. The intended use of these data is for input to recently developed models for simplified energy calculations and for generating variable-base degree-day information. The data produced under this study include $3^{\circ}C$ bin data covering the full range of dry-bulb temperatures with mean coincident wet-bulb and daytime coincident solar radiation, wet-bulb bins down to freezing temperature, wind speed bins with prevailing directions, and heating and cooling degree hours to nine different temperature bases. All of these data are tabulated in six separate time periods and total daily categories for monthly and annual periods.

  • PDF

다중 방송 채널에 데이터 할당을 위한 두 단계 저장소-적재 알고리즘 (Two Level Bin-Packing Algorithm for Data Allocation on Multiple Broadcast Channels)

  • 권혁민
    • 한국멀티미디어학회논문지
    • /
    • 제14권9호
    • /
    • pp.1165-1174
    • /
    • 2011
  • 데이터 방송 시스템에서 서버는 방송 채널을 통하여 데이터들을 지속적으로 전파하고, 이동 클라이언트는 자신이 원하는 데이터가 방송 채널에 나타나기를 기다리기만 하면 된다. 그러나 방송 채널은 많은 데이터들에 의해 공유되어야 하므로, 원하는 데이터를 수신하기까지 예상 지연시간이 증가할 수 있다. 본 논문은 전체 데이터들의 예상 지연시간을 최소화하기 위하여 다중 방송 채널에 적절하게 데이터를 할당하기 위한 주제를 연구하여 TLBP(Two Level Bin-Packing)로 명명된 새로운 데이터 할당 기법을 제안한다. 본 논문은 우선 평균 예상지연시간의 이론적 하한 값을 소개하고, 이 값에 기초하여 저장소의 용량을 결정한다. TLBP 기법은 저장소-적재 알고리즘을 이용하여 전체 데이터들을 다수 개의 그룹으로 분할하고, 각 그룹의 데이터들을 각 채널에 배정한다. TLBP는 저장소-적재 알고리즘을 두 단계로 적용함에 의해, 동일 방송 채널에 할당된 데이터들의 액세스 확률의 차이를 방송 스케줄에 반영할 수 있어 성능을 향상시킬 수 있다. TLBP와 세가지의 기존 기법과 성능을 비교하기 위하여 시뮬레이션이 수행되었다. 시뮬레이션 결과에 의하면 TLBP는 합리적인 실행부담을 가지면서도 평균 예상지연시간의 성능에 있어서 다른 기법보다 우수한 성능을 보인다.

FCDD 기반 웨이퍼 빈 맵 상의 결함패턴 탐지 (Detection of Defect Patterns on Wafer Bin Map Using Fully Convolutional Data Description (FCDD) )

  • 장승준;배석주
    • 산업경영시스템학회지
    • /
    • 제46권2호
    • /
    • pp.1-12
    • /
    • 2023
  • To make semiconductor chips, a number of complex semiconductor manufacturing processes are required. Semiconductor chips that have undergone complex processes are subjected to EDS(Electrical Die Sorting) tests to check product quality, and a wafer bin map reflecting the information about the normal and defective chips is created. Defective chips found in the wafer bin map form various patterns, which are called defective patterns, and the defective patterns are a very important clue in determining the cause of defects in the process and design of semiconductors. Therefore, it is desired to automatically and quickly detect defective patterns in the field, and various methods have been proposed to detect defective patterns. Existing methods have considered simple, complex, and new defect patterns, but they had the disadvantage of being unable to provide field engineers the evidence of classification results through deep learning. It is necessary to supplement this and provide detailed information on the size, location, and patterns of the defects. In this paper, we propose an anomaly detection framework that can be explained through FCDD(Fully Convolutional Data Description) trained only with normal data to provide field engineers with details such as detection results of abnormal defect patterns, defect size, and location of defect patterns on wafer bin map. The results are analyzed using open dataset, providing prominent results of the proposed anomaly detection framework.

농업차륜(農業車輪)의 성능평가(性能評價)를 위한 인공토조(人工土槽)시스템의 제작(製作) 및 자료수집(資料蒐集) 시스템의 구성(構成) (Construction of the Soil Bin System and Associated Micro computer-Based Data Acquisition System for the Evaluation of Wheel Performance)

  • 이규승;정창주;이용국;박승제
    • Journal of Biosystems Engineering
    • /
    • 제13권2호
    • /
    • pp.28-37
    • /
    • 1988
  • This study was conducted to construct the soil bin system and associated microcomputer-based data acquisition system which is to be used for the effective evaluation of wheel performance. The soil bin system consists of four main parts; soil bin, carriage drive system, test carriage and soil processing carriage. The test carriage was constructed to measure the five performance parameters of testing wheels; pulling forte, motion resistance, sinkage and rotational speed of test wheel, and speed of test carriage. The test wheel is powered by a hydraulic system up to 8 ps. Soil processing carriage was designed to provide uniform test soil condition across the toil bin, and reproduction of soil conditions found satisfiable. The data acquisition system consists of APPLE II PLUS microcomputer, strain amplifier, I/O interface, A/D converter, digital counter and various transducers. It takes about 0.86 seconds to measure a set of performance parameters and store on the floppy disk simultaneously. Series of experiment showed that this system can be used effectively for evaluating the wheel performance associated with soil.

  • PDF

Analysis and Comparison of Sorting Algorithms (Insertion, Merge, and Heap) Using Java

  • Khaznah, Alhajri;Wala, Alsinan;Sahar, Almuhaishi;Fatimah, Alhmood;Narjis, AlJumaia;Azza., A.A
    • International Journal of Computer Science & Network Security
    • /
    • 제22권12호
    • /
    • pp.197-204
    • /
    • 2022
  • Sorting is an important data structure in many applications in the real world. Several sorting algorithms are currently in use for searching and other operations. Sorting algorithms rearrange the elements of an array or list based on the elements' comparison operators. The comparison operator is used in the accurate data structure to establish the new order of elements. This report analyzes and compares the time complexity and running time theoretically and experimentally of insertion, merge, and heap sort algorithms. Java language is used by the NetBeans tool to implement the code of the algorithms. The results show that when dealing with sorted elements, insertion sort has a faster running time than merge and heap algorithms. When it comes to dealing with a large number of elements, it is better to use the merge sort. For the number of comparisons for each algorithm, the insertion sort has the highest number of comparisons.

Douglas Peucker 근사화 알고리즘과 빈 분류 기반 벡터 맵 데이터 압축 (Vector Map Data compression based on Douglas Peucker Simplification Algorithm and Bin Classification)

  • 박진혁;장봉주;권오준;정재진;이석환;권기룡
    • 한국멀티미디어학회논문지
    • /
    • 제18권3호
    • /
    • pp.298-311
    • /
    • 2015
  • Vector data represents a map by its coordinate and Raster data represents a map by its pixel. Since these data types have very large data size, data compression procedure is a compulsory process. This paper compare the results from three different methodologies; GIS (Geographic Information System) vector map data compression using DP(Douglas-Peucker) Simplification algorithm, vector data compression based on Bin classification and the combination between two previous methods. The results shows that the combination between the two methods have the best performance among the three tested methods. The proposed method can achieve 4-9% compression ratio while the other methods show a lower performance.