• 제목/요약/키워드: Data reduction

검색결과 6,299건 처리시간 0.031초

Issues and Empirical Results for Improving Text Classification

  • Ko, Young-Joong;Seo, Jung-Yun
    • Journal of Computing Science and Engineering
    • /
    • 제5권2호
    • /
    • pp.150-160
    • /
    • 2011
  • Automatic text classification has a long history and many studies have been conducted in this field. In particular, many machine learning algorithms and information retrieval techniques have been applied to text classification tasks. Even though much technical progress has been made in text classification, there is still room for improvement in text classification. In this paper, we will discuss remaining issues in improving text classification. In this paper, three improvement issues are presented including automatic training data generation, noisy data treatment and term weighting and indexing, and four actual studies and their empirical results for those issues are introduced. First, the semi-supervised learning technique is applied to text classification to efficiently create training data. For effective noisy data treatment, a noisy data reduction method and a robust text classifier from noisy data are developed as a solution. Finally, the term weighting and indexing technique is revised by reflecting the importance of sentences into term weight calculation using summarization techniques.

Integral Field Spectroscopic Data Reduction Method for High Resolution Infrared Observation

  • Lee, Sung-Ho;Pak, Soo-Jong;Choi, Min-Ho
    • Journal of Astronomy and Space Sciences
    • /
    • 제27권4호
    • /
    • pp.309-318
    • /
    • 2010
  • We introduce a technical approach for reducing three-dimensional infrared (IR) spectroscopic data generated by integral field spectroscopy or slit-scanning observations. The first part of data reduction using IRAF presents a guideline for processing spectral images from long-slit IR spectroscopy. Multichannel image reconstruction, Image Analysis and Display (MIRIAD) is used in the later part to construct and analyze the data cubes which contain spatial and kinematic information of the objects. This technic has been applied to a sample data set of diffuse 2.1218 ${\mu}m$ $H_2$ 1-0 S(1) emission features observed by slit-scanning around Sgr A East in the Galactic center. Details of image processing for the high-dispersion infrared data are described to suggest a sequence of contamination cleaning and distortion correction. Practical solutions for handling data cubes are presented for survey observations with various configurations of slit positioning.

데이터 마이닝 기반의 6 시그마 방법론 : 철강산업 적용사례 (A Six Sigma Methodology Using Data Mining : A Case Study of "P" Steel Manufacturing Company)

  • 장길상
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제20권3호
    • /
    • pp.1-24
    • /
    • 2011
  • Recently, six sigma has been widely adopted in a variety of industries as a disciplined, data-driven problem solving approach or methodology supported by a handful of powerful statistical tools in order to reduce variation through continuous process improvement. Also, data mining has been widely used to discover unknown knowledge from a large volume of data using various modeling techniques such as neural network, decision tree, regression analysis, etc. This paper proposes a six sigma methodology based on data mining for effectively and efficiently processing massive data in driving six sigma projects. The proposed methodology is applied in the hot stove system which is a major energy-consuming process in a "P" steel company for improvement of heat efficiency through reduction of energy consumption. The results show optimal operation conditions and reduction of the hot stove energy cost by 15%.

Adaptive B-spline volume representation of measured BRDF data for photorealistic rendering

  • Park, Hyungjun;Lee, Joo-Haeng
    • Journal of Computational Design and Engineering
    • /
    • 제2권1호
    • /
    • pp.1-15
    • /
    • 2015
  • Measured bidirectional reflectance distribution function (BRDF) data have been used to represent complex interaction between lights and surface materials for photorealistic rendering. However, their massive size makes it hard to adopt them in practical rendering applications. In this paper, we propose an adaptive method for B-spline volume representation of measured BRDF data. It basically performs approximate B-spline volume lofting, which decomposes the problem into three sub-problems of multiple B-spline curve fitting along u-, v-, and w-parametric directions. Especially, it makes the efficient use of knots in the multiple B-spline curve fitting and thereby accomplishes adaptive knot placement along each parametric direction of a resulting B-spline volume. The proposed method is quite useful to realize efficient data reduction while smoothing out the noises and keeping the overall features of BRDF data well. By applying the B-spline volume models of real materials for rendering, we show that the B-spline volume models are effective in preserving the features of material appearance and are suitable for representing BRDF data.

Data Supply Voltage Reduction Scheme for Low-Power AMOLED Displays

  • Nam, Hyoungsik;Jeong, Hoon
    • ETRI Journal
    • /
    • 제34권5호
    • /
    • pp.727-733
    • /
    • 2012
  • This paper demonstrates a new driving scheme that allows reducing the supply voltage of data drivers for low-power active matrix organic light-emitting diode (AMOLED) displays. The proposed technique drives down the data voltage range by 50%, which subsequently diminishes in the peak power consumption of data drivers at the full white pattern by 75%. Because the gate voltage of a driving thin film transistor covers the same range as a conventional driving scheme by means of a level-shifting scheme, the low-data supply scheme achieves the equivalent dynamic range of OLED currents. The average power consumption of data drivers is reduced by 60% over 24 test images, and power consumption is kept below 25%.

센서 네트워크를 위한 PCA 기반의 데이터 스트림 감소 기법 (A PCA-based Data Stream Reduction Scheme for Sensor Networks)

  • 알렉산더 페도시브;최영환;황인준
    • 인터넷정보학회논문지
    • /
    • 제10권4호
    • /
    • pp.35-44
    • /
    • 2009
  • 데이터 스트림이란 새로운 개념과 기존의 단순 데이터 사이에 존재하는 개념적 차이를 극복하기 위해서는 많은 연구가 필요하다. 대표적인 예로써 센서 네크워크에서의 데이터 스트림 처리를 들 수 있는 데, 이를 위해서는 대역폭이나 에너지, 메모리와 같은 자원적 한계에서 부터 연속 질의를 포함하는 질의처리의 특수성까지 고려해야 할 대상이 광범위하다. 본 논문에서는 데이터 스트림 처리에서의 물리적 제약사항에 해당하는 한정된 메모리 문제를 해결하기 위해 PCA 기법을 기반으로 하는 데이터 스트림 축소 방안을 제안하다. PCA는 상호 관련된 다수의 변수들을 관련이 없는 적은 수의 변수로 변환해준다. 본 논문에서는 질의 처리 엔진의 협력을 가정하고서 센서 네크워크의 스트림 데이터 처리를 위해 PCA 기법을 적용하며, 다른 센서로부터 얻어진 많은 측정값 사이에 시공간적 관련성을 이용한다. 최종적으로 그러한 데이터 처리를 위한 프레임워크를 제시하고 다양한 실험을 통하여 기법의 성능을 분석한다.

  • PDF

Risk Perceptions and Risk-reduction Strategies in Internet Apparel Shopping

  • Lee, Mi-Young
    • 패션비즈니스
    • /
    • 제9권3호
    • /
    • pp.134-149
    • /
    • 2005
  • Although Internet retailing is becoming a viable channel for apparel retailing, consumers are still reluctant to use Internet for apparel purchasing because at their concerns at Internet security and the difficulties at virtual shopping in unfamiliar shopping environment. The purpose at this study is to examine the nature at perceived risk associated with Internet apparel shopping and risk-reduction strategies used by Internet apparel shoppers. The data were collected via an online survey by a online research company. A total at 4,254 Internet users participated in this survey. Among these Internet users, 1,146 respondents had previous shopping experience in Internet shopping. Within this group, 195 were Internet apparel information seekers, and 589 were Internet apparel purchasers. Descriptive statistics, analysis of variance, and t-test were used to analyze the data. The perceived risks and risk-reduction strategies used by Internet apparel no-interest shoppers, Internet apparel information seekers (browsers), moderate Internet apparel purchasers, heavy Internet apparel purchasers were examined and compared. The results indicated that these tour groups were significantly different in apparel related risk, performance risk, and privacy risk. Internet purchasers tend to perceive more apparel-related, performance, and privacy risks than others. The results also indicated that these tour groups were significantly different in their opinions of risk-reduction strategies.

자동차 엔진 크랭크축의 진동해석을 위한 자유도 저감법 개발에 관한 연구 (A Proposed Reduction Method for Vibatiton Analysis of Automobile Engine Crakshfts)

  • 최명진
    • 한국생산제조학회지
    • /
    • 제5권2호
    • /
    • pp.29-37
    • /
    • 1996
  • High speed engines with high power are increasingly on demands and almost engines employ crankshafts Such problems as bending and torsional vibrations become the point at issue in crankshaft analysis and design. In this study to overcome the diffiiculty with the large amount of computation in finite element vibration analysis of a crankshaft, a reduction method based on influence coefficient and lumped parameter is presented. which reduces the computation amount effectively and can be used in vibrational analysis and design of any types of crankshafts Crank journal and pinparts are meodelled as elements with 6degrees of freedom per node. Crank web part is modelled using equivalent mass and stiffness matices . based up on lumped parameter and influence coefficient respectively to reduce total degrees of freedom considerablely. To confirm the scheme of the study the results are compared with the known data and they are coincident. Also a simple crankshaft is designed and manufactured for experiments. The calculated results using reduction method and the experimental results agree well The scheme of this study can be utilized in evaluation results agree well. The calculated result are compared with the known data and they are coincident. Also a simple crankshaft is designed and manufactured for experiments. The calculated results using reduction method and the experimental results agree well. The scheme of this study can be utilized in evaluation and development of high speed engine.

  • PDF

Prototype Reduction Schemes와 Mahalanobis 거리를 이용한 Relational Discriminant Analysis (Relational Discriminant Analysis Using Prototype Reduction Schemes and Mahalanobis Distances)

  • 김상운
    • 전자공학회논문지CI
    • /
    • 제43권1호
    • /
    • pp.9-16
    • /
    • 2006
  • RDA(Relational Discriminant Analysis)는 패턴의 특징벡터 대신에 학습 패턴을 대표하는 프로토타입들과의 비유사도 벡터에 기반하여 식별기를 설계하는 방법이다. 따라서 RDA 식별기의 성능은 프로토타입을 선택하는 방법과 비유사도를 측정하는 방법에 따라 결정된다. 본 논문에서는 PRS(Prototype Reduction Schemes)를 이용하여 프로토타입을 추출한 다음, 샘플 벡터들간의 마할라노비스 거리에 의한 상관행렬로 RDA의 식별성능을 향상시키는 방법을 제안한다. 인공 데이터 및 실-생활 데이터를 대상으로 실험한 결과, 제안한 방법의 식별성능이 기존의 방법에 비하여 개선되었음을 확인하였다.

Volumetric changes in the lumpectomy cavity during whole breast irradiation after breast conserving surgery

  • Cho, Heung-Lae;Kim, Cheol-Jin
    • Radiation Oncology Journal
    • /
    • 제29권4호
    • /
    • pp.277-282
    • /
    • 2011
  • Purpose: This study was performed to evaluate the change in the lumpectomy cavity volumes before and after whole breast radiation therapy (WBRT) and to identify factors associated with the change of volume. Materials and Methods: From September 2009 to April 2010, the computed tomography (CT) simulation data from 70 patients obtained before and after WBRT was evaluated. The lumpectomy cavity volumes were contoured based on surgical clips, seroma, and postoperative changes. Significant differences in the data from pre-WBRT CT and post-WBRT CT were assessed. Multiple variables were examined for correlation with volume reduction in the lumpectomy cavity. Results: The mean and median volume reduction in the lumpectomy cavity after WBRT were 17.6 $cm^3$ and 16.1 $cm^3$, respectively with the statistical significance (p < 0.001). The volume reduction in the lumpectomy cavity was inversely correlated with time from surgery to radiation therapy (R = 0.390). The presence of seroma was significantly associated with a volumetric change in the lumpectomy cavity after WBRT (p = 0.011). Conclusion: The volume of lumpectomy cavity reduced significantly after WBRT. As the time from surgery to the start ot WBRT increased, the volume reduction in the lumpectomy cavity during WBRT decreased. A strong correlation was observed between the presence of seroma and the reduced volume. To ensure appropriate coverage and to limit normal tissue exposure during boost irradiation in patients who has seroma at the time of starting WBRT, repeating CT simulation at boost planning is suggested.