• Title/Summary/Keyword: Embedding dimension


Chaotic Analysis of Multi-Sensor Signal in End-Milling Process (엔드밀가공시 복합계측 신호에 의한 공구 마멸의 카오스적 해석)

  • 구세진;이기용;강명창;김정석
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1997.04a
    • /
    • pp.817-821
    • /
    • 1997
  • Ever since the nonlinearity of machine tool dynamics was established, researchers have tried to exploit it to devise better monitoring, diagnostic, and control systems, which had previously been based on linear models. Chaos theory, which explains many nonlinear phenomena, is a convenient basis for extending the analysis with nonlinear models. In this study, a measuring system is constructed using multiple sensors (tool dynamometer, acoustic emission) in the end-milling process. The cutting force is then verified to be low-dimensional deterministic chaos by calculating the Lyapunov exponents, fractal dimension, and embedding dimension. Finally, the relation between the characteristic parameters calculated from the sensor signals and tool wear is investigated.


A Study on the Enhancement of Ultrasonic Signal Recognition in Ferrite Carbon Steel Weld Zone Using Neural Networks (신경회로망을 이용한 페라이트계 탄소강 용접부의 초음파 신호 인식 향상에 관한 연구)

  • Yun, In-Sik;Park, Won-Kyou;Yi, Won
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.19 no.1
    • /
    • pp.158-164
    • /
    • 2002
  • This paper proposes the optimization of ultrasonic signal recognition in the weld zone of ferrite carbon steel using neural networks. For this purpose, ultrasonic signals from defects such as porosity, incomplete penetration, and slag inclusion in the weld zone are acquired as time series data. Feature extraction is then evaluated in the time-frequency-attractor domain, covering time and frequency features (peak to peak, rise time, rise slope, fall time, fall slope, pulse duration, power spectrum, and bandwidth) and attractor characteristics (fractal dimension and attractor quadrant). The proposed neural network system can enhance the performance of ultrasonic signal recognition.

An Analysis of 3-D Object Characteristics Using Locally Linear Embedding (시점별 형상의 지역적 선형 사상을 통한 3차원 물체의 특성 분석)

  • Lee, Soo-Chahn;Yun, Il-Dong
    • Journal of Broadcast Engineering
    • /
    • v.14 no.1
    • /
    • pp.81-84
    • /
    • 2009
  • This paper explores the possibility of describing objects from the change in shape that accompanies a change in viewpoint. Specifically, we sample the shapes of a 3-D model from various viewpoints and apply dimension reduction by locally linear embedding. A low-dimensional distribution of points is constructed, and the characteristics of the object are described from this distribution. We also propose two 3-D retrieval methods, one applying the iterative closest point algorithm and the other applying the Fourier transform and measuring similarity by the modified Hausdorff distance, and present experimental results. The results show that the change in shape according to the change in viewpoint can describe the characteristics of an object.
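The abstract does not define its modified Hausdorff distance. As a minimal sketch, assuming the common Dubuisson-Jain form (the larger of the two directed mean nearest-neighbor distances), the similarity measure could look like the following. The function names are illustrative and not taken from the paper.

```python
import math


def directed_mean_distance(a_points, b_points):
    """Mean, over points in a_points, of the distance to the nearest point in b_points."""
    return sum(
        min(math.dist(a, b) for b in b_points) for a in a_points
    ) / len(a_points)


def modified_hausdorff(a_points, b_points):
    """Modified Hausdorff distance (assumed Dubuisson-Jain form): max of both directed means."""
    return max(
        directed_mean_distance(a_points, b_points),
        directed_mean_distance(b_points, a_points),
    )


if __name__ == "__main__":
    # Two small 2-D point sets standing in for low-dimensional embedded shapes.
    shape_a = [(0.0, 0.0), (1.0, 0.0)]
    shape_b = [(0.0, 0.0), (3.0, 0.0)]
    print(modified_hausdorff(shape_a, shape_b))
```

Averaging the nearest-neighbor distances, rather than taking their maximum as the classical Hausdorff distance does, makes the measure less sensitive to a single outlying point.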

Selective Word Embedding for Sentence Classification by Considering Information Gain and Word Similarity (문장 분류를 위한 정보 이득 및 유사도에 따른 단어 제거와 선택적 단어 임베딩 방안)

  • Lee, Min Seok;Yang, Seok Woo;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.105-122
    • /
    • 2019
  • Dimensionality reduction is one of the methods for handling big data in text mining. For dimensionality reduction, we should consider the density of the data, which has a significant influence on the performance of sentence classification. Higher-dimensional data requires more computation, which can eventually lead to high computational cost and overfitting of the model. Thus, a dimension reduction process is necessary to improve the performance of the model. Diverse methods have been proposed, ranging from merely reducing noise in the data, such as misspellings or informal text, to including semantic and syntactic information. In addition, the representation and selection of text features affect the performance of the classifier in sentence classification, which is one of the fields of Natural Language Processing. The common goal of dimension reduction is to find a latent space that is representative of the raw data in the observation space. Existing methods utilize various algorithms for dimensionality reduction, such as feature extraction and feature selection. In addition to these algorithms, word embeddings, which learn low-dimensional vector space representations of words that capture semantic and syntactic information from data, are also utilized. To improve performance, recent studies have suggested methods in which the word dictionary is modified according to the positive and negative scores of pre-defined words. The basic idea of this study is that similar words have similar vector representations. Once the feature selection algorithm selects the words that are not important, we assume that the words similar to the selected words also have no impact on sentence classification. This study proposes two ways to achieve more accurate classification, which conduct selective word elimination under specific rules and construct word embeddings based on Word2Vec.
To select words of low importance from the text, we use the information gain algorithm to measure importance and cosine similarity to search for similar words. First, we eliminate words that have comparatively low information gain values from the raw text and form the word embedding. Second, we additionally select words that are similar to the words with low information gain values and build the word embedding. Finally, the filtered text and word embeddings are applied to two deep learning models: a Convolutional Neural Network and an Attention-Based Bidirectional LSTM. This study uses customer reviews from Kindle on Amazon.com, IMDB, and Yelp as datasets and classifies each dataset using the deep learning models. Reviews that received more than five helpful votes and whose ratio of helpful votes exceeded 70% were classified as helpful reviews. Yelp, however, shows only the number of helpful votes. We extracted 100,000 reviews with more than five helpful votes by random sampling from 750,000 reviews. Minimal preprocessing, such as removing numbers and special characters from the text, was applied to each dataset. To evaluate the proposed methods, we compared their performance with that of Word2Vec and GloVe word embeddings that used all the words. We showed that one of the proposed methods performs better than the embeddings with all the words. Removing unimportant words improves performance; however, removing too many words lowers it. Future research should consider diverse preprocessing methods and an in-depth analysis of word co-occurrence to measure similarity values among words. Also, we applied the proposed method only with Word2Vec. Other embedding methods such as GloVe, fastText, and ELMo can be combined with the proposed methods, making it possible to identify suitable combinations of word embedding methods and elimination methods.
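The two elimination steps described in this abstract can be sketched as follows. This is an illustrative reading, not the authors' implementation: documents are treated as sets of tokens, information gain is computed for word presence versus absence, and the thresholds `ig_threshold` and `sim_threshold`, the `vectors` lookup table, and all function names are assumptions introduced here.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum(
        (c / total) * math.log2(c / total) for c in Counter(labels).values()
    )


def information_gain(docs, labels, word):
    """Reduction in label entropy from knowing whether `word` occurs in a document."""
    present = [y for doc, y in zip(docs, labels) if word in doc]
    absent = [y for doc, y in zip(docs, labels) if word not in doc]
    n = len(labels)
    conditional = (len(present) / n) * entropy(present) \
        + (len(absent) / n) * entropy(absent)
    return entropy(labels) - conditional


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def words_to_remove(docs, labels, vectors, ig_threshold, sim_threshold):
    """Return (low-information-gain words, further words similar to them)."""
    vocab = set().union(*docs)
    # Step 1: words whose information gain falls below the threshold.
    low_ig = {w for w in vocab if information_gain(docs, labels, w) < ig_threshold}
    # Step 2: remaining words whose embedding is close to any low-gain word.
    similar = {
        w for w in vocab - low_ig
        if w in vectors and any(
            u in vectors
            and cosine_similarity(vectors[w], vectors[u]) >= sim_threshold
            for u in low_ig
        )
    }
    return low_ig, similar
```

In practice the `vectors` table would come from a trained embedding such as Word2Vec. The second step can also discard informative words that merely lie close to uninformative ones, which is consistent with the abstract's remark that removing too many words lowers performance.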

Digital Watermarking Algorithm for Copyright Protection of JPEG Image (JPEG 영상의 저작권 보호를 위한 Digital Watermarking 알고리즘)

  • Park, Eun-Suk;Woo, Jong-Won;Lee, Seok-Hee;Heo, Yoon-Seok;Cho, Ki-Hyung
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.1
    • /
    • pp.296-305
    • /
    • 2000
  • In this paper, we propose a method of embedding an encrypted digital watermark in the quantized coefficients while encoding image data in the JPEG process. The proposed method is as follows. After the DCT coefficients of each block are quantized, we arrange the quantized coefficients in one dimension with a zigzag scan and rearrange the blocks. We embed the watermark by applying the even-odd feature of the frequency of the encrypted watermark to the quantized coefficients in a fixed region of each rearranged block, then restore the blocks to their original order and encode them to obtain the compressed image data. The advantages of the proposed method are as follows. We can embed a large amount of information while keeping it as secret as possible by using the block replacement algorithm. We can also control the amount of embedding for each use: because we embed the encrypted information in a selected fixed region of the quantized coefficients, the embedded data can be fixed regardless of the image and the quantization values. We verified the results by experiments and analyzed their efficiency in comparison with a previous study.
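As a rough illustration of the zigzag scan and even-odd embedding steps, the sketch below forces the parity of selected quantized coefficients to match the watermark bits. This is a simplified assumption about what the "even-odd feature" means. The encryption and block rearrangement steps are omitted, and the function names and the choice of `positions` are hypothetical.

```python
def zigzag_indices(n=8):
    """Return (row, col) pairs of an n x n block in JPEG zigzag order."""
    order = []
    for s in range(2 * n - 1):
        diagonal = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even anti-diagonals are traversed from bottom-left to top-right.
        if s % 2 == 0:
            diagonal.reverse()
        order.extend(diagonal)
    return order


def zigzag_scan(block):
    """Flatten a square block of quantized coefficients into a 1-D list."""
    return [block[r][c] for r, c in zigzag_indices(len(block))]


def embed_bits(coeffs, bits, positions):
    """Make the parity of coeffs[pos] equal to the corresponding watermark bit."""
    out = list(coeffs)
    for bit, pos in zip(bits, positions):
        if out[pos] % 2 != bit:
            # Step one unit away from zero so the parity flips.
            out[pos] += 1 if out[pos] >= 0 else -1
    return out


def extract_bits(coeffs, positions):
    """Read the watermark bits back from the coefficient parities."""
    return [coeffs[pos] % 2 for pos in positions]
```

Because the bits live in the quantized coefficients, they survive the lossless entropy-coding stage of JPEG, and the fixed list of positions determines how many bits each block carries.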


Effect of Dimension Reduction on Prediction Performance of Multivariate Nonlinear Time Series

  • Jeong, Jun-Yong;Kim, Jun-Seong;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • v.14 no.3
    • /
    • pp.312-317
    • /
    • 2015
  • The dynamical systems approach to time series has been used in many real problems. Based on Takens' embedding theorem, we can build a predictive function whose input is the time delay coordinates vector, which consists of the lagged values of the observed series, and whose output is the future values of the observed series. Although the time delay coordinates vector from a multivariate time series carries more information than the one from a univariate time series, it can exhibit statistical redundancy, which degrades the performance of the prediction function. We apply dimension reduction techniques to solve this problem and analyze the effect of this approach on prediction. Our experiment uses delayed Lorenz series; least squares support vector regression approximates the predictive function. The result shows that linearly preserving projection improves the prediction performance.
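The time delay coordinates vector mentioned here can be built as in the following minimal sketch for a univariate series. The parameter names `dim` (embedding dimension), `lag` (time delay), and `horizon` (prediction step) are assumptions for illustration; the abstract does not give the values used in the paper.

```python
def delay_embed(series, dim, lag):
    """Return delay vectors [x(t), x(t - lag), ..., x(t - (dim - 1) * lag)]."""
    if dim < 1 or lag < 1:
        raise ValueError("dim and lag must be positive integers")
    start = (dim - 1) * lag  # first index with a full history available
    return [
        [series[t - k * lag] for k in range(dim)]
        for t in range(start, len(series))
    ]


def make_training_pairs(series, dim, lag, horizon=1):
    """Pair each delay vector with the value observed `horizon` steps later."""
    start = (dim - 1) * lag
    pairs = []
    for i, vector in enumerate(delay_embed(series, dim, lag)):
        target_index = start + i + horizon
        if target_index < len(series):
            pairs.append((vector, series[target_index]))
    return pairs
```

For a multivariate series, one option is to concatenate the delay vectors of the individual variables. That concatenation is where the redundancy discussed in the abstract arises and where a dimension reduction step would be applied before fitting the regression model.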

Chaotic analysis of tool wear using multi-sensor signal in end-milling process (엔드밀가공시 복합계측 신호를 이용한 공구 마멸의 카오스적 해석)

  • Kim, J.S.;Kang, M.C.;Ku, S.J.
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.14 no.11
    • /
    • pp.93-101
    • /
    • 1997
  • Ever since the nonlinearity of machine tool dynamics was established, researchers have tried to exploit it to devise better monitoring, diagnostic, and control systems, which had previously been based on linear models. Chaos theory, which explains many nonlinear phenomena, is a convenient basis for extending the analysis with nonlinear models. In this study, a measuring system is constructed using multiple sensors (tool dynamometer, acoustic emission) in the end-milling process. The cutting force is then verified to be low-dimensional chaos by calculating the Lyapunov exponents, fractal dimension, and embedding dimension. Finally, the relation between the characteristic parameters calculated from the sensor signals and tool wear is investigated.


Defect Evaluation of Weld Zone in Rails Using Attractor and Distance Amplitude Characteristics Curve (레일 용접부의 결함 검출을 위한 어트랙터의 구성 및 해석에 관한 연구)

  • 윤인식;고준빈;박성두
    • Journal of Welding and Joining
    • /
    • v.18 no.5
    • /
    • pp.77-83
    • /
    • 2000
  • This study proposes a method for analyzing and evaluating time series ultrasonic signals using attractor analysis. Features extracted from the time series signal are used to analyze the characteristics of weld defects quantitatively. For this purpose, the analysis targets in this study are the fractal dimension and the attractor quadrant feature. Trajectory changes in the attractor indicated a substantial difference in fractal characteristics resulting from distance shifts such as parts of head and flange even though the types of defects are identified. These differences in the characteristics of weld defects enable the evaluation of the unique characteristics of defects in the weld zone. In quantitative fractal feature extraction, feature values of 3.848 for the head (crack), 4.102 for the web (side hole), and 3.711 for the flange (crack) were obtained on the basis of fractal dimensions. The proposed attractor analysis and DAC can enhance the precision of ultrasonic evaluation for defect signals, such as side holes and cracks, in the rail weld zone.


Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • With the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected documents is tokenized and structured to convert the original documents into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of the analysis. Until recently, text mining studies have focused on the applications of the second step. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have been actively studied to improve the quality of analysis results by preserving the meaning of words and documents when representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, unstructured text must first undergo a structuring task that transforms the original document into a form that the computer can understand. Mapping arbitrary objects into a space of a specific dimension while maintaining algebraic properties, in order to structure text data, is called "embedding". Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents. In particular, as the demand for document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec, which extends word2Vec and embeds each document into one vector, is the most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document.
As a result, the document vector is affected not only by core words but also by miscellaneous words. Additionally, traditional document embedding schemes usually map each document to a single corresponding vector. It is therefore difficult to represent a complex document with multiple subjects accurately as a single vector using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations. This study targets documents that explicitly separate body content and keywords. For a document without keywords, the method can be applied after extracting keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. All text in a document is tokenized, and each token is represented as an N-dimensional real-valued vector through word embedding. After that, to keep the document vector from being affected by miscellaneous words as well as core words, the vectors corresponding to the keywords of each document are extracted to form a set of keyword vectors for each document. Next, clustering is conducted on the set of keywords for each document to identify the multiple subjects included in the document. Finally, a multi-vector is generated from the vectors of the keywords constituting each cluster. Experiments on 3,147 academic papers revealed that the traditional single-vector approach cannot properly map complex documents because of interference among subjects in each vector.
With the proposed multi-vector method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.
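Steps (3) to (5) of the process above might be sketched as follows. The abstract names neither the clustering algorithm nor the rule for deriving one vector per cluster, so this sketch assumes plain k-means with a deterministic initialization and uses each cluster centroid as the document's vector for that subject. The `word_vectors` table and all function names are hypothetical.

```python
import math


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def kmeans(vectors, k, iterations=20):
    """Plain k-means; the first k vectors serve as the initial centroids."""
    centroids = [list(v) for v in vectors[:k]]
    assignment = [0] * len(vectors)
    for _ in range(iterations):
        # Assign every vector to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: math.dist(v, centroids[c]))
            for v in vectors
        ]
        # Recompute each centroid; keep the old one if its cluster is empty.
        for c in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:
                centroids[c] = mean_vector(members)
    return assignment, centroids


def multi_vector_embedding(keywords, word_vectors, k):
    """Return up to k vectors for one document, one per keyword cluster."""
    vectors = [word_vectors[w] for w in keywords if w in word_vectors]
    k = min(k, len(vectors))
    _, centroids = kmeans(vectors, k)
    return centroids
```

A document whose keywords fall into two distinct subject groups would then be represented by two vectors instead of one averaged vector that lies between the subjects.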

Defect evaluations of weld zone in rails considering phase space-frequency domain (위상공간-주파수 영역을 고려한 레일 용접부의 결함 평가)

  • 윤인식;권성태;장영권;정우현;이찬석
    • Journal of the Korean Society for Railway
    • /
    • v.2 no.2
    • /
    • pp.21-30
    • /
    • 1999
  • This study proposes a method for analyzing and evaluating time series ultrasonic signals using the phase space-frequency domain. Features extracted from the time series signal are used to analyze the characteristics of weld defects quantitatively. For this purpose, the analysis targets in this study are time-domain and frequency-domain features. Trajectory changes in the attractor indicated a substantial difference in fractal characteristics resulting from distance shifts such as parts of head and flange even though the types of defects are identified. These differences in the characteristics of weld defects enable the evaluation of the unique characteristics of defects in the weld zone. In quantitative fractal feature extraction, feature values of 3.848 for the head (crack), 4.102 for the web (side hole), and 3.711 for the flange (crack) were obtained on the basis of the fractal dimension. The proposed phase space-frequency domain method can be used for integrity evaluation of defect signals, such as side holes and cracks, in the rail weld zone.
