• Title/Summary/Keyword: Smart Study

Search Result 9,757, Processing Time 0.048 seconds

Stock-Index Invest Model Using News Big Data Opinion Mining (뉴스와 주가 : 빅데이터 감성분석을 통한 지능형 투자의사결정모형)

  • Kim, Yoo-Sin;Kim, Nam-Gyu;Jeong, Seung-Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.143-156
    • /
    • 2012
  • People easily believe that news and stock index are closely related. They think that securing news before anyone else can help them forecast the stock prices and enjoy great profit, or perhaps capture the investment opportunity. However, it is no easy feat to determine to what extent the two are related, come up with the investment decision based on news, or find out such investment information is valid. If the significance of news and its impact on the stock market are analyzed, it will be possible to extract the information that can assist the investment decisions. The reality however is that the world is inundated with a massive wave of news in real time. And news is not patterned text. This study suggests the stock-index invest model based on "News Big Data" opinion mining that systematically collects, categorizes and analyzes the news and creates investment information. To verify the validity of the model, the relationship between the result of news opinion mining and stock-index was empirically analyzed by using statistics. Steps in the mining that converts news into information for investment decision making, are as follows. First, it is indexing information of news after getting a supply of news from news provider that collects news on real-time basis. Not only contents of news but also various information such as media, time, and news type and so on are collected and classified, and then are reworked as variable from which investment decision making can be inferred. Next step is to derive word that can judge polarity by separating text of news contents into morpheme, and to tag positive/negative polarity of each word by comparing this with sentimental dictionary. Third, positive/negative polarity of news is judged by using indexed classification information and scoring rule, and then final investment decision making information is derived according to daily scoring criteria. For this study, KOSPI index and its fluctuation range has been collected for 63 days that stock market was open during 3 months from July 2011 to September in Korea Exchange, and news data was collected by parsing 766 articles of economic news media M company on web page among article carried on stock information>news>main news of portal site Naver.com. In change of the price index of stocks during 3 months, it rose on 33 days and fell on 30 days, and news contents included 197 news articles before opening of stock market, 385 news articles during the session, 184 news articles after closing of market. Results of mining of collected news contents and of comparison with stock price showed that positive/negative opinion of news contents had significant relation with stock price, and change of the price index of stocks could be better explained in case of applying news opinion by deriving in positive/negative ratio instead of judging between simplified positive and negative opinion. And in order to check whether news had an effect on fluctuation of stock price, or at least went ahead of fluctuation of stock price, in the results that change of stock price was compared only with news happening before opening of stock market, it was verified to be statistically significant as well. In addition, because news contained various type and information such as social, economic, and overseas news, and corporate earnings, the present condition of type of industry, market outlook, the present condition of market and so on, it was expected that influence on stock market or significance of the relation would be different according to the type of news, and therefore each type of news was compared with fluctuation of stock price, and the results showed that market condition, outlook, and overseas news was the most useful to explain fluctuation of news. On the contrary, news about individual company was not statistically significant, but opinion mining value showed tendency opposite to stock price, and the reason can be thought to be the appearance of promotional and planned news for preventing stock price from falling. Finally, multiple regression analysis and logistic regression analysis was carried out in order to derive function of investment decision making on the basis of relation between positive/negative opinion of news and stock price, and the results showed that regression equation using variable of market conditions, outlook, and overseas news before opening of stock market was statistically significant, and classification accuracy of logistic regression accuracy results was shown to be 70.0% in rise of stock price, 78.8% in fall of stock price, and 74.6% on average. This study first analyzed relation between news and stock price through analyzing and quantifying sensitivity of atypical news contents by using opinion mining among big data analysis techniques, and furthermore, proposed and verified smart investment decision making model that could systematically carry out opinion mining and derive and support investment information. This shows that news can be used as variable to predict the price index of stocks for investment, and it is expected the model can be used as real investment support system if it is implemented as system and verified in the future.

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.57-73
    • /
    • 2021
  • Maintenance and prevention of failure through anomaly detection of ICT infrastructure is becoming important. System monitoring data is multidimensional time series data. When we deal with multidimensional time series data, we have difficulty in considering both characteristics of multidimensional data and characteristics of time series data. When dealing with multidimensional data, correlation between variables should be considered. Existing methods such as probability and linear base, distance base, etc. are degraded due to limitations called the curse of dimensions. In addition, time series data is preprocessed by applying sliding window technique and time series decomposition for self-correlation analysis. These techniques are the cause of increasing the dimension of data, so it is necessary to supplement them. The anomaly detection field is an old research field, and statistical methods and regression analysis were used in the early days. Currently, there are active studies to apply machine learning and artificial neural network technology to this field. Statistically based methods are difficult to apply when data is non-homogeneous, and do not detect local outliers well. The regression analysis method compares the predictive value and the actual value after learning the regression formula based on the parametric statistics and it detects abnormality. Anomaly detection using regression analysis has the disadvantage that the performance is lowered when the model is not solid and the noise or outliers of the data are included. There is a restriction that learning data with noise or outliers should be used. The autoencoder using artificial neural networks is learned to output as similar as possible to input data. It has many advantages compared to existing probability and linear model, cluster analysis, and map learning. It can be applied to data that does not satisfy probability distribution or linear assumption. In addition, it is possible to learn non-mapping without label data for teaching. However, there is a limitation of local outlier identification of multidimensional data in anomaly detection, and there is a problem that the dimension of data is greatly increased due to the characteristics of time series data. In this study, we propose a CMAE (Conditional Multimodal Autoencoder) that enhances the performance of anomaly detection by considering local outliers and time series characteristics. First, we applied Multimodal Autoencoder (MAE) to improve the limitations of local outlier identification of multidimensional data. Multimodals are commonly used to learn different types of inputs, such as voice and image. The different modal shares the bottleneck effect of Autoencoder and it learns correlation. In addition, CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of data. In general, conditional input mainly uses category variables, but in this study, time was used as a condition to learn periodicity. The CMAE model proposed in this paper was verified by comparing with the Unimodal Autoencoder (UAE) and Multi-modal Autoencoder (MAE). The restoration performance of Autoencoder for 41 variables was confirmed in the proposed model and the comparison model. The restoration performance is different by variables, and the restoration is normally well operated because the loss value is small for Memory, Disk, and Network modals in all three Autoencoder models. The process modal did not show a significant difference in all three models, and the CPU modal showed excellent performance in CMAE. ROC curve was prepared for the evaluation of anomaly detection performance in the proposed model and the comparison model, and AUC, accuracy, precision, recall, and F1-score were compared. In all indicators, the performance was shown in the order of CMAE, MAE, and AE. Especially, the reproduction rate was 0.9828 for CMAE, which can be confirmed to detect almost most of the abnormalities. The accuracy of the model was also improved and 87.12%, and the F1-score was 0.8883, which is considered to be suitable for anomaly detection. In practical aspect, the proposed model has an additional advantage in addition to performance improvement. The use of techniques such as time series decomposition and sliding windows has the disadvantage of managing unnecessary procedures; and their dimensional increase can cause a decrease in the computational speed in inference.The proposed model has characteristics that are easy to apply to practical tasks such as inference speed and model management.

A Methodology of Multimodal Public Transportation Network Building and Path Searching Using Transportation Card Data (교통카드 기반자료를 활용한 복합대중교통망 구축 및 경로탐색 방안 연구)

  • Cheon, Seung-Hoon;Shin, Seong-Il;Lee, Young-Ihn;Lee, Chang-Ju
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.3
    • /
    • pp.233-243
    • /
    • 2008
  • Recognition for the importance and roles of public transportation is increasing because of traffic problems in many cities. In spite of this paradigm change, previous researches related with public transportation trip assignment have limits in some aspects. Especially, in case of multimodal public transportation networks, many characters should be considered such as transfers. operational time schedules, waiting time and travel cost. After metropolitan integrated transfer discount system was carried out, transfer trips are increasing among traffic modes and this takes the variation of users' route choices. Moreover, the advent of high-technology public transportation card called smart card, public transportation users' travel information can be recorded automatically and this gives many researchers new analytical methodology for multimodal public transportation networks. In this paper, it is suggested that the methodology for establishment of brand new multimodal public transportation networks based on computer programming methods using transportation card data. First, we propose the building method of integrated transportation networks based on bus and urban railroad stations in order to make full use of travel information from transportation card data. Second, it is offered how to connect the broken transfer links by computer-based programming techniques. This is very helpful to solve the transfer problems that existing transportation networks have. Lastly, we give the methodology for users' paths finding and network establishment among multi-modes in multimodal public transportation networks. By using proposed methodology in this research, it becomes easy to build multimodal public transportation networks with existing bus and urban railroad station coordinates. Also, without extra works including transfer links connection, it is possible to make large-scaled multimodal public transportation networks. In the end, this study can contribute to solve users' paths finding problem among multi-modes which is regarded as an unsolved issue in existing transportation networks.

Estimation and Mapping of Methane Emissions from Rice Paddies in Korea: Analysis of Regional Differences and Characteristics (전국 논에서 발생하는 메탄 배출량의 산정 및 지도화: 지역 격차 및 특성 분석)

  • Choi, Sung-Won;Kim, Joon;Kang, Minseok;Lee, Seung Hoon;Kang, Namgoo;Shim, Kyo-Moon
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.20 no.1
    • /
    • pp.88-100
    • /
    • 2018
  • Methane emissions from rice paddies are the largest source of greenhouse gases in the agricultural sector, but there are significant regional differences depending on the surrounding conditions and cultivation practices. To visualize these differences and to analyze their causes and characteristics, the methane emissions from each administrative district in South Korea were calculated according to the IPCC guidelines using the data from the 2010 Agriculture, Forestry and Fisheries Census, and then the results were mapped by using the ArcGIS. The nationwide average of methane emissions per unit area was $380{\pm}74kg\;CH_4\;ha^{-1}\;yr^{-1}$. The western region showed a trend toward higher values than the eastern region. One of the major causes resulting in such regional differences was the $SF_o$ (scaling factor associated with the application of organic matter), where the number of cultivation days played an important role to either offset or deepen the differences. Comparison of our results against the actual methane emissions data observed by eddy covariance flux measurement in the three KoFlux rice paddy sites in Gimje, Haenam and Cheorwon showed some differences but encouraging results with a difference of 10 % or less depending on the sites and years. Using the updated GWP (global warming potential) value of 28, the national total methane emission in 2010 was estimated to be $8,742,000tons\;CO_2eq$ - 13% lower than that of the National Greenhouse Gas Inventory Report (i.e., $10,048,000tons\;CO_2eq$). The administrative districts-based map of methane emissions developed in this study can help identify the regional differences, and the analysis of their key controlling factors will provide important scientific basis for the practical policy makings for methane mitigation.

Characteristic Study of Small-sized and Planer Resonator for Mobile Device in Magnetic Wireless Power Transfer (소형 모바일 기기용 공진형 무선전력전송 시스템의 공진기 평면화 및 소형화에 따른 특성 연구)

  • Lee, Hoon-Hee;Jung, Chang-Won
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.4
    • /
    • pp.16-21
    • /
    • 2017
  • In this paper, a Small-sized and planer resonator design of Magnetic Resonance - Wireless Power Transfer(MR-WPT) were proposed for practical applications of mobile devices, such as a laptop, a smart-phone and a tablet pc. The proposed MR-WPT system were based on four coil MR-WPT and designed as a transmitter part (Tx) and a receiver part (Rx) both are the same shape with the same loop and resonator. There are four different spiral coil type of resonators with variable of line length, width, gap and turns in $50mm{\times}50mm$ size. The both of top and bottom side of substrate(acrylic; ${\varepsilon}_r=2.56$, tan ${\delta}=0.008$) ere used to generate high inductance and capacitance in limited small volume. Loops were designed on the same plane of resonator to reduce their volume, and there are three different size. The proposed MR-WPT system were fabricated with two acrylic substrate plane of Tx and Rx each, the Rx and Tx loops and resonators were fabricated of copper sheets. There are 12 combinations of 3 loops and 4 resonators, each combination were measured to calculate transfer efficiency and resonance frequency in transfer distance from 1cm to 5cm. The measured results, the highest transfer efficiency was about 70%, and average transfer efficiency was 40%, on the resonance frequency was about 6.78 MHz, which is standard band by A4WP. We proposed small-sized and planer resonator of MR-WPT and showed possibility of mobile applications for small devices.

A Study on Music Summarization (음악요약 생성에 관한 연구)

  • Kim Sung-Tak;Kim Sang-Ho;Kim Hoi-Rin;Choi Ji-Hoon;Lee Han-Kyu;Hong Jin-Woo
    • Journal of Broadcast Engineering
    • /
    • v.11 no.1 s.30
    • /
    • pp.3-14
    • /
    • 2006
  • Music summarization means a technique which automatically generates the most importantand representative a part or parts ill music content. The techniques of music summarization have been studied with two categories according to summary characteristics. The first one is that the repeated part is provided as music summary and the second provides the combined segments which consist of segments with different characteristics as music summary in music content In this paper, we propose and evaluate two kinds of music summarization techniques. The algorithm using multi-level vector quantization which provides a repeated part as music summary gives fixed-length music summary is evaluated by overlapping ration between hand-made repeated parts and automatically generated summary. As results, the overlapping ratios of conventional methods are 42.2% and 47.4%, but that of proposed method with fixed-length summary is 67.1%. Optimal length music summary is evaluated by the portion of overlapping between summary and repeated part which is different length according to music content and the result shows that automatically-generated summary expresses more effective part than fixed-length summary with optimal length. The cluster-based algorithm using 2-D similarity matrix and k-means algorithm provides the combined segments as music summary. In order to evaluate this algorithm, we use MOS test consisting of two questions(How many similar segments are in summarized music? How many segments are included in same structure?) and the results show good performance.

Prediction of field failure rate using data mining in the Automotive semiconductor (데이터 마이닝 기법을 이용한 차량용 반도체의 불량률 예측 연구)

  • Yun, Gyungsik;Jung, Hee-Won;Park, Seungbum
    • Journal of Technology Innovation
    • /
    • v.26 no.3
    • /
    • pp.37-68
    • /
    • 2018
  • Since the 20th century, automobiles, which are the most common means of transportation, have been evolving as the use of electronic control devices and automotive semiconductors increases dramatically. Automotive semiconductors are a key component in automotive electronic control devices and are used to provide stability, efficiency of fuel use, and stability of operation to consumers. For example, automotive semiconductors include engines control, technologies for managing electric motors, transmission control units, hybrid vehicle control, start/stop systems, electronic motor control, automotive radar and LIDAR, smart head lamps, head-up displays, lane keeping systems. As such, semiconductors are being applied to almost all electronic control devices that make up an automobile, and they are creating more effects than simply combining mechanical devices. Since automotive semiconductors have a high data rate basically, a microprocessor unit is being used instead of a micro control unit. For example, semiconductors based on ARM processors are being used in telematics, audio/video multi-medias and navigation. Automotive semiconductors require characteristics such as high reliability, durability and long-term supply, considering the period of use of the automobile for more than 10 years. The reliability of automotive semiconductors is directly linked to the safety of automobiles. The semiconductor industry uses JEDEC and AEC standards to evaluate the reliability of automotive semiconductors. In addition, the life expectancy of the product is estimated at the early stage of development and at the early stage of mass production by using the reliability test method and results that are presented as standard in the automobile industry. However, there are limitations in predicting the failure rate caused by various parameters such as customer's various conditions of use and usage time. To overcome these limitations, much research has been done in academia and industry. Among them, researches using data mining techniques have been carried out in many semiconductor fields, but application and research on automotive semiconductors have not yet been studied. In this regard, this study investigates the relationship between data generated during semiconductor assembly and package test process by using data mining technique, and uses data mining technique suitable for predicting potential failure rate using customer bad data.

The Analysis of the Successful Factors from User Side of MMORPG (사용자 측면에서의 MMORPG <월드 오브 워크래프트> 성공요인 분석)

  • Baek, Jaeyong;Kim, Kenneth Chi Ho
    • Cartoon and Animation Studies
    • /
    • s.42
    • /
    • pp.151-175
    • /
    • 2016
  • The game industry has evolved from mobile games to PC online games after the smart-phone industry was opened up. In this environment, the game industry has rather been negatively developing its commercials means than the sufficient fundamental entertainment to the users. Especially, many games were released with better graphic qualities yet poor originality, continuing to be popular without enhancing the market itself. Moreover, the user's recognition level has improved. The users share their online gaming experience easily with the development of network environment. They receive the feedbacks on the quality of the game through the online channels and media by sharing them together. The high margin of the game industry will lead to the negative feedbacks of the users, effecting them to critique the content although the market looks good for now. The game industry's evolution has to be reviewed in the perspective of users, to look back at the successful cases of the past before the mobile era by analyzing and indicating the quality of the games and content's direction. This research is focused on the success factors of from the user's point of view, which has been widely claimed as a popular game franchise publicly before the mobile games had risen. WOW has been the most successful MMORPG game with its user record of 1.2 million till now. For these reasons, this study analyzes 's success factors from the user's point of view by configuring five expert groups, sequentially applying expert group survey, interview, Jobs-to-be-done and Fishbein Model as UX methodologies based on the business model to see through its long term rein in the industry. Consequently, The success factors from the user side of MMORPG provides an opportunity for the users to interact deeply with the game by (1) using well designed 'world view' over 10 years, (2) providing 'national policy' that is based on the locations of the users' culture and language, (3) providing 'expansions' with changes in time to give the digging elements to the users.

Survey of ICT Apply to Plastic Greenhouse, Rack·Pinion Adaption to Single Span and CFD Analysis (온실 ICT융복합 실태조사와 복숭아형 랙피니언천창 적용 단동온실 및 CFD 유동해석)

  • Cho, Kyu Jeong;Kim, Ki Young;Yang, Won Mo
    • Journal of Bio-Environment Control
    • /
    • v.24 no.4
    • /
    • pp.308-316
    • /
    • 2015
  • This study was conducted to investigate the situation of ICT apply to plastic greenhouse, and the results be apply to design of new one. A CFD analysis were conducted to monitering the ventilation and energy saving of the single span greenhouse newly designed. The causes of delay to apply ICT to plastic greenhouse are the high cost for installation(24%), insufficiency of after services(19%), often disorder(16%), unskillful management of soft ware(15%), insufficient ICT efficiency(13%) and unsatisfying of income increase(12%). The parts of problem occurred in ICT plastic greenhouse are the structure, actuator, environmental control system and sensor(approximate 14%, respectively), remote control technique(13%), plant management technique(12%), energy saving technique(10%) and utilization of software(8%). In the condition of lateral window closed, the average wind speed changed to slow, but it was faster in the condition of leeward side window opened than in the condition of lee and winward side window opened. The air movement in the condition of lateral window closed occur by air moving fan not by out air. It is not affect the room temperature but effective the uniformity of room temperature. The average temperature of low height greenhouse was uniform than high height one. The average temperature in condition of 3rd curtain opened become same with outside temperature after 2 hours, but take more 5 hours in condition of 3rd curtain closed.

Quality characteristics of domestic and imported commercial plain wheat flour (시판 우리밀과 수입밀 중력 밀가루의 품질 특성 비교)

  • Kwak, Han Sub;Kim, Mi Jeong;Kim, Hoon;Kim, Sang Sook
    • Korean Journal of Food Science and Technology
    • /
    • v.49 no.3
    • /
    • pp.304-310
    • /
    • 2017
  • The aim of this study was to compare the physicochemical properties of domestic and imported commercial plain wheat flour and dough. Four types of domestic wheat flour (DW; DW1-4) were compared to four types of imported wheat flour (IW; IW1-4). DWs exhibited lower moisture content, lightness (L), and whiteness, and higher protein content, redness (a), and yellowness (b), than those exhibited by IWs. Solvent retention capacity of DWs and IWs was similar; however, DWs showed higher gluten performance index. Pasting properties, analyzed by rapid visco analyzer (RVA), were similar for DW1, DW2, and IWs; however, DW3 and DW4 showed different RVA patterns. Considering that DW3 and DW4 were organic wheat flour, possible incorporation of damaged kernel might increase amylase activities resulting in decreased peak viscosity. Dough resistance (108.4-159.9 g) and extensibility (11.8-16.7 mm) of IWs were higher than those of DWs (78.0-118.7 g, 8.7-12.5 mm, respectively).