• Title/Summary/Keyword: data driven tools

Design of Web-Based Simulation Framework for Real-Time Infographics (실시간 인포그래픽을 위한 웹 기반의 시뮬레이션 프레임워크 설계)

  • Shin, Seung-Hyeok
    • Journal of Advanced Navigation Technology / v.19 no.5 / pp.411-416 / 2015
  • The scale of services in an IoT environment is determined by the variety of sensors involved. A gateway that relays information from various sensors is a representative middleware system, and an infographic, which presents data and information graphically, is a client system for representing real-time information efficiently. Developing real-time infographics that display a large amount of information effectively requires a test bed similar to an IoT environment. The testing tools used in developing existing network systems are mostly designed for functional and performance testing of the driven unit. In this paper, we propose a web-based simulation framework that generates the variety of information required for real-time infographics development, and we evaluate the proposed system by comparing its test functions with those of a previous network test tool.

Applications of Machine Learning Models for the Estimation of Reservoir CO2 Emissions (저수지 CO2 배출량 산정을 위한 기계학습 모델의 적용)

  • Yoo, Jisu;Chung, Se-Woong;Park, Hyung-Seok
    • Journal of Korean Society on Water Environment / v.33 no.3 / pp.326-333 / 2017
  • Lakes and reservoirs have been reported as important sources of carbon emissions to the atmosphere in many countries. Although field experiments and theoretical investigations based on fundamental gas exchange theory have quantified the Net Atmospheric Flux (NAF) in various climate regions, large uncertainties remain in global-scale estimates. Mechanistic models can be used to understand and estimate the temporal and spatial variations of NAFs by considering the complicated hydrodynamic and biogeochemical processes in a reservoir, but these models require extensive and expensive datasets and model parameters. On the other hand, data-driven machine learning (ML) algorithms are likely to be alternative tools for estimating NAFs in response to independent environmental variables. The objective of this study was to develop random forest (RF) and multi-layer artificial neural network (ANN) models for estimating the daily $CO_2$ NAFs in Daecheong Reservoir, located on the Geum River in Korea, and to compare the models' performance against the multiple linear regression (MLR) model proposed in a previous study (Chung et al., 2016). The RF and ANN models showed much better performance in estimating high NAF values, while the MLR model significantly underestimated them. Cross-validation with 10-fold random sampling was applied to evaluate the performance of the three models and indicated that the ANN model performed best, followed by the RF and MLR models.
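
The 10-fold evaluation mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the data are synthetic, a one-variable least-squares line stands in for the RF, ANN, and MLR models, and the function names (`k_fold_indices`, `fit_line`, `cross_validated_rmse`) are hypothetical.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x (a stand-in for any model)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b


def cross_validated_rmse(xs, ys, k=10, seed=0):
    """Train on k-1 folds, score RMSE on the held-out fold, average over folds."""
    scores = []
    for held_out in k_fold_indices(len(xs), k, seed):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        squared_errors = [(ys[i] - (a + b * xs[i])) ** 2 for i in held_out]
        scores.append(mean(squared_errors) ** 0.5)
    return mean(scores)


# Synthetic, made-up data: a linear signal plus Gaussian noise.
rng = random.Random(42)
xs = [i / 10 for i in range(100)]
ys = [2.0 + 0.5 * x + rng.gauss(0, 0.1) for x in xs]
print(round(cross_validated_rmse(xs, ys), 3))
```

Comparing several models this way means calling the same routine with each model's fit and predict steps on identical folds, so that differences in the averaged score reflect the models rather than the split.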

A Study on Composition and Utilization of Digital Literacy Education elements Using Open Contents (오픈 콘텐츠를 활용한 디지털 리터러시 학습 요소 구성과 활용)

  • Hong, Myunghui;Lee, Soonyoung
    • Journal of The Korean Association of Information Education / v.22 no.6 / pp.711-721 / 2018
  • The development of artificial intelligence technology and the shift to a software-driven society are raising the need for digital literacy education on how to access, understand, use, create, and share new open content among the variety of sustainable open content available. This paper defines digital literacy as a sub-literacy concept covering data, tools, and device elements. It is further defined as a concept that includes cognitive and non-cognitive abilities and is stratified into computer literacy, ICT literacy, and information literacy. Open content is defined as teaching-learning materials that anyone can use and share freely, as in the Open Educational Resources (OER) and Open Access movements. Based on the two definitions, a three-step strategy for digital literacy education was developed: selecting open content in the digital environment, then drawing up a digital literacy education plan, and finally building an education frame to foster digital literacy capabilities.

Digital Twin technology for Urban Policy Making (A Case Study of Policy Digital Twin of Sejong City) (디지털트윈 기술의 도시 정책 활용 사례 (세종시 도시행정 디지털트윈 프로젝트를 중심으로))

  • Jung, Y.J.;Cho, I.Y.;Lee, J.W.;Kim, B.H.;Lee, S.H.;Lim, C.G.;Lee, C.H.;Paik, E.H.;Jin, K.S.;Kim, Y.C.;Lee, S.M.;Choi, M.S.;KIM, T.H.;Chang, M.J.;Kim, S.O.;Kim, H.K.;Jung, S.J.;Lee, S.Y.;Ann, J.H.
    • Electronics and Telecommunications Trends / v.36 no.2 / pp.43-55 / 2021
  • National and social issues are becoming increasingly common, but traditional policy-making methods are no longer effective. Therefore, evidence-based policy making is emerging as an alternative paradigm. Digital twin technology is one of the digital support tools for the new data-driven policy-making process. This study presents ongoing government experiments around the world in which digital twin technology is applied to policy making and describes our experience in developing digital twin platforms in Sejong, the de facto administrative capital of South Korea.

Research on regional spatial information analysis platform about NTIS raw data (국가과학기술지식 원시데이터에 관한 지역 공간정보 분석 플랫폼 연구)

  • Lim, Jung-Sun;Kim, Sanggook;Bae, Seoung Hun;Kim, Kwang-Hoon;Won, Dong-Kyu
    • Journal of Cadastre & Land InformatiX / v.50 no.2 / pp.21-35 / 2020
  • Due to the coronavirus pandemic and diplomatic disputes, governments are actively developing policies to revitalize and reshore manufacturing and to diversify international cooperation. To develop such policies, it is very important to compare and analyze domestic and international geospatial information. Over the past decade, the US and EC governments have conducted a series of national research projects to build data-based tools that can monitor and analyze regional geospatial information driven by government R&D investments. The EC system can compare geospatial information across domestic and international regions (including Korea). Compared to the US and EC cases, Korean national research with an available data analysis platform needs further improvement. The current study investigates automated analysis methodologies using the "National Institute of Science and Technology Information (NTIS)" DB, which was national security data until recently. Research on data mining of regional geospatial information can support policy fields that need to discover new issues in response to unexpected social problems such as the recent coronavirus pandemic and trade disputes.

A Study on the Potential and Limitation of Pre-producing Dramas through Social Analysis -focusing on a jtbc drama - (소셜 분석을 통한 사전제작 드라마의 가능성과 한계에 관한 연구 -jtbc <맨투맨>을 중심으로-)

  • Kim, Kyung-Ae;Ku, Jin-Hee
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.2 / pp.164-172 / 2018
  • This paper examines the relevance of pre-production and storytelling through big data analysis and, focusing on JTBC's Man to Man series, looks at how a drama's storytelling should be structured. In this study, we conducted text mining on blogs focused on a particular topic, covering 67 blogs written about pre-produced dramas from 2016.12.15 to 2017.12.15, to read viewers' thoughts on pre-produced dramas. We also conducted sentiment analysis on the Man to Man series, which is not only a pre-produced drama but also has storytelling issues. Blog text extraction and text mining were performed using OutWit Hub and R, and the tools provided by social metrics were used for sentiment analysis of the larger data. Sentiment analysis revealed that viewers of the Man to Man series did not agree with the romance between Kim Sul-woo and Cha Do-ha, due to the lack of realism in the female characters. It was therefore concluded that making the characters more realistic is crucial to increasing the audience's empathy. Such studies will continue to be necessary because they form the basis for digitally driven storytelling studies and provide valuable material for predictions and instructions in the cultural content industry.

Development of Grid-Based Conceptual Hydrologic Model (격자기반의 개념적 수문모형의 개발)

  • Kim, Byung-Sik;Yoon, Seon-Kyoo;Yang, Dong-Min;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association / v.43 no.7 / pp.667-679 / 2010
  • Distributed hydrologic models have improved considerably thanks to the rapid development of computer hardware technology and the increased accessibility and applicability of hydro-geologic information through GIS. It is acknowledged that physically based distributed hydrologic models require significant amounts of data for calibration, so their application to ungauged catchments is very limited. In this regard, this study developed a distributed hydrologic model (S-RAT) that is mainly based on a conceptual grid-based water balance model. The proposed model shows advantages as a new distributed rainfall-runoff model in terms of its simplicity and performance. Another advantage is that it can effectively assess spatio-temporal variation over the entire runoff process. In addition, S-RAT does not rely on any commercial GIS pre-processing tools because a built-in GIS pre-processing module was developed and included in the model. Application to two pilot basins showed that the S-RAT model parameters are temporally and spatially transferable and that S-RAT can be used effectively as a radar data-driven rainfall-runoff model.

Contactless Data Society and Reterritorialization of the Archive (비접촉 데이터 사회와 아카이브 재영토화)

  • Jo, Min-ji
    • The Korean Journal of Archival Studies / no.79 / pp.5-32 / 2024
  • The Korean government ranked 3rd among 193 UN member countries in the UN's 2022 e-Government Development Index. Korea, which has consistently been evaluated as a top country, can clearly be said to be a world leader in e-government. The lubricant of e-government is data. Data itself is neither information nor a record, but it is a source of information and records and a resource of knowledge. As administrative actions through electronic systems have become widespread, the production and technology of data-based records have naturally expanded and evolved. Technology may seem value-neutral, but in fact technology itself reflects a specific worldview. The digital order of new technologies, armed with hyper-connectivity and super-intelligence, not only has a profound influence on traditional power structures but also has a similar influence on existing media for transmitting information and knowledge. Moreover, new technologies and media, including data-based generative artificial intelligence, are by far the hottest topic. The all-round growth and spread of digital technology has led to the augmentation of human capabilities and the outsourcing of thinking. This also involves a variety of problems, ranging from deepfakes and other fake images and automated profiling to AI hallucinations that present fabrications as if they were real and copyright infringement involving machine learning data. Moreover, radical connectivity enables the instantaneous sharing of vast amounts of data and relies on the technological unconscious to generate actions without awareness. Another irony of the digital world and online networks, which are based on immaterial distribution and logical existence, is that access and contact can only be made through physical tools. Digital information is a logical object, but digital resources cannot be read or utilized without some type of device to relay them. In that respect, machines in today's technological society have gone beyond the level of simple assistance, and there are points at which it is difficult to say that the entry of machines into human society is a natural pattern of change due to advanced technological development. This is because perspectives on machines will change over time. What matters is the social and cultural implications of changes in the way records are produced as a result of communication and actions through machines. In the archive field as well, what problems will a data-based archive society face due to technological changes toward a hyper-intelligent and hyper-connected society, who will prove the continuous activity of records and data, and what will be the main drivers of media change? It is time to research these questions. This study began with the need to recognize that archives are not only records that result from actions but also data as strategic assets. On this basis, the author considered how to expand traditional boundaries and achieve reterritorialization in a data-driven society.

Study on data preprocessing methods for considering snow accumulation and snow melt in dam inflow prediction using machine learning & deep learning models (머신러닝&딥러닝 모델을 활용한 댐 일유입량 예측시 융적설을 고려하기 위한 데이터 전처리에 대한 방법 연구)

  • Jo, Youngsik;Jung, Kwansue
    • Journal of Korea Water Resources Association / v.57 no.1 / pp.35-44 / 2024
  • Research in dam inflow prediction has actively explored the utilization of data-driven machine learning and deep learning (ML&DL) tools across diverse domains. Enhancing not just the inherent model performance but also accounting for model characteristics and preprocessing data are crucial elements for precise dam inflow prediction. Particularly, existing rainfall data, derived from snowfall amounts through heating facilities, introduces distortions in the correlation between snow accumulation and rainfall, especially in dam basins influenced by snow accumulation, such as Soyang Dam. This study focuses on the preprocessing of rainfall data essential for the application of ML&DL models in predicting dam inflow in basins affected by snow accumulation. This is vital to address phenomena like reduced outflow during winter due to low snowfall and increased outflow during spring despite minimal or no rain, both of which are physical occurrences. Three machine learning models (SVM, RF, LGBM) and two deep learning models (LSTM, TCN) were built by combining rainfall and inflow series. With optimal hyperparameter tuning, the appropriate model was selected, resulting in a high level of predictive performance with NSE ranging from 0.842 to 0.894. Moreover, to generate rainfall correction data considering snow accumulation, a simulated snow accumulation algorithm was developed. Applying this correction to machine learning and deep learning models yielded NSE values ranging from 0.841 to 0.896, indicating a similarly high level of predictive performance compared to the pre-snow accumulation application. Notably, during the snow accumulation period, adjusting rainfall during the training phase was observed to lead to a more accurate simulation of observed inflow when predicted. This underscores the importance of thoughtful data preprocessing, taking into account physical factors such as snowfall and snowmelt, in constructing data models.
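
The NSE values reported in this abstract refer to the Nash-Sutcliffe efficiency, commonly defined as one minus the ratio of the squared simulation error to the variance of the observations about their mean. A minimal sketch of that standard formula follows. The function name and the sample inflow values are illustrative assumptions, not taken from the paper.

```python
def nash_sutcliffe_efficiency(observed, simulated):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2).

    1.0 means a perfect match; 0.0 means the simulation is no better
    than always predicting the mean of the observations.
    """
    if not observed or len(observed) != len(simulated):
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    if spread == 0:
        raise ValueError("observed series has zero variance")
    return 1.0 - residual / spread


# Made-up daily inflow values, for illustration only.
observed_inflow = [1.0, 2.0, 3.0, 4.0, 5.0]
print(nash_sutcliffe_efficiency(observed_inflow, observed_inflow))  # perfect match
print(nash_sutcliffe_efficiency(observed_inflow, [3.0] * 5))        # mean-only prediction
```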

The Analysis on the Relationship between Firms' Exposures to SNS and Stock Prices in Korea (기업의 SNS 노출과 주식 수익률간의 관계 분석)

  • Kim, Taehwan;Jung, Woo-Jin;Lee, Sang-Yong Tom
    • Asia pacific journal of information systems / v.24 no.2 / pp.233-253 / 2014
  • Can the stock market really be predicted? Stock market prediction has attracted much attention from many fields including business, economics, statistics, and mathematics. Early research on stock market prediction was based on random walk theory (RWT) and the efficient market hypothesis (EMH). According to the EMH, stock markets are largely driven by new information rather than present and past prices. Since new information is unpredictable, the stock market will follow a random walk. Despite these theories, Schumaker [2010] asserted that people keep trying to predict the stock market by using artificial intelligence, statistical estimates, and mathematical models. Mathematical approaches include Percolation Methods, Log-Periodic Oscillations, and Wavelet Transforms to model future prices. Examples of artificial intelligence approaches that deal with optimization and machine learning are Genetic Algorithms, Support Vector Machines (SVM), and Neural Networks. Statistical approaches typically predict the future by using past stock market data. Recently, financial engineers have started to predict stock price movement patterns by using SNS data. SNS is a place where people's opinions and ideas flow freely and affect others' beliefs. Through word-of-mouth in SNS, people share product usage experiences, subjective feelings, and the accompanying sentiment or mood with others. An increasing number of empirical analyses of sentiment and mood are based on textual collections of public user-generated data on the web. Opinion mining is a domain of data mining that extracts public opinions expressed in SNS. There have been many studies on opinion mining from Web sources such as product reviews, forum posts, and blogs. In relation to this literature, we try to understand the effects of firms' SNS exposures on stock prices in Korea. Similarly to Bollen et al. [2011], we empirically analyze the impact of SNS exposures on stock return rates. We use Social Metrics by Daum Soft, an SNS big data analysis company in Korea. Social Metrics provides trends and public opinions on Twitter and blogs by using natural language processing and analysis tools. It collects sentences circulated on Twitter in real time, breaks them down into word units, and then extracts keywords. In this study, we classify firms' exposures in SNS into two groups: positive and negative. To test the correlation and causal relationship between SNS exposures and stock price returns, we first collect the stock prices of 252 firms and the KRX100 index on the Korea Stock Exchange (KRX) from May 25, 2012 to September 1, 2012. We also gather public attitudes (positive, negative) toward these firms from Social Metrics over the same period. We conduct regression analysis between stock prices and the number of SNS exposures. Having checked the correlation between the two variables, we perform a Granger causality test to determine the direction of causation between them. The result is that the total number of SNS exposures is positively related to stock market returns. The number of positive mentions also has a positive relationship with stock market returns. In contrast, the number of negative mentions has a negative relationship with stock market returns, but this relationship is not statistically significant. This means that the impact of positive mentions is statistically bigger than the impact of negative mentions. We also investigate whether the impacts are moderated by industry type and firm size. We find that the impacts of SNS exposures are bigger for IT firms than for non-IT firms, and bigger for small firms than for large firms. The results of the Granger causality test show that changes in stock price returns are caused by SNS exposures, while causation in the other direction is not significant. Therefore, the correlation between SNS exposures and stock prices has unidirectional causality. The more a firm is exposed in SNS, the more likely its stock price is to increase, while stock price changes may not cause more SNS mentions.
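
The idea of checking which series leads the other can be illustrated with a lagged Pearson correlation. This is a deliberately simpler stand-in for the Granger causality test named in the abstract, not an implementation of it, and it carries no significance testing. The function names and the mention and return figures below are made up for illustration.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def lagged_correlation(leader, follower, lag=1):
    """Correlate leader[t] with follower[t + lag]."""
    if len(leader) != len(follower):
        raise ValueError("series must have equal length")
    if lag < 0 or lag >= len(leader) - 1:
        raise ValueError("lag must leave at least two overlapping points")
    return pearson(leader[: len(leader) - lag], follower[lag:])


# Made-up series: each day's return is built from the previous day's mentions.
mentions = [3, 8, 2, 9, 4, 7, 1, 6]
returns = [0.5] + [0.1 * m for m in mentions[:-1]]

mentions_lead = lagged_correlation(mentions, returns, lag=1)
returns_lead = lagged_correlation(returns, mentions, lag=1)
print(round(mentions_lead, 3), round(returns_lead, 3))
```

In this constructed example the correlation is strong only when mentions are treated as the leading series, which mirrors the one-directional pattern the abstract describes. A real analysis would use a proper Granger test with lag selection and significance levels.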