• Title/Summary/Keyword: 빅데이터 기법

Search Result 785, Processing Time 0.023 seconds

Derivation of Green Infrastructure Planning Factors for Reducing Particulate Matter - Using Text Mining - (미세먼지 저감을 위한 그린인프라 계획요소 도출 - 텍스트 마이닝을 활용하여 -)

  • Seok, Youngsun;Song, Kihwan;Han, Hyojoo;Lee, Junga
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.5
    • /
    • pp.79-96
    • /
    • 2021
  • Green infrastructure planning represents landscape planning measures to reduce particulate matter. This study aimed to derive factors that may be used in planning green infrastructure for particulate matter reduction using text mining techniques. A range of analyses were carried out by focusing on keywords such as 'particulate matter reduction plan' and 'green infrastructure planning elements'. The analyses included Term Frequency-Inverse Document Frequency (TF-IDF) analysis, centrality analysis, related word analysis, and topic modeling analysis. These analyses were carried out via text mining by collecting information on previous related research, policy reports, and laws. Initially, TF-IDF analysis results were used to classify major keywords relating to particulate matter and green infrastructure into three groups: (1) environmental issues (e.g., particulate matter, environment, carbon, and atmosphere), target spaces (e.g., urban, park, and local green space), and application methods (e.g., analysis, planning, evaluation, development, ecological aspect, policy management, technology, and resilience). Second, the centrality analysis results were found to be similar to those of TF-IDF; it was confirmed that the central connectors to the major keywords were 'Green New Deal' and 'Vacant land'. The results from the analysis of related words verified that planning green infrastructure for particulate matter reduction required planning forests and ventilation corridors. Additionally, moisture must be considered for microclimate control. It was also confirmed that utilizing vacant space, establishing mixed forests, introducing particulate matter reduction technology, and understanding the system may be important for the effective planning of green infrastructure. Topic analysis was used to classify the planning elements of green infrastructure based on ecological, technological, and social functions. The planning elements of ecological function were classified into morphological (e.g., urban forest, green space, wall greening) and functional aspects (e.g., climate control, carbon storage and absorption, provision of habitats, and biodiversity for wildlife). The planning elements of technical function were classified into various themes, including the disaster prevention functions of green infrastructure, buffer effects, stormwater management, water purification, and energy reduction. The planning elements of the social function were classified into themes such as community function, improving the health of users, and scenery improvement. These results suggest that green infrastructure planning for particulate matter reduction requires approaches related to key concepts, such as resilience and sustainability. In particular, there is a need to apply green infrastructure planning elements in order to reduce exposure to particulate matter.

A Study on the Correlation between Uniaxial Compressive Strength of Rock by Elastic Wave Velocity and Elastic Modulus of Granite in Seoul and Gyeonggi Region (서울·경기지역 화강암의 탄성파속도와 탄성계수에 의한 암석의 일축압축강도와의 상관성 연구)

  • Son, In-Hwan;Kim, Byong-kuk;Lee, Byok-Kyu;Jang, Seung-jin;Lee, Su-Gon
    • Journal of the Society of Disaster Information
    • /
    • v.15 no.2
    • /
    • pp.249-258
    • /
    • 2019
  • Purpose: The purpose of this study is to attain the correlation analysis and thereby to deduce the uniaxial compressive strength of rock specimens through the elastic wave velocity and the elastic modulus among the physical characteristics measured from the rock specimens collected during drilling investigations in Seoul and Gyeonggi region. Method: Experiments were conducted in the laboratory with 119 granite specimens in order to derive the correlation between the compressive strength of the rocks and elastic wave velocity and elastic modulus. Results: In the case of granite, the results of the analysis of the interaction between the compressive strength of a rock and the elastic wave velocity and elastic modulus were found to be less reliable in the relation equation as a whole. And it is believed that the estimation of the compressive strength by the elastic wave velocity and elastic modulus is less used because of the composition of non-homogeneous particles of granite. Conclusion: In this study, the analysis of correlation between the compressive strength of a rock and the elastic wave velocity and elastic modulus was performed with simple regression analysis and multiple regression analysis. The coefficient determination ($R^2$) of simple regression analysis was shown between 0.61 and 0.67. Multiple regression analysis was 0.71. Thus, using multiple regression analysis when estimating compressive strength can increase the reliability of the correlation. Also, in the future, a variety of statistical analysis techniques such as recovery analysis, and artificial neural network analysis, and big data analysis can lead to more reliable results when estimating the compressive sterength of a rock based on the elastic wave velocity and elastic modulus.

Analyzing the Performance of the South Korean Men's National Football Team Using Social Network Analysis: Focusing on the Manager Bento's Matches (사회연결망분석을 활용한 한국 남자축구대표팀 경기성과 분석: 벤투 감독 경기를 중심으로)

  • Yeonsik Jung;Eunkyung Kang;Sung-Byung Yang
    • Knowledge Management Research
    • /
    • v.24 no.2
    • /
    • pp.241-262
    • /
    • 2023
  • The phenomena and game records that occur in sports matches are being analyzed in the field of sports game analysis, utilizing advanced technologies and various scientific analysis methods. Among these methods, social network analysis is actively employed in analyzing pass networks. As football is a representative sport in which the game unfolds through player interactions, efforts are being made to provide new insights into the game using social network analysis, which were previously unattainable. Consequently, this study aims to analyze the changes in pass networks over time for a specific football team and compare them in different scenarios, including variations in the game's nature (Qatar World Cup games vs. A match games) and alterations in the opposing team (higher FIFA rankers vs. lower FIFA rankers). To elaborate, we selected ten matches from the games of the Korean national football team following Coach Bento's appointment, extracted network indicators for these matches, and applied four indicators (efficiency, cohesion, vulnerability, and activity/leadership) from a football team's performance evaluation model to the extracted data for analysis under different circumstances. The research findings revealed a significant increase in cohesion and a substantial decrease in vulnerability during the analysis of game performance over time. In the comparative analysis based on changes in the game's nature, Qatar World Cup matches exhibited superior performance across all aspects of the evaluation model compared to A matches. Lastly, in the comparative analysis considering the variations in the opposing team, matches against lower FIFA rankers displayed superior performance in all aspects of the evaluation model in comparison to matches against top FIFA rankers. We hope that the outcomes of this study can serve as essential foundational data for the selection of football team coaches and the development of game strategies, thereby contributing to the enhancement of the team's performance.

Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning (영화 흥행에 영향을 미치는 새로운 변수 개발과 이를 이용한 머신러닝 기반의 주간 박스오피스 예측)

  • Song, Junga;Choi, Keunho;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.67-83
    • /
    • 2018
  • The Korean film industry with significant increase every year exceeded the number of cumulative audiences of 200 million people in 2013 finally. However, starting from 2015 the Korean film industry entered a period of low growth and experienced a negative growth after all in 2016. To overcome such difficulty, stakeholders like production company, distribution company, multiplex have attempted to maximize the market returns using strategies of predicting change of market and of responding to such market change immediately. Since a film is classified as one of experiential products, it is not easy to predict a box office record and the initial number of audiences before the film is released. And also, the number of audiences fluctuates with a variety of factors after the film is released. So, the production company and distribution company try to be guaranteed the number of screens at the opining time of a newly released by multiplex chains. However, the multiplex chains tend to open the screening schedule during only a week and then determine the number of screening of the forthcoming week based on the box office record and the evaluation of audiences. Many previous researches have conducted to deal with the prediction of box office records of films. In the early stage, the researches attempted to identify factors affecting the box office record. And nowadays, many studies have tried to apply various analytic techniques to the factors identified previously in order to improve the accuracy of prediction and to explain the effect of each factor instead of identifying new factors affecting the box office record. However, most of previous researches have limitations in that they used the total number of audiences from the opening to the end as a target variable, and this makes it difficult to predict and respond to the demand of market which changes dynamically. Therefore, the purpose of this study is to predict the weekly number of audiences of a newly released film so that the stakeholder can flexibly and elastically respond to the change of the number of audiences in the film. To that end, we considered the factors used in the previous studies affecting box office and developed new factors not used in previous studies such as the order of opening of movies, dynamics of sales. Along with the comprehensive factors, we used the machine learning method such as Random Forest, Multi Layer Perception, Support Vector Machine, and Naive Bays, to predict the number of cumulative visitors from the first week after a film release to the third week. At the point of the first and the second week, we predicted the cumulative number of visitors of the forthcoming week for a released film. And at the point of the third week, we predict the total number of visitors of the film. In addition, we predicted the total number of cumulative visitors also at the point of the both first week and second week using the same factors. As a result, we found the accuracy of predicting the number of visitors at the forthcoming week was higher than that of predicting the total number of them in all of three weeks, and also the accuracy of the Random Forest was the highest among the machine learning methods we used. This study has implications in that this study 1) considered various factors comprehensively which affect the box office record and merely addressed by other previous researches such as the weekly rating of audiences after release, the weekly rank of the film after release, and the weekly sales share after release, and 2) tried to predict and respond to the demand of market which changes dynamically by suggesting models which predicts the weekly number of audiences of newly released films so that the stakeholders can flexibly and elastically respond to the change of the number of audiences in the film.

Automatic Speech Style Recognition Through Sentence Sequencing for Speaker Recognition in Bilateral Dialogue Situations (양자 간 대화 상황에서의 화자인식을 위한 문장 시퀀싱 방법을 통한 자동 말투 인식)

  • Kang, Garam;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.2
    • /
    • pp.17-32
    • /
    • 2021
  • Speaker recognition is generally divided into speaker identification and speaker verification. Speaker recognition plays an important function in the automatic voice system, and the importance of speaker recognition technology is becoming more prominent as the recent development of portable devices, voice technology, and audio content fields continue to expand. Previous speaker recognition studies have been conducted with the goal of automatically determining who the speaker is based on voice files and improving accuracy. Speech is an important sociolinguistic subject, and it contains very useful information that reveals the speaker's attitude, conversation intention, and personality, and this can be an important clue to speaker recognition. The final ending used in the speaker's speech determines the type of sentence or has functions and information such as the speaker's intention, psychological attitude, or relationship to the listener. The use of the terminating ending has various probabilities depending on the characteristics of the speaker, so the type and distribution of the terminating ending of a specific unidentified speaker will be helpful in recognizing the speaker. However, there have been few studies that considered speech in the existing text-based speaker recognition, and if speech information is added to the speech signal-based speaker recognition technique, the accuracy of speaker recognition can be further improved. Hence, the purpose of this paper is to propose a novel method using speech style expressed as a sentence-final ending to improve the accuracy of Korean speaker recognition. To this end, a method called sentence sequencing that generates vector values by using the type and frequency of the sentence-final ending appearing in the utterance of a specific person is proposed. To evaluate the performance of the proposed method, learning and performance evaluation were conducted with a actual drama script. The method proposed in this study can be used as a means to improve the performance of Korean speech recognition service.