• Title/Summary/Keyword: distance system

Search Result 7,487, Processing Time 0.04 seconds

Application of Remote Sensing Techniques to Survey and Estimate the Standing-Stock of Floating Debris in the Upper Daecheong Lake (원격탐사 기법 적용을 통한 대청호 상류 유입 부유쓰레기 조사 및 현존량 추정 연구)

  • Youngmin Kim;Seon Woong Jang ;Heung-Min Kim;Tak-Young Kim;Suho Bak
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_1
    • /
    • pp.589-597
    • /
    • 2023
  • Floating debris in large quantities from land during heavy rainfall has adverse social, economic, and environmental impacts, but the monitoring system for the concentration area and amount is insufficient. In this study, we proposed an efficient monitoring method for floating debris entering the river during heavy rainfall in Daecheong Lake, the largest water supply source in the central region, and applied remote sensing techniques to estimate the standing-stock of floating debris. To investigate the status of floating debris in the upper of Daecheong Lake, we used a tracking buoy equipped with a low-orbit satellite communication terminal to identify the movement route and behavior characteristics, and used a drone to estimate the potential concentration area and standing-stock of floating debris. The location tracking buoys moved rapidly during the period when the cumulative rainfall for 3 days increased by more than 200 to 300 mm. In the case of Hotan Bridge, which showed the longest distance, it moved about 72.8 km for one day, and the maximum moving speed at this time was 5.71 km/h. As a result of calculating the standing-stock of floating debris using a drone after heavy rainfall, it was found to be 658.8 to 9,165.4 tons, with the largest amount occurring in the Seokhori area. In this study, we were able to identify the main concentrations of floating debris by using location-tracking buoys and drones. It is believed that remote sensing-based monitoring methods, which are more mobile and quicker than traditional monitoring methods, can contribute to reducing the cost of collecting and processing large amounts of floating debris that flows in during heavy rain periods in the future.

Distribution, Preservation Characteristics of Land and River Natural Aggregates in Nonsan City, Korea (논산시 하천 및 육상 골재 자원의 부존 현황과 특성)

  • Hyun Ho Yoon;Sei Sun Hong;Min Han;Jin-Young Lee
    • Economic and Environmental Geology
    • /
    • v.57 no.2
    • /
    • pp.143-159
    • /
    • 2024
  • Natural aggregate is an essential resource for human activities, closely related to construction. The aggregate demand has been increasing annually, and due to the nature of the resource, it is difficult to procure from distant locations. This study identifies the distribution and characteristics of aggregate-bearing areas as part of a municipal-level aggregate resource survey conducted in Nonsan City, Korea, in 2023. Nonsan City is located approximately 35 km straight distance from the Geum River estuary and lies at the passageway of the main stream of the Geum River. The topography of Nonsan City features eastern mountainous areas and western plains, creating an east-high-west-low geomorphic setting, with 33 streams distributed across the city, including tributaries of the Geum River like Nonsan Stream, Noseong Stream, and Ganggyeong Stream. All streams originate from the highlands in the north and east, converge with Nonsan Stream, and then join the west bank of the main stream of the Geum River at the western boundary of Nonsan City. Drilling core results show shallow depths in the highlands to the north and east, deepening towards the west, reaching a maximum depth of 25 m near the main stream of the Geum River. The total reserve of land aggregates is calculated to be 246,789,000 m3, with a developable amount of 172,750,000 m3. The total reserve of river aggregates is 5,236,000 m3, with a developable amount of 3,765,000 m3. The distribution of aggregates varies according to the geomorphic, geologic, and development pattern of the river system. Reserves are scarce in mountainous areas but are abundant in regions with rivers and wide alluvial plains, although reserves appear at depths greater than 4m. The distribution of aggregate resources in Nonsan City is influenced by stream activities and sea level changes, with the tidal range of the Yellow Sea acting as an unfavorable condition for the preservation of aggregate resources.

Development of Yóukè Mining System with Yóukè's Travel Demand and Insight Based on Web Search Traffic Information (웹검색 트래픽 정보를 활용한 유커 인바운드 여행 수요 예측 모형 및 유커마이닝 시스템 개발)

  • Choi, Youji;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.155-175
    • /
    • 2017
  • As social data become into the spotlight, mainstream web search engines provide data indicate how many people searched specific keyword: Web Search Traffic data. Web search traffic information is collection of each crowd that search for specific keyword. In a various area, web search traffic can be used as one of useful variables that represent the attention of common users on specific interests. A lot of studies uses web search traffic data to nowcast or forecast social phenomenon such as epidemic prediction, consumer pattern analysis, product life cycle, financial invest modeling and so on. Also web search traffic data have begun to be applied to predict tourist inbound. Proper demand prediction is needed because tourism is high value-added industry as increasing employment and foreign exchange. Among those tourists, especially Chinese tourists: Youke is continuously growing nowadays, Youke has been largest tourist inbound of Korea tourism for many years and tourism profits per one Youke as well. It is important that research into proper demand prediction approaches of Youke in both public and private sector. Accurate tourism demands prediction is important to efficient decision making in a limited resource. This study suggests improved model that reflects latest issue of society by presented the attention from group of individual. Trip abroad is generally high-involvement activity so that potential tourists likely deep into searching for information about their own trip. Web search traffic data presents tourists' attention in the process of preparation their journey instantaneous and dynamic way. So that this study attempted select key words that potential Chinese tourists likely searched out internet. Baidu-Chinese biggest web search engine that share over 80%- provides users with accessing to web search traffic data. Qualitative interview with potential tourists helps us to understand the information search behavior before a trip and identify the keywords for this study. Selected key words of web search traffic are categorized by how much directly related to "Korean Tourism" in a three levels. Classifying categories helps to find out which keyword can explain Youke inbound demands from close one to far one as distance of category. Web search traffic data of each key words gathered by web crawler developed to crawling web search data onto Baidu Index. Using automatically gathered variable data, linear model is designed by multiple regression analysis for suitable for operational application of decision and policy making because of easiness to explanation about variables' effective relationship. After regression linear models have composed, comparing with model composed traditional variables and model additional input web search traffic data variables to traditional model has conducted by significance and R squared. after comparing performance of models, final model is composed. Final regression model has improved explanation and advantage of real-time immediacy and convenience than traditional model. Furthermore, this study demonstrates system intuitively visualized to general use -Youke Mining solution has several functions of tourist decision making including embed final regression model. Youke Mining solution has algorithm based on data science and well-designed simple interface. In the end this research suggests three significant meanings on theoretical, practical and political aspects. Theoretically, Youke Mining system and the model in this research are the first step on the Youke inbound prediction using interactive and instant variable: web search traffic information represents tourists' attention while prepare their trip. Baidu web search traffic data has more than 80% of web search engine market. Practically, Baidu data could represent attention of the potential tourists who prepare their own tour as real-time. Finally, in political way, designed Chinese tourist demands prediction model based on web search traffic can be used to tourism decision making for efficient managing of resource and optimizing opportunity for successful policy.

Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.111-126
    • /
    • 2017
  • Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.

Edge to Edge Model and Delay Performance Evaluation for Autonomous Driving (자율 주행을 위한 Edge to Edge 모델 및 지연 성능 평가)

  • Cho, Moon Ki;Bae, Kyoung Yul
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.191-207
    • /
    • 2021
  • Up to this day, mobile communications have evolved rapidly over the decades, mainly focusing on speed-up to meet the growing data demands of 2G to 5G. And with the start of the 5G era, efforts are being made to provide such various services to customers, as IoT, V2X, robots, artificial intelligence, augmented virtual reality, and smart cities, which are expected to change the environment of our lives and industries as a whole. In a bid to provide those services, on top of high speed data, reduced latency and reliability are critical for real-time services. Thus, 5G has paved the way for service delivery through maximum speed of 20Gbps, a delay of 1ms, and a connecting device of 106/㎢ In particular, in intelligent traffic control systems and services using various vehicle-based Vehicle to X (V2X), such as traffic control, in addition to high-speed data speed, reduction of delay and reliability for real-time services are very important. 5G communication uses high frequencies of 3.5Ghz and 28Ghz. These high-frequency waves can go with high-speed thanks to their straightness while their short wavelength and small diffraction angle limit their reach to distance and prevent them from penetrating walls, causing restrictions on their use indoors. Therefore, under existing networks it's difficult to overcome these constraints. The underlying centralized SDN also has a limited capability in offering delay-sensitive services because communication with many nodes creates overload in its processing. Basically, SDN, which means a structure that separates signals from the control plane from packets in the data plane, requires control of the delay-related tree structure available in the event of an emergency during autonomous driving. In these scenarios, the network architecture that handles in-vehicle information is a major variable of delay. Since SDNs in general centralized structures are difficult to meet the desired delay level, studies on the optimal size of SDNs for information processing should be conducted. Thus, SDNs need to be separated on a certain scale and construct a new type of network, which can efficiently respond to dynamically changing traffic and provide high-quality, flexible services. Moreover, the structure of these networks is closely related to ultra-low latency, high confidence, and hyper-connectivity and should be based on a new form of split SDN rather than an existing centralized SDN structure, even in the case of the worst condition. And in these SDN structural networks, where automobiles pass through small 5G cells very quickly, the information change cycle, round trip delay (RTD), and the data processing time of SDN are highly correlated with the delay. Of these, RDT is not a significant factor because it has sufficient speed and less than 1 ms of delay, but the information change cycle and data processing time of SDN are factors that greatly affect the delay. Especially, in an emergency of self-driving environment linked to an ITS(Intelligent Traffic System) that requires low latency and high reliability, information should be transmitted and processed very quickly. That is a case in point where delay plays a very sensitive role. In this paper, we study the SDN architecture in emergencies during autonomous driving and conduct analysis through simulation of the correlation with the cell layer in which the vehicle should request relevant information according to the information flow. For simulation: As the Data Rate of 5G is high enough, we can assume the information for neighbor vehicle support to the car without errors. Furthermore, we assumed 5G small cells within 50 ~ 250 m in cell radius, and the maximum speed of the vehicle was considered as a 30km ~ 200 km/hour in order to examine the network architecture to minimize the delay.

A Review of Current Status and Placeness on the Yusang-Goksu Ruins in Hwanggak-dong, Geumma, Iksan (익산 금마 황각동 유상곡수 유적 일대의 현황과 장소성에 대한 일고찰)

  • Rho, Jae-Hyun;Han, Min-Soon;Seo, Youn-Mi;Park, Yool-Jin
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.40 no.3
    • /
    • pp.20-35
    • /
    • 2022
  • This study was conducted on the locational results of the 'Yusanggoksu(流觴曲水)' petroglyphs located in Hwanggak-dong(黃閣洞), Shinsong-ri, Geumma-myeon, Iksan-si through literature study, analysis of old maps and aerial photos, field observations, drone photography, elevation surveys, and interviews with residents. It was attempted for the purpose of illuminating and preserving the relics of the domestic Yusanggoksu garden by clarifying the spatiality of this place by tracing the spatiality and examining the possibility of enjoying the Yusanggoksu water system in this place. The conclusion of this study is as follows. The area around Hwanggak-dong, where the Yusanggoksu remains, has been selected as the most beautiful scenic spot in Iksan in various documents. The origin of 'Hwanggak' is considered to be closely related to the nickname of Uijeongbu(議政府). In other words, he paid attention to the relationship with Yanggok, So Se-yang(蘇世讓), who served as Chan-seong Jwa(左贊成). In particular, he paid attention to the relationship with his birthplace, Taeheojeong, a separate book, and Toehyudang, a retreat hall), tombs, and posthumous Confucian academies were distributed in the vicinity. Haseo-dae(荷鋤臺), a wide rock on which a hoe is hung on a rock after field work, seems to express a leisurely rural life and a simple and hermit life, based on the examples of Chinese and Korean poetry. The dark blood on the upper part of the Seobwi Rock with the inscription 'Yusanggoksu', which is the core of this site, is identified as a chailgong(遮日孔) to support the water system, and Ilgan-pavilion and Mojeong(茅亭) nearby are to support the yusanggoksu. It seems to have performed a spatial function for The inscription 'Hwanggak-dong' engraved on the front of Deungzanbawi is the gateway to Hwanggakdongcheon(黃閣洞天) and identified the idealized world existing in the village. Judging from the documentary records of the Iksan-gun 『Chongswaelog(叢瑣錄)』, the rock letters 'Hwanggak-dong' and 'Haseodae' were engraved on March 29, 1901, the 5th year of Gwangmu, the 5th year of the Korean Empire, by Iksan-gun Governor Oh Haeng-mook(吳宖默) and his acquaintance Seokseong Kim In-gil(金寅吉) Confirmed. Also, considering the tense of Lee Bong-gu's 「Hwanggakdongun(黃閣洞韻)」 and So Jin-deok, a descendant of Yanggok, 「Hwanggakdongsihoe(黃閣洞詩會)」, it is presumed that it was related to Goksuyeon(曲水宴) in Hwanggak-dong. It can be inferred that the current affairs meetings were held at least until the early days of Japanese colonial rule. Meanwhile, the maximum width of the current curved waterway was calculated as 11.3m and the transverse slope was 15.0%. If so, it is estimated that the width and extension distance of the curved waterway would have been much longer. Judging from the use of mochun(暮春), drinking and poetry, the tense 'Hwanggakdongsihoe' related to the Yusanggoksu relics in Hwanggak-dong, and the existence of a pavilion presumed to be Yusangjeong(流觴亭) called Ilgan-pavilion in the nearby Yusanggoksu site It is confirmed that it was a space where Yusanggoksuyeon(流觴曲水宴) spread at least until the end of the Joseon Dynasty. Unfortunately, it remains a limitation of the study that it cannot be confirmed due to lack of data on the rock characters of 'Yusanggoksu' and those who enjoyed it before the end of the Joseon Dynasty. This is an area that needs to be elucidated through continuous efforts to find data on this issue in the future.

Characteristics and classification of paddy soils on the Gimje-Mangyeong plains (김제만경평야(金堤萬頃平野)의 답토양특성(沓土壤特性)과 그 분류(分類)에 관(關)한 연구(硏究))

  • Shin, Yong Hwa
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.5 no.2
    • /
    • pp.1-38
    • /
    • 1972
  • This study, designed to establish a classification system of paddy soils and suitability groups on productivity and management of paddy land based on soil characteristics, has been made for the paddy soils on the Gimje-Mangyeong plains. The morphological, physical and chemical properties of the 15 paddy soil series found on these plains are briefly as follows: Ten soil series (Baeggu, Bongnam, Buyong, Gimje, Gongdeog, Honam, Jeonbug, Jisan, Mangyeong and Suam) have a B horizon (cambic B), two soil series (Geugrag and Hwadong) have a Bt horizon (argillic B), and three soil series (Gwanghwal, Hwagye and Sindab) have no B or Bt horizons. Uniquely, both the Bongnam and Gongdeog series contain a muck layer in the lower part of subsoil. Four soil series (Baeggu, Gongdeog, Gwanghwal and Sindab) generally are bluish gray and dark gray, and eight soil series (Bongnam, Buyong, Gimje, Honam, Jeonbug, Jisan, Mangyeong and Suam) are either gray or grayish brown. Three soil series (Geugrag, Hwadong and Hwagye), however, are partially gleyed in the surface and subsurface, but have a yellowish brown to brown subsoil or substrata. Seven soil series (Bongnam, Buyong, Geugrag, Gimje, Gongdeog, Honam and Hwadong) are of fine clayey texture, three soil series (Baeggu, Jeonbug and Jisan) belong to fine loamy and fine silty, three soil series (Gwanghwal, Mangyeong and Suam) to coarse loamy and coarse silty, and two soil series (Hwagye and Sindab) to sandy and sandy skeletal texture classes. The carbon content of the surface soil ranges from 0.29 to 2.18 percent, mostly 1.0 to 2.0 percent. The total nitrogen content of the surface soil ranges from 0.03 to 0.25 percent, showing a tendency to decrease irregularly with depth. The C/N ratio in the surface soil ranges from 4.6 to 15.5, dominantly from 8 to 10. The C/N ratio in the subsoil and substrata, however, has a wide range from 3.0 to 20.25. The soil reaction ranges from 4.5 to 8.0. All soil series except the Gwanghwal and Mangyeong series belong to the acid reaction class. The cation exchange cpacity in the surface soil ranges from 5 to 13 milliequivalents per 100 grams of soil, and in all the subsoil and substrata except those of a sandy texture, from 10 to 20 milliequivalents per 100 grams of soil. The base saturation of the soil series except Baeggu and Gongdeog is more than 60 percent. The active iron content of the surface soil ranges from 0.45 to 1.81 ppm, easily-reduceable manganese from 15 to 148 ppm, and available silica from 36 to 366 ppm. The iron and manganese are generally accumulated in a similar position (10 to 70cm. depth), and silica occurs in the same horizon with that of iron and manganese, or in the deeper horizons in the soil profile. The properties of each soil series extending from the sea shore towards the continental plains change with distance and they are related with distance (x) as follows: y(surface soil, clay content) = $$-0.2491x^2+6.0388x-1.1251$$ y(subsoil or subsurface soil, clay content) = $$-0.31646x^2+7.84818x-2.50008$$ y(surface soil, organic carbon content) = $$-0.0089x^2+0.2192x+0.1366$$ y(subsoil or subsurface soil, pH) = $$-0.0178x^2-0.04534x+8.3531$$ Soil profile development, soil color, depositional and organic layers, soil texture and soil reaction etc. are thought to be the major items that should be considered in a paddy soil classification. It was found that most of the soils belonging to the moderately well, somewhat poorly and poorly drained fine and medium textured soils and moderately deep fine textured soils over coarse materials, produce higher paddy yields in excess of 3,750 kg/ha. and most of the soils belonging to the coarse textured soils, well drained fine textured soils, moderately deep medium textured soils over coarse materials and saline soils, produce yields less than 3,750kg/ha. Soil texture of the profile, available soil depth, salinity and gleying of the surface and subsurface soils etc. seem to be the major factors determining rice yields, and these factors are considered when establishing suitability groups for paddy land. The great group, group, subgroup, family and series are proposed for the classification categories of paddy soils. The soil series is the basic category of the classification. The argillic horizon (Bt horizon) and cambic horizon (B horizon) are proposed as two diagnostic horizons of great group level for the determination of the morphological properties of soils in the classification. The specific soil characteristics considered in the group and subgroup levels are soil color of the profile (bluish gray, gray or yellowish brown), salinity (salic), depositonal (fluvic) and muck layers (mucky), and gleying of surface and subsurface soils (gleyic). The family levels are classified on the basis of soil reaction, soil texture and gravel content of the profile. The definitions are given on each classification category, diagnostic horizons and specific soil characteristics respectively. The soils on these plains are classified in eight subgroups and examined under the existing classification system. Further, the suitability group, can be divided into two major categories, suitability class and subclass. The soils within a suitability class are similar in potential productivity and limitation on use and management. Class 1 through 4 are distinguished from each other by combination of soil characteristics. Subclasses are divided from classes that have the same kind of dominant limitations such as slope(e), wettness(w), sandy(s), gravels(g), salinity(t) and non-gleying of the surface and subsurface soils(n). The above suitability classes and subclasses are examined, and the definitions are given. Seven subclasses are found on these plains for paddy soils. The classification and suitability group of 15 paddy soil series on the Gimje-Mangyeong plains may now be tabulated as follows.

  • PDF

Methodology for Identifying Issues of User Reviews from the Perspective of Evaluation Criteria: Focus on a Hotel Information Site (사용자 리뷰의 평가기준 별 이슈 식별 방법론: 호텔 리뷰 사이트를 중심으로)

  • Byun, Sungho;Lee, Donghoon;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.23-43
    • /
    • 2016
  • As a result of the growth of Internet data and the rapid development of Internet technology, "big data" analysis has gained prominence as a major approach for evaluating and mining enormous data for various purposes. Especially, in recent years, people tend to share their experiences related to their leisure activities while also reviewing others' inputs concerning their activities. Therefore, by referring to others' leisure activity-related experiences, they are able to gather information that might guarantee them better leisure activities in the future. This phenomenon has appeared throughout many aspects of leisure activities such as movies, traveling, accommodation, and dining. Apart from blogs and social networking sites, many other websites provide a wealth of information related to leisure activities. Most of these websites provide information of each product in various formats depending on different purposes and perspectives. Generally, most of the websites provide the average ratings and detailed reviews of users who actually used products/services, and these ratings and reviews can actually support the decision of potential customers in purchasing the same products/services. However, the existing websites offering information on leisure activities only provide the rating and review based on one stage of a set of evaluation criteria. Therefore, to identify the main issue for each evaluation criterion as well as the characteristics of specific elements comprising each criterion, users have to read a large number of reviews. In particular, as most of the users search for the characteristics of the detailed elements for one or more specific evaluation criteria based on their priorities, they must spend a great deal of time and effort to obtain the desired information by reading more reviews and understanding the contents of such reviews. Although some websites break down the evaluation criteria and direct the user to input their reviews according to different levels of criteria, there exist excessive amounts of input sections that make the whole process inconvenient for the users. Further, problems may arise if a user does not follow the instructions for the input sections or fill in the wrong input sections. Finally, treating the evaluation criteria breakdown as a realistic alternative is difficult, because identifying all the detailed criteria for each evaluation criterion is a challenging task. For example, if a review about a certain hotel has been written, people tend to only write one-stage reviews for various components such as accessibility, rooms, services, or food. These might be the reviews for most frequently asked questions, such as distance between the nearest subway station or condition of the bathroom, but they still lack detailed information for these questions. In addition, in case a breakdown of the evaluation criteria was provided along with various input sections, the user might only fill in the evaluation criterion for accessibility or fill in the wrong information such as information regarding rooms in the evaluation criteria for accessibility. Thus, the reliability of the segmented review will be greatly reduced. In this study, we propose an approach to overcome the limitations of the existing leisure activity information websites, namely, (1) the reliability of reviews for each evaluation criteria and (2) the difficulty of identifying the detailed contents that make up the evaluation criteria. In our proposed methodology, we first identify the review content and construct the lexicon for each evaluation criterion by using the terms that are frequently used for each criterion. Next, the sentences in the review documents containing the terms in the constructed lexicon are decomposed into review units, which are then reconstructed by using the evaluation criteria. Finally, the issues of the constructed review units by evaluation criteria are derived and the summary results are provided. Apart from the derived issues, the review units are also provided. Therefore, this approach aims to help users save on time and effort, because they will only be reading the relevant information they need for each evaluation criterion rather than go through the entire text of review. Our proposed methodology is based on the topic modeling, which is being actively used in text analysis. The review is decomposed into sentence units rather than considering the whole review as a document unit. After being decomposed into individual review units, the review units are reorganized according to each evaluation criterion and then used in the subsequent analysis. This work largely differs from the existing topic modeling-based studies. In this paper, we collected 423 reviews from hotel information websites and decomposed these reviews into 4,860 review units. We then reorganized the review units according to six different evaluation criteria. By applying these review units in our methodology, the analysis results can be introduced, and the utility of proposed methodology can be demonstrated.

Spatio-Temporal Monitoring of Soil CO2 Fluxes and Concentrations after Artificial CO2 Release (인위적 CO2 누출에 따른 토양 CO2 플럭스와 농도의 시공간적 모니터링)

  • Kim, Hyun-Jun;Han, Seung Hyun;Kim, Seongjun;Yun, Hyeon Min;Jun, Seong-Chun;Son, Yowhan
    • Journal of Environmental Impact Assessment
    • /
    • v.26 no.2
    • /
    • pp.93-104
    • /
    • 2017
  • CCS (Carbon Capture and Storage) is a technical process to capture $CO_2$ from industrial and energy-based sources, to transfer and sequestrate impressed $CO_2$ in geological formations, oceans, or mineral carbonates. However, potential $CO_2$ leakage exists and causes environmental problems. Thus, this study was conducted to analyze the spatial and temporal variations of $CO_2$ fluxes and concentrations after artificial $CO_2$ release. The Environmental Impact Evaluation Test Facility (EIT) was built in Eumseong, Korea in 2015. Approximately 34kg $CO_2$ /day/zone were injected at Zones 2, 3, and 4 among the total of 5 zones from October 26 to 30, 2015. $CO_2$ fluxes were measured every 30 minutes at the surface at 0m, 1.5m, 2.5m, and 10m from the $CO_2$ releasing well using LI-8100A until November 13, 2015, and $CO_2$ concentrations were measured once a day at 15cm, 30cm, and 60cm depths at every 0m, 1.5m, 2.5m, 5m, and 10m from the well using GA5000 until November 28, 2015. $CO_2$ flux at 0m from the well started increasing on the fifth day after $CO_2$ release started, and continued to increase until November 13 even though the artificial $CO_2$ release stopped. $CO_2$ fluxes measured at 2.5m, 5.0m, and 10m from the well were not significantly different with each other. On the other hand, soil $CO_2$ concentration was shown as 38.4% at 60cm depth at 0m from the well in Zone 3 on the next day after $CO_2$ release started. Soil $CO_2$ was horizontally spreaded overtime, and detected up to 5m away from the well in all zones until $CO_2$ release stopped. Also, soil $CO_2$ concentrations at 30cm and 60cm depths at 0m from the well were measured similarly as $50.6{\pm}25.4%$ and $55.3{\pm}25.6%$, respectively, followed by 30cm depth ($31.3{\pm}17.2%$) which was significantly lower than those measured at the other depths on the final day of $CO_2$ release period. Soil $CO_2$ concentrations at all depths in all zones were gradually decreased for about 1 month after $CO_2$ release stopped, but still higher than those of the first day after $CO_2$ release stared. In conclusion, the closer the distance from the well and the deeper the depth, the higher $CO_2$ fluxes and concentrations occurred. Also, long-term monitoring should be required because the leaked $CO_2$ gas can remains in the soil for a long time even if the leakage stopped.

Feasibility of Deep Learning Algorithms for Binary Classification Problems (이진 분류문제에서의 딥러닝 알고리즘의 활용 가능성 평가)

  • Kim, Kitae;Lee, Bomi;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.95-108
    • /
    • 2017
  • Recently, AlphaGo which is Bakuk (Go) artificial intelligence program by Google DeepMind, had a huge victory against Lee Sedol. Many people thought that machines would not be able to win a man in Go games because the number of paths to make a one move is more than the number of atoms in the universe unlike chess, but the result was the opposite to what people predicted. After the match, artificial intelligence technology was focused as a core technology of the fourth industrial revolution and attracted attentions from various application domains. Especially, deep learning technique have been attracted as a core artificial intelligence technology used in the AlphaGo algorithm. The deep learning technique is already being applied to many problems. Especially, it shows good performance in image recognition field. In addition, it shows good performance in high dimensional data area such as voice, image and natural language, which was difficult to get good performance using existing machine learning techniques. However, in contrast, it is difficult to find deep leaning researches on traditional business data and structured data analysis. In this study, we tried to find out whether the deep learning techniques have been studied so far can be used not only for the recognition of high dimensional data but also for the binary classification problem of traditional business data analysis such as customer churn analysis, marketing response prediction, and default prediction. And we compare the performance of the deep learning techniques with that of traditional artificial neural network models. The experimental data in the paper is the telemarketing response data of a bank in Portugal. It has input variables such as age, occupation, loan status, and the number of previous telemarketing and has a binary target variable that records whether the customer intends to open an account or not. In this study, to evaluate the possibility of utilization of deep learning algorithms and techniques in binary classification problem, we compared the performance of various models using CNN, LSTM algorithm and dropout, which are widely used algorithms and techniques in deep learning, with that of MLP models which is a traditional artificial neural network model. However, since all the network design alternatives can not be tested due to the nature of the artificial neural network, the experiment was conducted based on restricted settings on the number of hidden layers, the number of neurons in the hidden layer, the number of output data (filters), and the application conditions of the dropout technique. The F1 Score was used to evaluate the performance of models to show how well the models work to classify the interesting class instead of the overall accuracy. The detail methods for applying each deep learning technique in the experiment is as follows. The CNN algorithm is a method that reads adjacent values from a specific value and recognizes the features, but it does not matter how close the distance of each business data field is because each field is usually independent. In this experiment, we set the filter size of the CNN algorithm as the number of fields to learn the whole characteristics of the data at once, and added a hidden layer to make decision based on the additional features. For the model having two LSTM layers, the input direction of the second layer is put in reversed position with first layer in order to reduce the influence from the position of each field. In the case of the dropout technique, we set the neurons to disappear with a probability of 0.5 for each hidden layer. The experimental results show that the predicted model with the highest F1 score was the CNN model using the dropout technique, and the next best model was the MLP model with two hidden layers using the dropout technique. In this study, we were able to get some findings as the experiment had proceeded. First, models using dropout techniques have a slightly more conservative prediction than those without dropout techniques, and it generally shows better performance in classification. Second, CNN models show better classification performance than MLP models. This is interesting because it has shown good performance in binary classification problems which it rarely have been applied to, as well as in the fields where it's effectiveness has been proven. Third, the LSTM algorithm seems to be unsuitable for binary classification problems because the training time is too long compared to the performance improvement. From these results, we can confirm that some of the deep learning algorithms can be applied to solve business binary classification problems.