• Title/Summary/Keyword: Location Based System

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important source for target marketing and personalized advertising on digital marketing channels such as email, mobile, and social media. However, collecting these demographics has gradually become difficult because users' activities are anonymous in many cases. Although a marketing department can obtain demographics through online or offline surveys, these approaches are expensive, slow, and likely to include false statements. Clickstream data is the record an Internet user leaves behind while visiting websites. As the user clicks anywhere on a webpage, the activity is logged in semi-structured website log files. Such data shows which pages users visited, how long they stayed, how often and when they usually visited, which sites they prefer, what keywords they used to find a site, whether they purchased anything, and so forth. For this reason, some researchers have tried to infer the demographics of Internet users from their clickstream data. They derived various independent variables likely to be correlated with the demographics, including search keywords, frequency and intensity by time, day, and month, variety of websites visited, and text information from the web pages visited. The demographic attributes to be predicted also vary from paper to paper and cover gender, age, job, location, income, education, marital status, and presence of children. A variety of data mining methods, such as LSA, SVM, decision trees, neural networks, logistic regression, and k-nearest neighbors, have been used to build prediction models. However, this research has not yet identified which data mining method is appropriate for predicting each demographic variable. Moreover, the independent variables studied so far need to be reviewed, combined as needed, and evaluated in order to build the best prediction model. 
The objective of this study is to choose the clickstream attributes most likely to be correlated with the demographics, based on the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. Drawing on previous research, 64 clickstream attributes are used to predict these demographic attributes. The overall process of predictive model building is composed of 4 steps. In the first step, we create user profiles that include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction on the clickstream variables to address the curse of dimensionality and the overfitting problem. We use three approaches, based on decision trees, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable, using SVM, neural networks, and logistic regression. The last step evaluates the alternative models in terms of accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for the prediction process, and 5-fold cross validation was conducted to enhance the reliability of the experiments. The experimental results show that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best with decision tree based dimension reduction and a neural network, whereas the prediction of gender and marital status is most accurate with SVM and no dimension reduction. 
We conclude that the online behavior of Internet users, captured through clickstream data analysis, can be used to predict their demographics and can thereby be applied to digital marketing.
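The evaluation step described above (compare alternative models with 5-fold cross validation and keep the most accurate one) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' SPSS Modeler workflow: the two toy classifiers stand in for the SVM, neural network, and logistic regression models of the study, and the data, labels, and function names are all hypothetical.

```python
from collections import Counter

def k_fold_indices(n, k=5):
    """Split indices 0..n-1 into k contiguous folds of near-equal size."""
    sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds

def majority_model(train_X, train_y):
    """Baseline stand-in: always predict the most common training label."""
    label = Counter(train_y).most_common(1)[0][0]
    return lambda x: label

def nearest_neighbor_model(train_X, train_y):
    """Stand-in classifier: 1-nearest neighbour on squared Euclidean distance."""
    def predict(x):
        dists = [sum((a - b) ** 2 for a, b in zip(x, row)) for row in train_X]
        return train_y[dists.index(min(dists))]
    return predict

def cross_val_accuracy(fit, X, y, k=5):
    """Mean held-out accuracy of the model builder `fit` over k folds."""
    scores = []
    for fold in k_fold_indices(len(X), k):
        held = set(fold)
        train_X = [X[i] for i in range(len(X)) if i not in held]
        train_y = [y[i] for i in range(len(X)) if i not in held]
        predict = fit(train_X, train_y)
        correct = sum(predict(X[i]) == y[i] for i in fold)
        scores.append(correct / len(fold))
    return sum(scores) / k

def select_best(candidates, X, y, k=5):
    """Score every candidate with k-fold CV and return the most accurate one."""
    scores = {name: cross_val_accuracy(fit, X, y, k) for name, fit in candidates.items()}
    return max(scores, key=scores.get), scores

# Hypothetical one-feature user profiles with a gender label.
X = [[0.0], [1.0], [0.1], [0.9], [0.2], [1.1], [0.0], [1.0], [0.1], [0.9]]
y = ["F", "M", "F", "M", "F", "M", "F", "M", "F", "M"]

candidates = {"majority": majority_model, "nearest_neighbor": nearest_neighbor_model}
best, scores = select_best(candidates, X, y, k=5)
print(best, scores)
```

In the study this comparison is repeated per demographic attribute and per dimension-reduction approach, which is why a different method can win for age than for gender or marital status.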

Derivation of Digital Music's Ranking Change Through Time Series Clustering (시계열 군집분석을 통한 디지털 음원의 순위 변화 패턴 분류)

  • Yoo, In-Jin;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.171-191
    • /
    • 2020
  • This study focuses on digital music, which is the most valuable cultural asset in modern society and occupies a particularly important position in the flow of the Korean Wave. Ranking data were collected from the "Gaon Chart," a well-established music chart in Korea, covering the changes in ranking of the music that entered the chart over 73 weeks. Patterns with similar characteristics were then derived through time series cluster analysis, and a descriptive analysis was performed on the notable features of each pattern. The research process is as follows. First, in the data collection stage, time series data were collected to track the ranking changes of digital music. In the data processing stage, the collected data were matched with the rankings over time, and the music titles and artist names were cleaned. The analysis was then performed in two sequential stages, exploratory analysis and explanatory analysis. The data collection period was limited to the period before 'the music bulk buying phenomenon', a reliability issue related to music rankings in Korea. Specifically, it covers 73 weeks, from the first week (December 31, 2017 to January 6, 2018) to the last week (May 19 to May 25, 2019). The analysis targets were limited to digital music released in Korea. Unlike the private music charts serviced in Korea, the Gaon Chart is approved by government agencies and has basic reliability, so it can be considered to have more public confidence than the ranking information provided by other services. The contents of the collected data are as follows. 
For the music that entered the top 100 of the chart within the collection period, data were collected on the period and ranking, the name of the music, the name of the artist, the name of the album, the Gaon index, the production company, and the distribution company. Data collection yielded 7,300 chart entries, the top 100 for each of the 73 weeks. Because digital music frequently stays on the chart for more than two weeks, duplicates were removed during pre-processing: the number and location of duplicated entries were checked with a duplicate check function and then deleted to form the analysis data. This produced a list of 742 unique songs from the 7,300 entries. A total of 16 patterns were derived through time series cluster analysis of the ranking changes. Based on these patterns, two representative patterns were identified: 'Steady Seller' and 'One-Hit Wonder'. The two patterns were further subdivided into five patterns according to the survival period of the music and its ranking. The important characteristics of each pattern are as follows. First, the artist's superstar effect and the bandwagon effect were strong in the one-hit wonder pattern. Consumers are therefore strongly influenced by the superstar effect and the bandwagon effect when they choose digital music. Second, the Steady Seller pattern identified music that consumers have chosen for a very long time. We also examined which patterns were chosen most often by consumers. Contrary to popular belief, the steady seller: mid-term pattern, not the one-hit wonder pattern, received the most choices from consumers. 
Particularly noteworthy is that the 'Climbing the Chart' phenomenon, which runs contrary to the existing pattern, was confirmed within the steady-seller pattern. This study focuses on the change in music rankings over time, a relatively neglected area of digital music research. In addition, it attempts a new approach to music research by subdividing the patterns of ranking change rather than predicting the success and ranking of music.
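The pre-processing described above, in which repeated weekly chart entries are collapsed into unique songs with a ranking history, can be sketched as follows. This is only an illustration under assumed field names: the row layout `(week, rank, title, artist)` and the sample rows are hypothetical, and the clustering step itself is not shown.

```python
from collections import defaultdict

# Hypothetical weekly chart rows: (week, rank, title, artist).
chart_rows = [
    (1, 1, "Song A", "Artist X"),
    (1, 2, "Song B", "Artist Y"),
    (2, 1, "Song B", "Artist Y"),
    (2, 2, "Song A", "Artist X"),
    (3, 1, "Song B", "Artist Y"),
    (3, 2, "Song C", "Artist Z"),
]

def rank_trajectories(rows):
    """Group rows by (title, artist) so each song appears once with its weekly ranks."""
    series = defaultdict(dict)
    for week, rank, title, artist in rows:
        series[(title, artist)][week] = rank
    return dict(series)

trajectories = rank_trajectories(chart_rows)

# Survival period on the chart, one of the features used to subdivide patterns.
weeks_on_chart = {song: len(ranks) for song, ranks in trajectories.items()}

print(len(chart_rows), "entries ->", len(trajectories), "unique songs")
```

Each value in `trajectories` is the per-song time series that a time series clustering method would take as input.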

Improving Usage of the Korea Meteorological Administration's Digital Forecasts in Agriculture: 2. Refining the Distribution of Precipitation Amount (기상청 동네예보의 영농활용도 증진을 위한 방안: 2. 강수량 분포 상세화)

  • Kim, Dae-Jun;Yun, Jin I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.15 no.3
    • /
    • pp.171-177
    • /
    • 2013
  • The purpose of this study is to find a scheme for scaling down the KMA (Korea Meteorological Administration) digital precipitation maps to a grid cell resolution comparable to the rural landscape scale in Korea. We propose a two-step procedure called RATER (Radar Assisted Topography and Elevation Revision), based on both radar echo data and a mountain precipitation model. In this scheme, the radar reflection intensity at a constant altitude of 1.5 km is first applied to the KMA local analysis and prediction system (KLAPS) 5 km grid cells to obtain 1 km resolution. In the second step, the elevation and topography effect, represented by the Parameter-elevation Regressions on Independent Slopes Model (PRISM) on a 270 m digital elevation model (DEM), is applied to the 1 km resolution data to produce a 270 m precipitation map. An experimental watershed with a catchment area of about $50\,\mathrm{km}^2$ was selected to evaluate this scheme, and automated rain gauges were deployed at 13 locations with various elevations and slope aspects. Nineteen cases with 1 mm or more precipitation per day were collected from January to May 2013, and the corresponding KLAPS daily precipitation data were treated with the second step procedure. For the first step, the 24-hour integrated radar echo data were applied to the KLAPS daily precipitation to produce 1 km resolution data across the watershed. The estimated precipitation at each 1 km grid cell was then regarded as the real-world precipitation observed at the center of the grid cell in order to derive the elevation regressions in the PRISM step. We produced digital precipitation maps for all 19 cases using RATER and extracted the grid cell values corresponding to the 13 points from the maps to compare with the observed data. 
For the cases with 10 mm or more of observed precipitation, the precipitation estimated with RATER improved significantly at all 13 sites compared with the untreated KLAPS 5 km data. In particular, RMSE was reduced by 35% for cases with 30 mm or more of observed precipitation.
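The RMSE comparison reported above can be illustrated with a short calculation. The sketch below assumes the reduction is computed as the relative drop in RMSE from the untreated estimate to the refined estimate; the gauge values are invented for illustration and are not the study's data.

```python
import math

def rmse(estimated, observed):
    """Root-mean-square error between estimates and gauge observations."""
    return math.sqrt(sum((e - o) ** 2 for e, o in zip(estimated, observed)) / len(observed))

def rmse_reduction_percent(baseline, refined, observed):
    """Relative RMSE reduction (%) of the refined estimate over the baseline."""
    base = rmse(baseline, observed)
    return 100.0 * (base - rmse(refined, observed)) / base

# Hypothetical daily precipitation (mm) at three gauge sites.
observed = [30.0, 40.0, 50.0]
coarse = [26.0, 44.0, 46.0]    # stand-in for untreated 5 km grid values
refined = [28.0, 42.0, 48.0]   # stand-in for downscaled 270 m values

reduction = rmse_reduction_percent(coarse, refined, observed)
print(f"RMSE reduction: {reduction:.1f}%")
```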

Analysis of dosimetric leaf gap variation on dose rate variation for dynamic IMRT (동적 세기조절방사선 치료 시 선량률 변화에 따른 선량학적엽간격 변화 분석)

  • Yang, Myung Sic;Park, Ju Kyeong;Lee, Seung Hun;Kim, Yang Su;Lee, Sun Young;Cha, Seok Yong
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.28 no.1
    • /
    • pp.47-55
    • /
    • 2016
  • To evaluate the position accuracy of the MLC, this study analyzed the variation of the dosimetric leaf gap (DLG) and the MLC transmission factor, which reflect the location of the MLC leaves, according to the dose rate variation for dynamic IMRT. We used 6 MV and 10 MV X-ray beams from a linear accelerator with a Millennium 120 MLC system. We measured the variation of the DLG and the MLC transmission factor at a depth of 10 cm in a water phantom by varying the dose rate to 200, 300, 400, 500 and 600 MU/min using the CC13 and FC-65G chambers. For the 6 MV X-ray beam, the difference rates relative to the value measured at 400 MU/min were -2.59, -1.89, 0.00, -0.58 and -2.89% at dose rates of 200, 300, 400, 500 and 600 MU/min, respectively. For the 10 MV X-ray beam, the difference rates were -2.52, -1.69, 0.00, +1.28 and -1.98%, respectively. The difference rate of the MLC transmission factor was within about $\pm 1\%$ of the measured values for both energies and all dose rates. The difference in the MLC transmission factor according to the dose rate variation is negligible, but the difference in the DLG was found to be large. Therefore, randomly changing the dose rate during dynamic IMRT may significantly affect the dose delivered to the tumor. Keeping the dose rate unchanged during dynamic IMRT is therefore expected to give more accurate radiation therapy.
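The difference rates quoted above are expressed relative to the 400 MU/min measurement. A minimal sketch of that calculation is shown below, assuming the rate is the percentage deviation from the baseline value; the DLG values in millimetres are hypothetical and do not come from the paper.

```python
def difference_rates(values_by_dose_rate, baseline_rate=400):
    """Percent difference of each measurement from the baseline dose-rate value."""
    baseline = values_by_dose_rate[baseline_rate]
    return {
        rate: 100.0 * (value - baseline) / baseline
        for rate, value in values_by_dose_rate.items()
    }

# Hypothetical DLG measurements (mm) keyed by dose rate (MU/min).
dlg_mm = {200: 1.95, 400: 2.00, 600: 1.90}

rates = difference_rates(dlg_mm)
for dose_rate, pct in sorted(rates.items()):
    print(f"{dose_rate} MU/min: {pct:+.2f}%")
```

By construction the baseline entry is always 0.00%, matching the 0.00 reported at 400 MU/min for both beam energies.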

Prediction of Isothermal and Reacting Flows in Widely-Spaced Coaxial Jet, Diffusion-Flame Combustor (큰 지름비를 가지는 동축제트 확산화염 연소기내의 등온 및 연소 유동장의 예측)

  • O, Gun-Seop;An, Guk-Yeong;Kim, Yong-Mo;Lee, Chang-Sik
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.20 no.7
    • /
    • pp.2386-2396
    • /
    • 1996
  • A numerical simulation has been performed for isothermal and reacting flows in an axisymmetric, bluff-body research combustor. The present formulation is based on the density-weighted averaged Navier-Stokes equations together with a k-epsilon turbulence model and a modified eddy-breakup combustion model. The PISO algorithm is employed to solve the Navier-Stokes system. Measurements and predictions are compared for the centerline axial velocities, the location of stagnation points, the strength of the recirculation zone, and the temperature profile. Even though the numerical simulation gives acceptable agreement with experimental data in many respects, the present model is deficient in predicting the recovery rate of the central near-wake region, the non-isotropic turbulence effects, and the variation of the turbulent Schmidt number. Several possible explanations for these discrepancies are discussed.

Characteristics of Park Program Operation of Seoul Metropolitan Government (서울시의 공원 프로그램 운영 특성)

  • Cho, Yun Joo;Chae, Young;Wee, Man-Gyu;Jung, Sang Hak;Song, Hyeong Nam;Kim, Yun-Geum
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.2
    • /
    • pp.10-19
    • /
    • 2020
  • Park programs can adeptly respond to the diversification of leisure needs as times change, and they bring users and parks closer together. For this reason, the Seoul Metropolitan Government has operated a variety of programs, beginning with the Botanical Class Program at the Namsan Outdoor Botanical Garden in 1997. The government also began to organize park programs by establishing the Park and Leisure Department and three Park Greenery Offices. However, research on park programs has focused mainly on park users. This study therefore intends to reveal the structure of the programs by studying how they are operated. The specific purposes of this study are to: 1. review the institutional characteristics that underlie the operation of park programs in Seoul by examining the relevant laws, the operating organizations, and the personnel composition; 2. analyze the operation methods, such as procurement and execution of the programs, operation costs, and public-private cooperation methods; and 3. analyze the composition and contents of the programs from 2015 to 2017 and identify the relationship between the structure of program operation and the programs themselves. The results are summarized as follows. First, regarding the operational structure, the supporting laws were not systematic, but the operating organization was working to establish a system. Second, regarding operation, most of the budget was funded by local governments, but the level of citizen involvement was low. Third, regarding the programs themselves, the number of programs increased, but they were concentrated on specific themes and few programs actively used the park facilities. Based on the results, three tasks can be proposed. The first is that the 'Act on Parks and Green Spaces' should include the concept of, and support for, park programs. 
Second, there is a need to change from the ideas of the quantitative increase of programs to qualitative improvements. Lastly, it is necessary to reorganize the Green Seoul Bureau of the Seoul Metropolitan Government into a citizen-led and leisure-oriented organization to promote the park leisure culture. This study has significance, as it was conducted with a service provider, not a program user, unlike many previous park program-related studies. The results of this study will be able to contribute not only to the Seoul Metropolitan Government, but also to other local governments to suggest the direction of the management and the operation of the park for the consumer, and consequently, it will help prepare the long-term vision of parks as the closest leisure location for most citizens.

A Study on a Type of Regeneration Project on Old Industrial Complex (노후산업단지 재생사업 추진 유형에 관한 연구)

  • Kim, Joo-hoon;Byun, Byung-seol
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.21 no.2
    • /
    • pp.192-211
    • /
    • 2018
  • Given the significant influence of old industrial complexes, the Ministry of Land, Infrastructure and Transport chose 4 districts for the first pilot project in September 2009. In December 2014, the second pilot project districts were established. In addition, 10 districts and 5 districts were designated in April 2016 as the third pilot project, and 5 districts in March 2017 as the fourth pilot project. To promote the smooth operation of regeneration projects, the effective area designation and special system were introduced, as stipulated in Article 39.12-13 of the Industrial Location and Development Act revised in May 2015. The effective area is a means of promoting the spread of regeneration projects by making their results visible: the project is carried out effectively and in consideration of the geographical features of the region and its industry groups. Because setting an effective area unreasonably can delay the regeneration project and increase the likelihood of many problems, criteria and classifications for planning, and objective promotion methods suited to the individual characteristics of each aged industrial complex, are needed. Data Envelopment Analysis (DEA) and a database of old industrial complexes were constructed and used to classify the types of regeneration projects. This study therefore examines the correlation between the diagnosis of 83 aged industrial complexes, the regeneration projects supported by the Ministry of Land, and the types of project promotion for aged industrial complexes, with the aim of strengthening their competitiveness. The results can be used as a guideline for assessing project feasibility.

A Study on Seismic Liquefaction Risk Map of Electric Power Utility Tunnel in South-East Korea (국내 동남권 지역의 전력구 지반에 대한 지진시 액상화 위험도 작성 연구)

  • Choi, Jae-soon;Park, Inn-Joon;Hwang, Kyengmin;Jang, Jungbum
    • Journal of the Korean GEO-environmental Society
    • /
    • v.19 no.10
    • /
    • pp.13-19
    • /
    • 2018
  • Following the 2016 Gyeongju earthquake, the Pohang earthquake occurred in 2017, and the south-east region of Korea remains under the threat of earthquakes. In the Pohang earthquake in particular, liquefaction occurred in coastal sedimentary areas, so preparing countermeasures is very important. Soil liquefaction can directly affect underground facilities as well as structures on the ground. It is therefore necessary to identify the liquefaction risk of facilities and structures under possible earthquakes and to prepare countermeasures that minimize the damage. In this study, we investigated the seismic liquefaction risk of the electric power utility tunnels in the south-east region, where earthquakes have recently occurred. The analysis uses an earthquake with a return period of 1,000 years and the liquefaction potential index. The liquefaction risk analysis was conducted in two stages. In the first stage, the liquefaction risk was analyzed by calculating the liquefaction potential index from ground survey data at the locations of the electric power utility tunnels in the south-east region. Seismic amplification in the soil layers was taken into account through a soil amplification factor determined by soil classification. In the second stage, site response analyses with 3 input earthquake records were performed for the locations judged to be dangerous in the first stage, and the final liquefaction potential index was recalculated. The site investigation data were taken from the National Geotechnical Information DB Center. 
Finally, the proposed two-stage assessment of liquefaction risk was found to be reasonable and effective: a macro assessment of the underground facilities, including the electric power utility tunnels in Korea, is carried out in the first stage, and a second assessment with site response analysis is then performed for the regions found to be dangerous in the first stage.
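The two-stage screening can be sketched in code. The abstract does not state which form of the liquefaction potential index (LPI) was used, so the sketch below assumes the widely used Iwasaki-type definition, in which severity F = 1 - FS (for FS < 1, otherwise 0) is integrated over the top 20 m with a depth weight w(z) = 10 - 0.5z. The site names, layer data, and the screening threshold of 5 are hypothetical.

```python
def depth_weight(z):
    """Assumed Iwasaki-type weight: 10 at the surface, falling linearly to 0 at 20 m."""
    return max(0.0, 10.0 - 0.5 * z)

def liquefaction_potential_index(layers):
    """Sum severity x weight x thickness over layers within the top 20 m.

    layers: list of (top_m, bottom_m, factor_of_safety). The weight is
    evaluated at the layer midpoint, which is exact for a linear weight.
    """
    lpi = 0.0
    for top, bottom, fs in layers:
        bottom = min(bottom, 20.0)
        if bottom <= top:
            continue  # layer lies entirely below 20 m
        severity = 1.0 - fs if fs < 1.0 else 0.0
        mid = 0.5 * (top + bottom)
        lpi += severity * depth_weight(mid) * (bottom - top)
    return lpi

# Hypothetical stage-1 input: soil layers per tunnel site.
sites = {
    "site_a": [(0.0, 4.0, 1.2), (4.0, 10.0, 0.5), (10.0, 20.0, 1.1)],
    "site_b": [(0.0, 20.0, 1.3)],
}

stage1_lpi = {name: liquefaction_potential_index(layers) for name, layers in sites.items()}

# Sites above the (hypothetical) threshold go on to stage-2 site response analysis.
STAGE2_THRESHOLD = 5.0
stage2_sites = [name for name, lpi in stage1_lpi.items() if lpi > STAGE2_THRESHOLD]
print(stage1_lpi, stage2_sites)
```

In the second stage, the factors of safety for the flagged sites would be recomputed from site response analysis and passed through the same index function.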

The Behavior Analysis of Exhibition Visitors using Data Mining Technique at the KIDS & EDU EXPO for Children (유아교육 박람회에서 데이터마이닝 기법을 이용한 전시 관람 행동 패턴 분석)

  • Jung, Min-Kyu;Kim, Hyea-Kyeong;Choi, Il-Young;Lee, Kyoung-Jun;Kim, Jae-Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.2
    • /
    • pp.77-96
    • /
    • 2011
  • An exhibition is defined as a market event of a specific duration at which exhibitors present their main products to business or private visitors, and it plays a key role as an effective marketing channel. As the importance of exhibitions has grown, the domestic exhibition industry has achieved great quantitative growth. In contrast to this quantitative growth, however, the qualitative growth of exhibitions has lagged behind. To improve the quality of exhibitions, we need to understand the preferences and behavioral characteristics of visitors and, through that understanding, increase their attention and satisfaction. In this paper, we therefore used the observation survey method, a kind of field research, to understand visitors and collect real data for the analysis of behavior patterns. This research proposes a methodology framework consisting of three steps. The first step is to select a suitable exhibition to which our method can be applied. The second step is to carry out the observation survey and collect real data for further analysis. In this paper, we conducted the observation survey at the KIDS & EDU EXPO for Children in SETEC, covering 160 visitors and 78 booths from November 4th to 6th, 2010. The last step is to analyze the data recorded through observation. In this step, we first analyze the features of the exhibition using the demographic characteristics collected by the observation survey, and then analyze individual booth features from the records of visited booths. Through the analysis of individual booth features, we can determine what kinds of events attract visitors' attention and what kinds of marketing activities affect visitors' behavior patterns. 
However, since previous research considered only the individual features influenced by the exhibition, the correlation among features has not been studied much. In this research, an additional analysis using data mining techniques is therefore carried out to supplement the existing research. We analyze the relations among booths to identify the behavior patterns of visitors, using two data mining techniques: clustering analysis and ARM (Association Rule Mining) analysis. In the clustering analysis, we use the K-means algorithm to determine the correlation among booths. The data mining results show that two important features affect visitors' behavior patterns in an exhibition. One is the geographical features of booths. The other is the exhibit contents of booths. These features should be considered when the organizer plans the next exhibition. The results of our analysis are therefore expected to provide guidelines for understanding visitors and valuable insights from the earliest phases of exhibition planning. This research could also help increase visitor satisfaction. Visitors' movement paths, booth locations, and distances between booths should be considered in advance when planning the next exhibition. This research was conducted at the KIDS & EDU EXPO for Children in SETEC (Seoul Trade Exhibition & Convention), so there are some constraints on applying it directly to other exhibitions. In addition, the results were derived from a limited number of data samples. To obtain more accurate and reliable results, more experiments are needed with larger data samples and with exhibitions of a variety of genres.
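The association rule analysis of booth visits can be illustrated with a small sketch. It computes support and confidence for booth pairs from visit records; the booth IDs, the four visit records, and the thresholds are hypothetical, and the brute-force pair enumeration is a simplification rather than the algorithm used in the paper.

```python
from itertools import combinations

# Hypothetical visit records: the set of booths each observed visitor stopped at.
visits = [
    {"B01", "B02", "B03"},
    {"B01", "B02"},
    {"B02", "B03"},
    {"B01", "B02", "B04"},
]

def support(itemset, transactions):
    """Fraction of visitors whose record contains every booth in `itemset`."""
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Share of visitors to `antecedent` booths who also visited `consequent`."""
    return support(antecedent | consequent, transactions) / support(antecedent, transactions)

def pair_rules(transactions, min_support=0.5, min_confidence=0.8):
    """Enumerate rules X -> Y over single booths that pass both thresholds.

    min_support must be positive so the antecedent support is never zero.
    """
    booths = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(booths, 2):
        for x, y in ((a, b), (b, a)):
            s = support({x, y}, transactions)
            if s >= min_support:
                c = confidence({x}, {y}, transactions)
                if c >= min_confidence:
                    rules.append((x, y, s, c))
    return rules

rules = pair_rules(visits)
for x, y, s, c in rules:
    print(f"{x} -> {y}: support={s:.2f}, confidence={c:.2f}")
```

A rule such as B03 -> B02 with high confidence would suggest that the two booths are linked, whether by physical proximity or by related exhibit content, which are the two features the study highlights.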

Evaluation of Electron Boost Fields based on Surgical Clips and Operative Scars in Definitive Breast Irradiation (유방보존술 후 방사선치료에서 수술 흉터와 삽입된 클립을 이용한 전자설 추가 방사선 조사야 평가)

  • Lee, Re-Na;Chung, Eun-Ah;Lee, Ji-Hye;Suh, Hyun-Suk
    • Radiation Oncology Journal
    • /
    • v.23 no.4
    • /
    • pp.236-242
    • /
    • 2005
  • Purpose: To evaluate the role of surgical clips and scars in determining the electron boost field for early stage breast cancer patients undergoing conserving surgery and postoperative radiotherapy, and to provide an optimal method for drawing the boost field. Materials and Methods: Twenty patients who had 4 to 7 surgical clips in the excision cavity were selected for this study. Depth information was obtained to determine the electron energy by measuring the distance from the skin to the chest wall (SCD) and to the clip implanted in the most posterior area of the tumor bed. Three different electron fields were outlined on a simulation film. The radiological tumor bed was determined by connecting all the clips implanted during surgery. The clinical field (CF) was drawn by adding a 3 cm margin around the surgical scar. The surgical field (SF) was drawn by adding a 2 cm margin around the surgical clips, and the ideal field (IF) was outlined by adding a 2 cm margin around both scar and clips. These fields were digitized into our planning system to measure the area of each field, and the areas of the three electron boost fields were compared. Finally, the surgical clips were contoured on axial CT images and a dose volume histogram was plotted to investigate 3-dimensional coverage of the clips. Results: The average depth difference between the SCD and the maximal clip location was $0.7 \pm 0.55$ cm. A difference of 5 mm or more was seen in 12 patients. The average shifts between the borders of the scar and the clips were 1.7, 1.2, 1.2, and 0.9 cm in the superior, inferior, medial, and lateral directions, respectively. The area of the CF was larger than the SF and IF in 6/20 patients. In 15/20 patients, the area difference between the SF and IF was less than 5%. One to three clips were seen outside the CF in 15/20 patients. In addition, dosimetrically inadequate coverage of the clips (less than 80% of the prescribed dose) was observed in 17/20 patients when the CF was used as the boost field. 
Conclusion: The electron field determined from the clinical scar significantly underestimates the tumor bed in the superior-inferior direction and thereby underdoses the tissue at risk. The electron field obtained from surgical clips alone does not cover the entire scar properly. Consequently, our technique, which combines the surgical clips and clinical scars in determining the electron boost field, proved effective in minimizing geographical miss as well as normal tissue complications.