• Title/Summary/Keyword: numerical model verification (수치모델 검증)

A personalized TV service under Open network environment (개방형 환경에서의 개인 맞춤형 TV 서비스)

  • Lye, Ji-Hye;Pyo, Sin-Ji;Im, Jeong-Yeon;Kim, Mun-Churl;Lim, Sun-Hwan;Kim, Sang-Ki
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2006.11a / pp.279-282 / 2006
  • IPTV broadcasting over IP networks has been recognized as a new revenue model, and Korean carriers such as KT and SKT are currently preparing or running IPTV pilot services. Unlike earlier one-way broadcasting, IPTV aims at two-way broadcasting that emphasizes interaction with the user, so an innovative broadcasting service unlike anything offered so far is expected. However, although it appears that many telecommunication companies and broadcasters could take part in IPTV services, in reality a few large telecommunication companies run a limited business aimed at the subscribers of their own networks. This is because the infrastructure for IPTV services has not been built and there are far too many protocols that a service developer must know in order to satisfy the concept of a broadcasting-telecommunication convergence network. This paper therefore proposes Open API as a means of overcoming this situation. Scenarios for personalized broadcasting were reconstructed with reference to the TV-Anytime benchmarking and user scenarios, and from these scenarios the basic yet powerful functions of the convergence network for IPTV broadcasting services were defined as Open API functions. The broadcasting services considered here are NDR, EPG, and personalized advertisement services. The server for each service resides on the converged network, and because the APIs these servers expose are meant to be used by other applications, they define the most basic functions. In addition, the services were verified by implementing a personalized broadcasting application with the proposed Open API functions. The Open APIs are functions published through web services and can be used from other networks through a gateway. Each Open API function definition consists of a function name, a description of its function, and its input/output parameters. The detailed user information and detailed content information delivered for personalized services are defined using the metadata schema defined by the TV-Anytime Forum.

Re-validation of the Revised Systems Thinking Measuring Instrument for Vietnamese High School Students and Comparison of Latent Means between Korean and Vietnamese High School Students (베트남 고등학생을 대상으로 한 개정 시스템 사고 검사 도구 재타당화 및 한국과 베트남 고등학생의 잠재 평균 비교)

  • Hyonyong Lee;Nguyen Thi Thuy;Byung-Yeol Park;Jaedon Jeon;Hyundong Lee
    • Journal of the Korean earth science society / v.45 no.2 / pp.157-171 / 2024
  • The purposes of this study were: (1) to revalidate the revised Systems Thinking Measuring Instrument (Re_STMI) reported by Lee et al. (2024) among Vietnamese high school students and (2) to investigate the differences in systems thinking abilities between Korean and Vietnamese high school students. To achieve this, data from 234 Vietnamese high school students who responded to translated Re_STMI consisting of 20 items and an Scale consisting of 20 items were used. Validity analysis was conducted through item response analysis (Item Reliability, Item Map, Infit and Outfit MNSQ, DIF between male and female) and exploratory factor analysis (principal axis factor analysis using Promax). Furthermore, structural equation modeling was employed with data from 475 Korean high school students to verify the latent mean analysis. The results were as follows: First, in the item response analysis of the 20 translated Re_STMI items in Vietnamese, the Item Reliability was .97, and the Infit MNSQ ranged from .67 to 1.38. The results from the Item Map and DIF analysis align with previous findings. In the exploratory factor analysis, all items were loaded onto intended sub-factors, with sub-factor reliabilities ranging from .662 to .833 and total reliability at .876. Confirmatory factor analysis for latent mean analysis between Korean and Vietnamese students yielded acceptable model fit indices (χ2/df: 2.830, CFI: .931, TLI: .918, SRMR: .043, RMSEA: .051). Lastly, the latent mean analysis between Korean and Vietnamese students revealed a small effect size in systems analysis, mental models, team learning, and shared vision factors, whereas a medium effect size was observed in personal mastery factors, with Vietnamese high school students showing significantly higher results in systems thinking. This study confirmed the reliability and validity of the Re_STMI items. 
Furthermore, international comparative studies on systems thinking using Re_STMI translated into Vietnamese, English, and other languages are warranted in the context of students' systems thinking analysis.

How to improve the accuracy of recommendation systems: Combining ratings and review texts sentiment scores (평점과 리뷰 텍스트 감성분석을 결합한 추천시스템 향상 방안 연구)

  • Hyun, Jiyeon;Ryu, Sangyi;Lee, Sang-Yong Tom
    • Journal of Intelligence and Information Systems / v.25 no.1 / pp.219-239 / 2019
  • As providing customized services to individuals grows in importance, research on personalized recommendation systems is being carried out continuously. Collaborative filtering is one of the most popular approaches in academia and industry. However, it has a limitation in that recommendations have mostly been based on quantitative information such as users' ratings, which lowers accuracy. To address this problem, many studies have tried to improve the performance of recommendation systems by using information other than quantitative ratings. A good example is the use of sentiment analysis on customer review text. Nevertheless, existing research has not directly combined the results of sentiment analysis with quantitative rating scores in the recommendation system. This study therefore aims to reflect the sentiments expressed in reviews in the rating scores. In other words, we propose a new algorithm that converts a user's own review into quantitative information and reflects it directly in the recommendation system. To do this, we needed to quantify users' reviews, which are originally qualitative information. In this study, sentiment scores were calculated using sentiment analysis, a text mining technique. Movie reviews were used as the data. Based on these data, a domain-specific sentiment dictionary was constructed for movie reviews. Regression analysis was used to construct the sentiment dictionary: positive and negative dictionaries were each constructed using Lasso regression, Ridge regression, and ElasticNet. The accuracy of each constructed dictionary was verified with a confusion matrix. The accuracy of the Lasso-based dictionary was 70%, that of the Ridge-based dictionary was 79%, and that of the ElasticNet (${\alpha}=0.3$) dictionary was 83%.
Therefore, in this study, the sentiment score of each review is calculated with the ElasticNet-based dictionary and combined with the rating to create a new rating. In this paper, we show that collaborative filtering that reflects the sentiment scores of user reviews is superior to the traditional method that considers only the existing rating. To show this, the proposed algorithm was applied to memory-based user-based collaborative filtering (UBCF), item-based collaborative filtering (IBCF), and the model-based matrix factorization methods SVD and SVD++. For each algorithm, the mean absolute error (MAE) and the root mean square error (RMSE) were calculated to compare a recommendation system that uses the rating combined with sentiment scores against one that considers ratings only. With MAE as the evaluation index, the improvement was 0.059 for UBCF, 0.0862 for IBCF, 0.1012 for SVD, and 0.188 for SVD++. With RMSE as the evaluation index, the corresponding figures were 0.0431 for UBCF, 0.0882 for IBCF, 0.1103 for SVD, and 0.1756 for SVD++. The prediction performance of the rating that reflects the proposed sentiment score is thus superior to that of the conventional rating. In other words, collaborative filtering that reflects the sentiment score of the user review shows better accuracy than conventional collaborative filtering that considers only the quantitative score. A paired t-test was then used to confirm that the proposed model is the better approach. To overcome the limitation of previous research that judged a user's sentiment only by the quantitative rating score, this study quantified the review numerically so that the user's opinion is reflected in the recommendation system in a more refined way, improving accuracy.
The findings of this study are expected to have managerial implications for recommendation system developers, who need to consider both quantitative and qualitative information. The way the combined system is constructed in this paper could be used directly by such developers.
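The two evaluation indices named in the abstract, MAE and RMSE, can be sketched as follows. This is a minimal illustration, not the authors' code. The abstract does not state how the sentiment score is combined with the rating, so `blend_rating` and its `weight` parameter are hypothetical, and the sample ratings are made-up toy values.

```python
import math


def blend_rating(rating, sentiment, weight=0.5):
    """Hypothetical blend: weighted average of a star rating and a sentiment
    score assumed to be on the same scale (the abstract gives no formula)."""
    return (1 - weight) * rating + weight * sentiment


def mae(actual, predicted):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


# Toy values for illustration only.
actual = [4.0, 3.0, 5.0, 2.0]
predicted = [3.5, 3.0, 4.0, 3.0]

print(mae(actual, predicted))   # 0.625
print(rmse(actual, predicted))  # 0.75
```

Lower values of either index mean better predictions, so the improvements reported in the abstract are reductions in error.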

Assessment and Prediction of Stand Yield in Cryptomeria japonica Stands (삼나무 임분수확량 평가 및 예측)

  • Son, Yeong Mo;Kang, Jin Taek;Hwang, Jeong Sun;Park, Hyun;Lee, Kang Su
    • Journal of Korean Society of Forest Science / v.104 no.3 / pp.421-426 / 2015
  • The objective of this paper is to examine the growth of Cryptomeria japonica stands in South Korea and to evaluate their yields, carbon stocks, and carbon removals. A total of 106 sample plots were selected from Jeonnam, Gyeongnam, and Jeju, where stands of the species are grown. After excluding outliers, data from 92 plots were used. The Weibull diameter distribution was applied as part of the analysis. To estimate the diameter distribution, growth estimation equations were derived for each growth factor, including height, diameter at breast height, and basal area, and each equation was verified. A site index for assessing the forest productivity of Cryptomeria japonica stands in each district was also developed using a Schumacher model, with 30 years as the reference age. The site index for Cryptomeria japonica stands in South Korea was found to range from 10 to 16, and this result was used as the basis for developing the stand yield table. For site index 14 in the stand yield table, the mean annual increment (MAI) of Cryptomeria japonica reaches $7.6m^3/ha$ at age 25, and its growing stock is estimated at $190.1m^3/ha$. This volume is about $20m^3$ higher than that of Chamaecyparis obtusa. Furthermore, the annual carbon absorption of a Cryptomeria japonica stand peaks at age 25 at 2.14 tC/ha/yr, or $7.83tCO_2/ha/yr$. Compared with other conifers, this rate is slightly higher than that of Chamaecyparis obtusa ($7.5tCO_2/ha/yr$) but lower than those of Pinus koraiensis ($10.4tCO_2/ha/yr$) and Larix kaempferi ($11.2tCO_2/ha/yr$). Based on these results, ways to increase the use of Cryptomeria japonica as timber should be developed, in addition to making use of its growth data.
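The arithmetic behind two of the reported figures can be checked with a short sketch. It assumes that MAI is the growing stock divided by stand age and that carbon is converted to CO2 with the molar mass ratio 44/12. Both are standard conventions, but the abstract does not spell out the authors' exact procedure, and the function names are illustrative.

```python
CO2_PER_C = 44.0 / 12.0  # molar mass ratio of CO2 to C


def mean_annual_increment(growing_stock_m3_per_ha, age_years):
    """MAI as growing stock divided by stand age (assumed definition)."""
    return growing_stock_m3_per_ha / age_years


def carbon_to_co2(tonnes_c):
    """Convert tonnes of carbon to tonnes of CO2."""
    return tonnes_c * CO2_PER_C


mai = mean_annual_increment(190.1, 25)  # figures taken from the abstract
co2 = carbon_to_co2(2.14)

print(round(mai, 1))  # 7.6
print(round(co2, 2))  # 7.85
```

The MAI matches the reported $7.6m^3/ha$. The CO2 figure comes out at about 7.85 rather than the reported 7.83, a gap that is presumably due to rounding of the 2.14 tC/ha/yr input.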

The Construction of QoS Integration Platform for Real-time Negotiation and Adaptation Stream Service in Distributed Object Computing Environments (분산 객체 컴퓨팅 환경에서 실시간 협약 및 적응 스트림 서비스를 위한 QoS 통합 플랫폼의 구축)

  • Jun, Byung-Taek;Kim, Myung-Hee;Joo, Su-Chong
    • The Transactions of the Korea Information Processing Society / v.7 no.11S / pp.3651-3667 / 2000
  • Recently, in Internet-based distributed multimedia environments, most researchers have focused on two rapidly growing technologies: streaming technology and distributed object technology. In particular, studies that try to integrate streaming services with distributed object technology have been progressing. These technologies are applied to various stream service managements and protocols. However, the stream service management models proposed by existing research are insufficient for supporting the QoS of stream services. In addition, the existing models cannot support extensibility and reusability when QoS-related functions are developed as sub-modules tailored to specific-purpose application services. To solve these problems, this paper proposes a QoS integration platform that can be extended and reused by means of distributed object technologies and that guarantees the QoS of stream services. The proposed platform consists of three components: the User Control Module (UCM), the QoS Management Module (QoSM), and the Stream Object. The Stream Object has Send/Receive operations for transmitting RTP packets over TCP/IP. The User Control Module (UCM) controls Stream Objects via the CORBA service objects. The QoS Management Module (QoSM) maintains the QoS of the stream service between the UCMs on the client and the server. As QoS control methodologies, procedures for resource monitoring, negotiation, and resource adaptation are executed through interactions among the components mentioned above. To construct this QoS integration platform, we first implemented these modules independently and then used IDL to define the interfaces among them, so that the platform supports platform independence, interoperability, and portability based on CORBA.
This platform was constructed using OrbixWeb 3.1c, which follows the CORBA specification, on Solaris 2.5/2.7, together with the Java language, Java, Java Media Framework API 2.0, Mini-SQL1.0.16, and multimedia equipment. To verify the platform functionally, we showed the execution results of each module mentioned above and numerical data obtained from the QoS control procedures on the client and server GUIs while a stream service was running on our platform.

Tectonic Movement in the Korean Peninsula (II): A Geomorphological Interpretation of the Spatial Distribution of Earthquakes (한반도의 지반운동 (II): 한반도 지진분포의 지형학적 해석)

  • Park, Soo-Jin
    • Journal of the Korean Geographical Society / v.42 no.4 / pp.488-505 / 2007
  • The purposes of this research are twofold: 1) to verify spatial differences in tectonic movement using the spatial distribution of earthquakes, and 2) to infer the mechanisms that generate the spatial accumulation patterns of earthquakes in the Korean Peninsula. The first part of this sequential paper (Park, 2007) argues that the Korean Peninsula consists of four geostructural regions in which tectonic deformation and the consequent geomorphological development patterns differ from each other. Since this conclusion was drawn from terrain analyses alone, it is necessary to verify it using other, independent geophysical data. Because earthquakes result from the movement and deformation of land masses moving in different directions, the distribution of earthquake epicenters may be used to identify the direction and rate of land mass movement. This paper first analysed the spatial distribution of earthquakes using spatial statistics and then compared the results with the spatial arrangement of the geostructural regions. The spatial distribution of earthquakes in the Korean Peninsula can be summarized as follows. Firstly, the intensity of earthquakes shows only weak spatial dependency and differs greatly even between adjacent regions. Secondly, the epicenter distribution has a clear spatial accumulation pattern, even though the intensity of earthquakes shows a random pattern. Thirdly, the high-density area of earthquakes shows a clear 'L' shape, passing through Pyeongannam-do, centered at Pyeongyang, and Hwanghae-do, Seosan and Pohang. The correlation coefficient between the density of earthquakes and distance from geostructural region boundaries is much higher than those between the density of fault lines and distance from tectonic division boundaries.
Since fault lines and tectonic divisions in the Korean Peninsula are the results of long-term geological development, there is an apparent scale discrepancy that makes it difficult to find significant correlations with earthquakes. This result verifies the research hypothesis that the Korean Peninsula is divided into four geostructural regions, each with its own direction of movement and spatial deformation characteristics. The existence of geostructural regions is also supported by the movement patterns of land masses estimated from GPS measurements. This conclusion is expected to provide a new perspective for understanding geomorphological development and earthquake occurrence in the Korean Peninsula.

A Basic Study on Spatial Configuration of Gang-jin Nongsanbyeoleop (강진 농산별업(農山別業)의 공간구성에 대한 기초 연구)

  • Seo, Dong-Il;Lee, Jae-Keun
    • Journal of the Korean Institute of Traditional Landscape Architecture / v.30 no.2 / pp.64-71 / 2012
  • This is a basic study for recovering the original form of Nongsanbyeoleop(農山別業) in Gangjin, Jeonnam, which was created in the latter part of the Joseon period. The original form at the time of creation was estimated by analyzing related literature and inspecting the actual site. "Joseokruki(朝夕樓記)" by Dasan Jung, Yak Yong made it possible to estimate the spatial structure and manner of use of Nongsanbyeoleop, and the spatial arrangement described in the literature could be confirmed by on-site inspection. The results of this study are as follows. First, Nongsanbyeoleop arranged its spatial factors by making use of the natural topography. As for the spatial characteristics of Nongsanbyeoleop, the location of the ancestral ritual space, which includes the deceased father's tomb and tomb house and lies 1.9km from the main levee of Yun, Kwang Taek, the father of Yun, Seo Yu, and housekeeping could be confirmed. Second, spatial estimation based on "Joseokruki" was possible. "Joseokruki" describes Joseokru, Youngmojae, Hanokkwan, Cheokyunjung, and Sangam as construction factors; Wundang, Kookdan, and Nokwunoh as plant factors; Sookyunggan, Keumkoji, Nokeumjung, and Uijanghae as hydroponic factors; and Pyoeunkok and Aengjakang as natural topography factors. However, most of them have disappeared, and at present only Youngmojae, Keumgoji, Kukdan, and Wundang show traces of the past. Third, the space of Nongsanbyeoleop has changed, and the reason was examined. The space surrounding Nongsanbyeoleop was leveled by land arrangement in the 1960s, and this acted as topographical damage, because Nongsanbyeoleop is recognized as a planar factor that includes its surrounding landscape rather than as a point factor. Fourth, the actual measurement of Nongsanbyeoleop and the digitization of the manually produced numerical map are judged sufficient to serve as basic material for recovering the garden in the future.
Because of the distance changing method applied at that time, the garden recovery of Nongsanbyeoleop is intended to be made concrete, and the 3D model built from the digitized basic materials is considered applicable to multilateral studies. Thus, Nongsanbyeoleop, a byeolseo that includes the tomb of the deceased father and is based on the concept of hyo, shows clear differences from the component factors of Byolseowonrim in preceding studies, and its importance as Byolseowonrim is sufficient. However, the fact that the disappeared spatial factors and the accurate locations of the construction factors cannot be known was a limitation of this study. In the future, the original form must be clearly verified through excavation that can confirm the locations of the construction factors.

A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems / v.26 no.1 / pp.1-21 / 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data, which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, text mining has been employed to discover new market and/or technology opportunities and to support rational decision making by business participants. Market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been continuous demand in various fields for market information at the level of specific products. However, such information has generally been provided at the industry level or in broad categories based on classification standards, making it difficult to obtain specific and proper information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than those previously offered. We applied the Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows: First, data related to product information is collected, refined, and restructured into a form suitable for applying the Word2Vec model. Next, the preprocessed data is embedded into vector space by Word2Vec, and product groups are then derived by extracting similar product names based on cosine similarity. Finally, the sales data on the extracted products is summed to estimate the market size of the product groups. As experimental data, text data of product names from Statistics Korea's microdata (345,103 cases) were mapped into a multidimensional vector space by Word2Vec training.
We performed parameter optimization for training and then applied a vector dimension of 300 and a window size of 15 as the optimized parameters for further experiments. We employed index words of the Korean Standard Industry Classification (KSIC) as a product name dataset to cluster product groups more efficiently. The product names similar to KSIC indexes were extracted based on cosine similarity. The market size of the extracted products, treated as one product category, was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For performance verification, the results were compared with the actual market sizes of some items. The Pearson's correlation coefficient was 0.513. Our approach has several advantages over previous studies. First, text mining and machine learning techniques were applied to market size estimation for the first time, overcoming the limitations of traditional methods that rely on sampling or require multiple assumptions. In addition, the level of market category can be easily and efficiently adjusted according to the purpose of information use by changing the cosine similarity threshold. Furthermore, the approach has high potential for practical application since it can resolve unmet needs for detailed market size information in the public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support programs conducted by governmental institutions, as well as in business strategy consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic-based word embedding module can be advanced by giving a proper order to the preprocessed dataset or by combining another algorithm such as Jaccard similarity with Word2Vec.
Also, the method of product group clustering can be changed to other types of unsupervised machine learning algorithms. Our group is currently working on subsequent studies, and we expect that they can further improve the performance of the basic model conceptually proposed in this study.
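The grouping and summing steps described in the abstract can be sketched as below. This is a hedged illustration rather than the authors' pipeline: the two-dimensional vectors stand in for trained Word2Vec embeddings, the product names and sales figures are made-up toy values, and `market_size` and `pearson` are hypothetical helper names.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def market_size(index_vec, products, threshold):
    """Sum the sales of products whose embedding is at least `threshold`
    similar to the embedding of the index word."""
    return sum(sales for vec, sales in products.values()
               if cosine(index_vec, vec) >= threshold)


def pearson(x, y):
    """Pearson correlation, as used to compare estimates with actual sizes."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Toy embeddings and sales (illustrative only, not from the paper).
index_vec = [1.0, 0.0]
products = {
    "product_a": ([0.9, 0.1], 100.0),
    "product_b": ([0.6, 0.8], 50.0),
    "product_c": ([0.0, 1.0], 70.0),
}

print(market_size(index_vec, products, threshold=0.9))  # 100.0
print(market_size(index_vec, products, threshold=0.5))  # 150.0
```

Lowering the threshold widens the product group, which is how the abstract describes adjusting the level of the market category.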

Development of Yóukè Mining System with Yóukè's Travel Demand and Insight Based on Web Search Traffic Information (웹검색 트래픽 정보를 활용한 유커 인바운드 여행 수요 예측 모형 및 유커마이닝 시스템 개발)

  • Choi, Youji;Park, Do-Hyung
    • Journal of Intelligence and Information Systems / v.23 no.3 / pp.155-175 / 2017
  • As social data come into the spotlight, mainstream web search engines provide data indicating how many people searched for a specific keyword: web search traffic data. Web search traffic information is the aggregate of everyone who searched for a specific keyword. In various areas, web search traffic can be used as a useful variable that represents the attention of ordinary users to specific interests. Many studies use web search traffic data to nowcast or forecast social phenomena, in applications such as epidemic prediction, consumer pattern analysis, product life cycles, and financial investment modeling. Web search traffic data have also begun to be applied to predicting inbound tourists. Proper demand prediction is needed because tourism is a high value-added industry that increases employment and foreign exchange earnings. Among these tourists, Chinese tourists, known as Youke, are continuously growing in number; Youke have been the largest inbound tourist group in Korean tourism for many years, and the tourism profit per Youke is likewise the largest. Research into proper demand prediction approaches for Youke is therefore important in both the public and private sectors. Accurate tourism demand prediction is important for efficient decision making with limited resources. This study suggests an improved model that reflects the latest social issues by capturing the attention of groups of individuals. Traveling abroad is generally a high-involvement activity, so potential tourists are likely to search in depth for information about their trip. Web search traffic data present tourists' attention during trip preparation in an instantaneous and dynamic way. This study therefore attempted to select keywords that potential Chinese tourists are likely to search for on the Internet. Baidu, the largest web search engine in China with a share of over 80%, gives users access to web search traffic data.
Qualitative interviews with potential tourists helped us understand information search behavior before a trip and identify the keywords for this study. The selected web search traffic keywords were classified into three levels according to how directly they relate to "Korean Tourism". This classification helps to find out which keywords can explain Youke inbound demand, from the closest category to the farthest. Web search traffic data for each keyword were gathered by a web crawler developed to crawl web search data on Baidu Index. Using the automatically gathered variables, a linear model was built by multiple regression analysis, which is suitable for operational use in decision and policy making because the relationships between variables are easy to explain. After the linear regression models were built, a model composed of traditional variables was compared with a model that adds web search traffic variables to the traditional model, in terms of significance and R squared. After the performance of the models was compared, the final model was composed. The final regression model has improved explanatory power and offers real-time immediacy and convenience compared with the traditional model. Furthermore, this study demonstrates a system that visualizes the results intuitively for general use: the Youke Mining solution has several functions that support tourism decision making, including the embedded final regression model. The Youke Mining solution has an algorithm based on data science and a well-designed, simple interface. Finally, this research has implications in three respects: theoretical, practical, and policy. Theoretically, the Youke Mining system and the model in this research are a first step in Youke inbound prediction using an interactive and instant variable, web search traffic information, which represents tourists' attention while they prepare their trip. Baidu accounts for more than 80% of the web search engine market.
Practically, Baidu data can represent the attention of potential tourists preparing their own trips in real time. Finally, in policy terms, the Chinese tourist demand prediction model based on web search traffic can be used in tourism decision making for efficient resource management and for optimizing opportunities for successful policy.
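The model comparison described in the abstract, a regression on traditional variables against one that adds a web search traffic variable, can be sketched with ordinary least squares and R squared. This is a minimal sketch under stated assumptions: the abstract also compares significance, which is omitted here, and all numbers and variable names below are made-up toy values, not the study's data.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_r2(features, y):
    """Fit OLS with an intercept; return (coefficients, R squared)."""
    X = [[1.0] + list(row) for row in features]
    k = len(X[0])
    xtx = [[sum(r[i] * r[j] for r in X) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(X, y)) for i in range(k)]
    beta = solve(xtx, xty)
    pred = [sum(b * v for b, v in zip(beta, r)) for r in X]
    mean_y = sum(y) / len(y)
    ss_res = sum((t - p) ** 2 for t, p in zip(y, pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y)
    return beta, 1.0 - ss_res / ss_tot


# Toy data: one "traditional" variable, one search-traffic variable.
traditional = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
search = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
arrivals = [9.0, 8.0, 19.0, 18.0, 29.0, 28.0]

_, r2_base = fit_r2([[t] for t in traditional], arrivals)
_, r2_ext = fit_r2([[t, s] for t, s in zip(traditional, search)], arrivals)
print(r2_base < r2_ext)  # True
```

Adding a variable to a nested OLS model can never lower R squared, which is one reason a comparison like the one in the abstract also looks at the significance of the added variables.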