• Title/Summary/Keyword: time prediction

Search Result 5,939, Processing Time 0.087 seconds

Content-based Recommendation Based on Social Network for Personalized News Services (개인화된 뉴스 서비스를 위한 소셜 네트워크 기반의 콘텐츠 추천기법)

  • Hong, Myung-Duk;Oh, Kyeong-Jin;Ga, Myung-Hyun;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.57-71
    • /
    • 2013
  • Over a billion people in the world generate new news minute by minute. People forecasts some news but most news are from unexpected events such as natural disasters, accidents, crimes. People spend much time to watch a huge amount of news delivered from many media because they want to understand what is happening now, to predict what might happen in the near future, and to share and discuss on the news. People make better daily decisions through watching and obtaining useful information from news they saw. However, it is difficult that people choose news suitable to them and obtain useful information from the news because there are so many news media such as portal sites, broadcasters, and most news articles consist of gossipy news and breaking news. User interest changes over time and many people have no interest in outdated news. From this fact, applying users' recent interest to personalized news service is also required in news service. It means that personalized news service should dynamically manage user profiles. In this paper, a content-based news recommendation system is proposed to provide the personalized news service. For a personalized service, user's personal information is requisitely required. Social network service is used to extract user information for personalization service. The proposed system constructs dynamic user profile based on recent user information of Facebook, which is one of social network services. User information contains personal information, recent articles, and Facebook Page information. Facebook Pages are used for businesses, organizations and brands to share their contents and connect with people. Facebook users can add Facebook Page to specify their interest in the Page. The proposed system uses this Page information to create user profile, and to match user preferences to news topics. However, some Pages are not directly matched to news topic because Page deals with individual objects and do not provide topic information suitable to news. Freebase, which is a large collaborative database of well-known people, places, things, is used to match Page to news topic by using hierarchy information of its objects. By using recent Page information and articles of Facebook users, the proposed systems can own dynamic user profile. The generated user profile is used to measure user preferences on news. To generate news profile, news category predefined by news media is used and keywords of news articles are extracted after analysis of news contents including title, category, and scripts. TF-IDF technique, which reflects how important a word is to a document in a corpus, is used to identify keywords of each news article. For user profile and news profile, same format is used to efficiently measure similarity between user preferences and news. The proposed system calculates all similarity values between user profiles and news profiles. Existing methods of similarity calculation in vector space model do not cover synonym, hypernym and hyponym because they only handle given words in vector space model. The proposed system applies WordNet to similarity calculation to overcome the limitation. Top-N news articles, which have high similarity value for a target user, are recommended to the user. To evaluate the proposed news recommendation system, user profiles are generated using Facebook account with participants consent, and we implement a Web crawler to extract news information from PBS, which is non-profit public broadcasting television network in the United States, and construct news profiles. We compare the performance of the proposed method with that of benchmark algorithms. One is a traditional method based on TF-IDF. Another is 6Sub-Vectors method that divides the points to get keywords into six parts. Experimental results demonstrate that the proposed system provide useful news to users by applying user's social network information and WordNet functions, in terms of prediction error of recommended news.

Impact of Semantic Characteristics on Perceived Helpfulness of Online Reviews (온라인 상품평의 내용적 특성이 소비자의 인지된 유용성에 미치는 영향)

  • Park, Yoon-Joo;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.29-44
    • /
    • 2017
  • In Internet commerce, consumers are heavily influenced by product reviews written by other users who have already purchased the product. However, as the product reviews accumulate, it takes a lot of time and effort for consumers to individually check the massive number of product reviews. Moreover, product reviews that are written carelessly actually inconvenience consumers. Thus many online vendors provide mechanisms to identify reviews that customers perceive as most helpful (Cao et al. 2011; Mudambi and Schuff 2010). For example, some online retailers, such as Amazon.com and TripAdvisor, allow users to rate the helpfulness of each review, and use this feedback information to rank and re-order them. However, many reviews have only a few feedbacks or no feedback at all, thus making it hard to identify their helpfulness. Also, it takes time to accumulate feedbacks, thus the newly authored reviews do not have enough ones. For example, only 20% of the reviews in Amazon Review Dataset (Mcauley and Leskovec, 2013) have more than 5 reviews (Yan et al, 2014). The purpose of this study is to analyze the factors affecting the usefulness of online product reviews and to derive a forecasting model that selectively provides product reviews that can be helpful to consumers. In order to do this, we extracted the various linguistic, psychological, and perceptual elements included in product reviews by using text-mining techniques and identifying the determinants among these elements that affect the usability of product reviews. In particular, considering that the characteristics of the product reviews and determinants of usability for apparel products (which are experiential products) and electronic products (which are search goods) can differ, the characteristics of the product reviews were compared within each product group and the determinants were established for each. This study used 7,498 apparel product reviews and 106,962 electronic product reviews from Amazon.com. In order to understand a review text, we first extract linguistic and psychological characteristics from review texts such as a word count, the level of emotional tone and analytical thinking embedded in review text using widely adopted text analysis software LIWC (Linguistic Inquiry and Word Count). After then, we explore the descriptive statistics of review text for each category and statistically compare their differences using t-test. Lastly, we regression analysis using the data mining software RapidMiner to find out determinant factors. As a result of comparing and analyzing product review characteristics of electronic products and apparel products, it was found that reviewers used more words as well as longer sentences when writing product reviews for electronic products. As for the content characteristics of the product reviews, it was found that these reviews included many analytic words, carried more clout, and related to the cognitive processes (CogProc) more so than the apparel product reviews, in addition to including many words expressing negative emotions (NegEmo). On the other hand, the apparel product reviews included more personal, authentic, positive emotions (PosEmo) and perceptual processes (Percept) compared to the electronic product reviews. Next, we analyzed the determinants toward the usefulness of the product reviews between the two product groups. As a result, it was found that product reviews with high product ratings from reviewers in both product groups that were perceived as being useful contained a larger number of total words, many expressions involving perceptual processes, and fewer negative emotions. In addition, apparel product reviews with a large number of comparative expressions, a low expertise index, and concise content with fewer words in each sentence were perceived to be useful. In the case of electronic product reviews, those that were analytical with a high expertise index, along with containing many authentic expressions, cognitive processes, and positive emotions (PosEmo) were perceived to be useful. These findings are expected to help consumers effectively identify useful product reviews in the future.

A Study on Recent Research Trend in Management of Technology Using Keywords Network Analysis (키워드 네트워크 분석을 통해 살펴본 기술경영의 최근 연구동향)

  • Kho, Jaechang;Cho, Kuentae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.101-123
    • /
    • 2013
  • Recently due to the advancements of science and information technology, the socio-economic business areas are changing from the industrial economy to a knowledge economy. Furthermore, companies need to do creation of new value through continuous innovation, development of core competencies and technologies, and technological convergence. Therefore, the identification of major trends in technology research and the interdisciplinary knowledge-based prediction of integrated technologies and promising techniques are required for firms to gain and sustain competitive advantage and future growth engines. The aim of this paper is to understand the recent research trend in management of technology (MOT) and to foresee promising technologies with deep knowledge for both technology and business. Furthermore, this study intends to give a clear way to find new technical value for constant innovation and to capture core technology and technology convergence. Bibliometrics is a metrical analysis to understand literature's characteristics. Traditional bibliometrics has its limitation not to understand relationship between trend in technology management and technology itself, since it focuses on quantitative indices such as quotation frequency. To overcome this issue, the network focused bibliometrics has been used instead of traditional one. The network focused bibliometrics mainly uses "Co-citation" and "Co-word" analysis. In this study, a keywords network analysis, one of social network analysis, is performed to analyze recent research trend in MOT. For the analysis, we collected keywords from research papers published in international journals related MOT between 2002 and 2011, constructed a keyword network, and then conducted the keywords network analysis. Over the past 40 years, the studies in social network have attempted to understand the social interactions through the network structure represented by connection patterns. In other words, social network analysis has been used to explain the structures and behaviors of various social formations such as teams, organizations, and industries. In general, the social network analysis uses data as a form of matrix. In our context, the matrix depicts the relations between rows as papers and columns as keywords, where the relations are represented as binary. Even though there are no direct relations between papers who have been published, the relations between papers can be derived artificially as in the paper-keyword matrix, in which each cell has 1 for including or 0 for not including. For example, a keywords network can be configured in a way to connect the papers which have included one or more same keywords. After constructing a keywords network, we analyzed frequency of keywords, structural characteristics of keywords network, preferential attachment and growth of new keywords, component, and centrality. The results of this study are as follows. First, a paper has 4.574 keywords on the average. 90% of keywords were used three or less times for past 10 years and about 75% of keywords appeared only one time. Second, the keyword network in MOT is a small world network and a scale free network in which a small number of keywords have a tendency to become a monopoly. Third, the gap between the rich (with more edges) and the poor (with fewer edges) in the network is getting bigger as time goes on. Fourth, most of newly entering keywords become poor nodes within about 2~3 years. Finally, keywords with high degree centrality, betweenness centrality, and closeness centrality are "Innovation," "R&D," "Patent," "Forecast," "Technology transfer," "Technology," and "SME". The results of analysis will help researchers identify major trends in MOT research and then seek a new research topic. We hope that the result of the analysis will help researchers of MOT identify major trends in technology research, and utilize as useful reference information when they seek consilience with other fields of study and select a new research topic.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

Wave Analysis and Spectrum Estimation for the Optimal Design of the Wave Energy Converter in the Hupo Coastal Sea (파력발전장치 설계를 위한후포 연안의 파랑 분석 및 스펙트럼 추정)

  • Kweon, Hyuck-Min;Cho, Hongyeon;Jeong, Weon-Mu
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.25 no.3
    • /
    • pp.147-153
    • /
    • 2013
  • There exist various types of the WEC (Wave Energy Converter), and among them, the point absorber is the most popularly investigated type. However, it is difficult to find examples of systematically measured data analysis for the design of the point absorber type of power buoy in the world. The study investigates the wave load acting on the point absorber type resonance power buoy wave energy extraction system proposed by Kweon et al. (2010). This study analyzes the time series spectra with respect to the three-year wave data (2002.05.01~2005.03.29) measured using the pressure type wave gage at the seaside of north breakwater of Hupo harbor located in the east coast of the Korean peninsula. From the analysis results, it could be deduced that monthly wave period and wave height variations were apparent and that monthly wave powers were unevenly distributed annually. The average wave steepness of the usual wave was 0.01, lower than that of the wind wave range of 0.02-0.04. The mode of the average wave period has the value of 5.31 sec, while mode of the wave height of the applicable period has the value of 0.29 m. The occurrence probability of the peak period is a bi-modal type, with a mode value between 4.47 sec and 6.78 sec. The design wave period can be selected from the above four values of 0.01, 5.31, 4.47, 6.78. About 95% of measured wave heights are below 1 m. Through this study, it was found that a resonance power buoy system is necessary in coastal areas with low wave energy and that the optimal design for overcoming the uneven monthly distribution of wave power is a major task in the development of a WEF (Wave Energy Farm). Finding it impossible to express the average spectrum of the usual wave in terms of the standard spectrum equation, this study proposes a new spectrum equation with three parameters, with which basic data for the prediction of the power production using wave power buoy and the fatigue analysis of the system can be given.

Kinetic and Statistical Analysis of Adsorption and Photocatalysis on Sulfamethoxazole Degradation by UV/$TiO_2$/HAP System (UV/$TiO_2$/HAP 시스템에서 Sulfamethoxazole의 흡착과 광촉매반응에 대한 동역학적 및 통계적 해석)

  • Chun, Suk-Young;Chang, Soon-Woong
    • Journal of the Korean GEO-environmental Society
    • /
    • v.13 no.5
    • /
    • pp.5-12
    • /
    • 2012
  • Antibiotics have been considered emerging compounds due to their continuous input and persistence in environment. Due to the limited biodegradability and widespread use of these antibiotics, an incomplete removal is attained in conventional wastewater treatment plants and relative large quantities are released into the environment. In this study, it was determined the adsorption and photocatalysis kinetics of antibiotics (Sulfamethoxazole, SMX) with various catalyst (Titanium dioxide; $TiO_2$, Hydroxyapatite; HAP) conditions under UV/$TiO_2$/HAP system. In addition, the statistical analysis of response surface methods (RSM) was used to determine the effects of operating parameters on UV/$TiO_2$/HAP system. $TiO_2$/HAP adsorbent were found to follow the pseudo second order reaction in the adsorption. In the result of applied intrapaticle diffusion model, the constants of reaction rate were $TiO_2$=$0.064min^{-1}$, HAP=$0.2866min^{-1}$ and $TiO_2$/HAP=$0.3708min^{-1}$, respectively.The result of RSM, term of regression analysis in analysis of variance (ANOVA) showed significantly p-value (p<0.05) and high coefficients for determination values($R^2$=96.2%, $R^2_{Adj}$=89.3%) that allowed satisfactory prediction of second order regression model. And the estimated optimal conditions for Y(Sulfamethoxazole removal efficiency, %) were $x_1$(initial concentration of Sulfamethoxazole)=-0.7828, $x_2$(amount of catalyst)=0.9974 and $x_3$(reation time)=0.5738 by coded parameters, respectively. According to the result of intraparticle diffusion model and photocatalysis experiments, it was shown that the $TiO_2$/HAP was more effective system than conventional AOPs(advanced oxidation processes, UV/$TiO_2$ system).

Study on the Economic Co-operation action by analyzing the North Korea's Military Negotiations - Focusing on Inter-Korean Military Talks and working level talks about Gaeseong industrial complex - (북한 협상모델 분석을 통한 경제협력 실천방안 연구 - 남북 군사협상 및 개성공단 실무회담 사례를 중심으로-)

  • Lee, Sung-Choon
    • International Commerce and Information Review
    • /
    • v.15 no.3
    • /
    • pp.353-384
    • /
    • 2013
  • When it comes to the current inter-Korean relationship, the two Koreas are in the step of core practical negotiation on exchanges and cooperation away from deadlock so far. It is expected that they will have more and frequent meetings in the future. Therefore, now is the time the South Korea needed to come up with systematic countermeasures because there is nothing more important and giving more impact on our society than the matter of North Korea. As the purpose of social science lies with the explanation and prediction of the social phenomena in the society, it is considered to be meaningful to analyze the representative military negotiations such as the defense ministry-level talks, general-level talks, and working-level talks between the two Koreas where the participants from South Korea consisting of the military representatives discussed with their counterparts of North Korea since the signing of the armistice in Korea on July 27, 1953. This study analyzes and evaluates the behaviors of North Korea's military negotiations with the South Korea in the Kim Jong-il era on the overall basis. In particular, the research tries to prove that the behaviors of military negotiations under Kim Jeong-il regime were made in the frame of the negotiation model by analyzing many negotiations presented in 'With Century', Kim Il Sung's Memoirs under his anti-Japan-guerilla era and suggesting the analysis frame of anti-Japan-guerilla style negotiation model. Based on the results of this proof, the study looks at carefully the specific characteristics of anti-Japan guerrilla-type negotiation.

  • PDF

Microbiological Quality of Agricultural Water in Jeollabuk-do and the Population Changes of Pathogenic Escherichia Coli O157:H7 in Agricultural Water Depending on Temperature and Water Quality (전라북도 지역 농업용수의 미생물학적 특성 및 온도와 수질에 따른 농업용수의 병원성대장균 O157:H7 밀도 변화)

  • Hwang, Injun;Ham, Hyeonheui;Park, Daesoo;Chae, Hyobeen;Kim, Se-Ri;Kim, Hwang-Yong;Kim, Hyun Ju;Kim, Won-Il
    • Korean Journal of Environmental Agriculture
    • /
    • v.38 no.4
    • /
    • pp.254-261
    • /
    • 2019
  • BACKGROUND: Agricultural water is known to be one of the major routes in bacterial contamination of fresh vegetable. However, there is a lack of fundamental data on the microbial safety of agricultural water in Korea. METHODS AND RESULTS: We investigated the density of indicator bacteria in the surface water samples from 31 sites collected in April, July, and October 2018, while the groundwater samples were collected from 20 sites within Jeollabuk-do in April and July 2018. In surface water, the mean density of coliform, fecal coliform, and Escherichia coli was 2.7±0.55, 1.9±0.71, and 1.4±0.58 log CFU/100 mL, respectively, showing the highest bacterial density in July. For groundwater, the mean density of coliform, fecal coliform, and E. coli was 1.9±0.58, 1.4±0.37, and 1.0±0.33 log CFU/ 100mL, respectively, showing no significant difference between sampling time. The survival of E. coli O157:H7 were prolonged in water with higher organic matter contents such as total nitrogen (TN), and nitrate-nitrogen (NO3-N). The reduction rates of E. coli O157:H7 in the water showed greater in order of 25, 35, 5, and 15℃. CONCLUSION: These results can be utilized as fundamental data for prediction the microbiological contamination of agricultural water and the development of microbial prevention technology.

A Comparative Study on Failure Pprediction Models for Small and Medium Manufacturing Company (중소제조기업의 부실예측모형 비교연구)

  • Hwangbo, Yun;Moon, Jong Geon
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.11 no.3
    • /
    • pp.1-15
    • /
    • 2016
  • This study has analyzed predication capabilities leveraging multi-variate model, logistic regression model, and artificial neural network model based on financial information of medium-small sized companies list in KOSDAQ. 83 delisted companies from 2009 to 2012 and 83 normal companies, i.e. 166 firms in total were sampled for the analysis. Modelling with training data was mobilized for 100 companies inlcuding 50 delisted ones and 50 normal ones at random out of the 166 companies. The rest of samples, 66 companies, were used to verify accuracies of the models. Each model was designed by carrying out T-test with 79 financial ratios for the last 5 years and identifying 9 significant variables. T-test has shown that financial profitability variables were major variables to predict a financial risk at an early stage, and financial stability variables and financial cashflow variables were identified as additional significant variables at a later stage of insolvency. When predication capabilities of the models were compared, for training data, a logistic regression model exhibited the highest accuracy while for test data, the artificial neural networks model provided the most accurate results. There are differences between the previous researches and this study as follows. Firstly, this study considered a time-series aspect in light of the fact that failure proceeds gradually. Secondly, while previous studies constructed a multivariate discriminant model ignoring normality, this study has reviewed the regularity of the independent variables, and performed comparisons with the other models. Policy implications of this study is that the reliability for the disclosure documents is important because the simptoms of firm's fail woule be shown on financial statements according to this paper. Therefore institutional arragements for restraing moral laxity from accounting firms or its workers should be strengthened.

  • PDF

Utility of B-type Natriuretic Peptide in Patients with Acute Respiratory Distress Syndrome (급성호흡곤란증후군 환자에 있어서 B-type Natriuretic Peptide의 유용성)

  • Rhee, Chin Kook;Joo, Young Bin;Kim, Seok Chan;Park, Sung Hak;Lee, Sook Young;Koh, Yoon Seok;Kim, Young Kyoon
    • Tuberculosis and Respiratory Diseases
    • /
    • v.62 no.5
    • /
    • pp.389-397
    • /
    • 2007
  • Background B-type natriuretic peptide (BNP) has been shown to be strong mortality predictors in a wide variety of cardiovascular syndromes. Little is known about BNP in patients with acute respiratory distress syndrome (ARDS). We studied whether BNP can predict mortality in patients with ARDS. Method Echocardiographic study was done to all patients with ARDS, and we excluded patient with low ejection fraction (less than 50%) or showing any features of diastolic dysfunction. 47 patients were enrolled between December, 2003 and February, 2006. Parameters including BNP were obtained within 24h hours at the time of enrollment. Result Mean BNP concentrations and APACHE II scores differed between the survivors and nonsurvivors (BNP, $219.5{\pm}57.7pg/mL$ vs $492.3{\pm}88.8pg/mL$; p=0.013, APACHE II score, $17.4{\pm}1.6$ vs $23.1{\pm}1.3$, p=0.009, respectively). With the use of the threshold value for BNP of 585 pg/mL, the specificity for the prediction of mortality was 94%. The threshold value for APACHE II of 15.5 showed sensitivity of 87%. 'APACHE II + $11{\times}logBNP$' showed sensitivity 63%, and specificity 82%, using threshold value for 46.14. Conclusion BNP concentrations and APCHE II scores were more elevated in nonsurvivors than survivors in patients with ARDS who have normal ejection fraction. BNP can predict mortality. Further study should be done.