• 제목/요약/키워드: 소셜 데이터 분석

검색결과 735건 처리시간 0.027초

Buffer Cache Management based on Nonvolatile Memory to Improve the Performance of Smartphone Storage (스마트폰 저장장치의 성능개선을 위한 비휘발성메모리 기반의 버퍼캐쉬 관리)

  • Choi, Hyunkyoung;Bahn, Hyokyung
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • 제16권3호
    • /
    • pp.7-12
    • /
    • 2016
  • DRAM is commonly used as a smartphone memory medium, but extending its capacity is challenging due to DRAM's large battery consumption and density limit. Meanwhile, smartphone applications such as social network services need increasingly large memory, resulting in long latency due to additional storage accesses. To alleviate this situation, we adopt emerging nonvolatile memory (NVRAM) as smartphone's buffer cache and propose an efficient management scheme. The proposed scheme stores all dirty data in NVRAM, thereby reducing the number of storage accesses. Moreover, it separately exploits read and write histories of data accesses, leading to more efficient management of volatile and nonvolatile buffer caches, respectively. Trace-driven simulations show that the proposed scheme improves I/O performances significantly.

A Model to Predict Popularity of Internet Posts on Internet Forum Sites (인터넷 토론 게시판의 게시물 인기도 예측 모델)

  • Lee, Yun-Jung;Jung, In-Jun;Woo, Gyun
    • The KIPS Transactions:PartD
    • /
    • 제19D권1호
    • /
    • pp.113-120
    • /
    • 2012
  • Today, Internet users can easily create and share the digital contents with others through various online content sharing services such as YouTube. So, many portal sites are flooded with lots of user created contents (UCC) in various media such as texts and videos. Estimating popularity of UCC is a crucial concern to both users and the site administrators. This paper proposes a method to predict the popularity of Internet articles, a kind of UCC, using the dynamics of the online contents themselves. To analyze the dynamics, we regarded the access counts of Internet posts as the popularity of them and analyzed the variation of the access counts. We derived a model to predict the popularity of a post represented by the time series of access counts, which is based on an exponential function. According to the experimental results, the difference between the actual access counts and the predicted ones is not more than 10 for 20,532 posts, which cover about 90.7% of the test set.

A Study on the Development of Intelligent Contents and Interactive Storytelling System (지능형콘텐츠 개발과 인터렉티브 스토리텔링 시스템 연구)

  • Lee, Eun Ryoung;Kim, Kio Chung
    • Journal of Digital Convergence
    • /
    • 제11권1호
    • /
    • pp.423-430
    • /
    • 2013
  • The development of information technology introduced digital contents and Social Network Services(SNS), and allowed the virtual transaction and communication between users called "the experience knowledge" advanced from "the objective knowledge." This paper will analyze interactive storytelling system creating different types of stories on narrative genre about family history, personal history and so on. Through analysis on narrative interviews, direct observations, documentations and visual records, contents about CEO story, corporate story, family story and especially family history will be categorized into sampleDB and informationDB. Accumulated contents will allow the user to increase the value and usage of the contents through interactive storytelling system by restructuring the contents on family history. This research has developed writing tool data model using different digital contents such as texts, images and pictures to encourage open communications between first generations and third generations in Korea. Furthermore, researched about connected system on interactive storytelling creation device using various genre of family story that has been data based.

Changes and Applications of Rural Tourism in the Post-COVID-19 Era through Social Data Analysis (소셜데이터 분석을 통한 포스트 코로나 시대 농촌관광의 변화와 적용방안)

  • Kim, Young-Jin;Lee, Sung-hee;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning
    • /
    • 제27권4호
    • /
    • pp.43-54
    • /
    • 2021
  • This study analysed changes in rural tourism between before and after COVID-19 using LDA topic analysis. In order to understand the changes in rural tourism, blog data including the keyword 'Gochang-gun travel' was used. As a result of LDA topic analysis with blog data retrieved, the study found nine topics in 2019 and 2020. 2019 and 2020 are, generally, consistent in topics, but the three topics related to rural experiential tourism that appeared in 2019 did not appear in 2020. In 2020, three new topics emerged: Beach vacations and campings. New travel activities of noncontact with other people(Untact tourism in Korean context) in the COVID-19 era, and The negative impacts on travel businesses and behaviours from COVID-19. Especially, the adverse effects of COVID-19 have made an enormous decline in rural experience tourism destinations and cancellation of local festivals. On the other hand, new tourism activities have emerged due to COVID-19. Those activities have included camping, drive-thru destinations, and cycling. Ecological and natural tourist sites such as Ungok Wetland, Seonunsan Mountain, Seonunsa Temple, and Gusipo Beach appeared. These tourist destinations have a quiet atmosphere and less density place noncontacting with other people when visiting. Also, because overseas travel has become difficult, long-term stay travel in rural areas has appeared. This study indicates that COVID-19 has less impacted rural tourism than other tourism destinations with these positive and negative impacts.

Urban Landscape Image Study by Text Mining and Factor Analysis - Focused on Lotte World Tower - (텍스트 마이닝과 인자분석에 의한 도시경관이미지 연구 - 롯데월드타워를 대상으로 -)

  • Woo, Kyung-Sook;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • 제45권4호
    • /
    • pp.104-117
    • /
    • 2017
  • This study compares the results of landscape image analysis using text mining techniques and factor analysis for Lotte World Tower, which is the first atypical skyscraper building in Korea, and identifies landscape images of the site to determine possibilities of use. Lotte World Tower's landscape image has been extracted from text mining analysis focusing on adjectives such as 'new', 'transformational', 'unusual', 'novelty', 'impressive', and 'unique', and phrases such as in the process of change, people's active elements(caliber, outing, project, night view), media(newspaper, blog), and climate(weather, season). As a result of the factor analysis, factors affecting the landscape image of Lotte World Tower were symbolic, aesthetic, and formative. Identification, which is a morphological feature, has characteristics of scale and visibility but it is not statistically significant in preference. Rather, the psychological factors such as the symbolism with characteristics such as poison and specialty, harmony with the characteristics of the surrounding environment, and beautiful aesthetic characteristics were an influence on the landscape image. The common results of the two research methods show that psychological characteristics such as factors that can represent and represent the city affect the landscape image more greatly than the morphological and physical characteristics such as location and location of the building. In addition, the text mining technique can identify nouns and adjectives corresponding to the images that people see and feel, and confirms the relationship between the derived keywords, so that it can focus the process of forming the landscape image and further the image of the city. It would appear to be a suitable method to complement the limitation of landscape research. This study is meaningful in that it confirms the possibility that big data can be utilized in landscape analysis, which is one research field of landscape architecture, and is significant for understanding the information of a big data base and contribute to enlarging the landscape research area.

Construction of Consumer Confidence index based on Sentiment analysis using News articles (뉴스기사를 이용한 소비자의 경기심리지수 생성)

  • Song, Minchae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • 제23권3호
    • /
    • pp.1-27
    • /
    • 2017
  • It is known that the economic sentiment index and macroeconomic indicators are closely related because economic agent's judgment and forecast of the business conditions affect economic fluctuations. For this reason, consumer sentiment or confidence provides steady fodder for business and is treated as an important piece of economic information. In Korea, private consumption accounts and consumer sentiment index highly relevant for both, which is a very important economic indicator for evaluating and forecasting the domestic economic situation. However, despite offering relevant insights into private consumption and GDP, the traditional approach to measuring the consumer confidence based on the survey has several limits. One possible weakness is that it takes considerable time to research, collect, and aggregate the data. If certain urgent issues arise, timely information will not be announced until the end of each month. In addition, the survey only contains information derived from questionnaire items, which means it can be difficult to catch up to the direct effects of newly arising issues. The survey also faces potential declines in response rates and erroneous responses. Therefore, it is necessary to find a way to complement it. For this purpose, we construct and assess an index designed to measure consumer economic sentiment index using sentiment analysis. Unlike the survey-based measures, our index relies on textual analysis to extract sentiment from economic and financial news articles. In particular, text data such as news articles and SNS are timely and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. There exist two main approaches to the automatic extraction of sentiment from a text, we apply the lexicon-based approach, using sentiment lexicon dictionaries of words annotated with the semantic orientations. In creating the sentiment lexicon dictionaries, we enter the semantic orientation of individual words manually, though we do not attempt a full linguistic analysis (one that involves analysis of word senses or argument structure); this is the limitation of our research and further work in that direction remains possible. In this study, we generate a time series index of economic sentiment in the news. The construction of the index consists of three broad steps: (1) Collecting a large corpus of economic news articles on the web, (2) Applying lexicon-based methods for sentiment analysis of each article to score the article in terms of sentiment orientation (positive, negative and neutral), and (3) Constructing an economic sentiment index of consumers by aggregating monthly time series for each sentiment word. In line with existing scholarly assessments of the relationship between the consumer confidence index and macroeconomic indicators, any new index should be assessed for its usefulness. We examine the new index's usefulness by comparing other economic indicators to the CSI. To check the usefulness of the newly index based on sentiment analysis, trend and cross - correlation analysis are carried out to analyze the relations and lagged structure. Finally, we analyze the forecasting power using the one step ahead of out of sample prediction. As a result, the news sentiment index correlates strongly with related contemporaneous key indicators in almost all experiments. We also find that news sentiment shocks predict future economic activity in most cases. In almost all experiments, the news sentiment index strongly correlates with related contemporaneous key indicators. Furthermore, in most cases, news sentiment shocks predict future economic activity; in head-to-head comparisons, the news sentiment measures outperform survey-based sentiment index as CSI. Policy makers want to understand consumer or public opinions about existing or proposed policies. Such opinions enable relevant government decision-makers to respond quickly to monitor various web media, SNS, or news articles. Textual data, such as news articles and social networks (Twitter, Facebook and blogs) are generated at high-speeds and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Although research using unstructured data in economic analysis is in its early stages, but the utilization of data is expected to greatly increase once its usefulness is confirmed.

Social Network Analysis for the Effective Adoption of Recommender Systems (추천시스템의 효과적 도입을 위한 소셜네트워크 분석)

  • Park, Jong-Hak;Cho, Yoon-Ho
    • Journal of Intelligence and Information Systems
    • /
    • 제17권4호
    • /
    • pp.305-316
    • /
    • 2011
  • Recommender system is the system which, by using automated information filtering technology, recommends products or services to the customers who are likely to be interested in. Those systems are widely used in many different Web retailers such as Amazon.com, Netfix.com, and CDNow.com. Various recommender systems have been developed. Among them, Collaborative Filtering (CF) has been known as the most successful and commonly used approach. CF identifies customers whose tastes are similar to those of a given customer, and recommends items those customers have liked in the past. Numerous CF algorithms have been developed to increase the performance of recommender systems. However, the relative performances of CF algorithms are known to be domain and data dependent. It is very time-consuming and expensive to implement and launce a CF recommender system, and also the system unsuited for the given domain provides customers with poor quality recommendations that make them easily annoyed. Therefore, predicting in advance whether the performance of CF recommender system is acceptable or not is practically important and needed. In this study, we propose a decision making guideline which helps decide whether CF is adoptable for a given application with certain transaction data characteristics. Several previous studies reported that sparsity, gray sheep, cold-start, coverage, and serendipity could affect the performance of CF, but the theoretical and empirical justification of such factors is lacking. Recently there are many studies paying attention to Social Network Analysis (SNA) as a method to analyze social relationships among people. SNA is a method to measure and visualize the linkage structure and status focusing on interaction among objects within communication group. CF analyzes the similarity among previous ratings or purchases of each customer, finds the relationships among the customers who have similarities, and then uses the relationships for recommendations. Thus CF can be modeled as a social network in which customers are nodes and purchase relationships between customers are links. Under the assumption that SNA could facilitate an exploration of the topological properties of the network structure that are implicit in transaction data for CF recommendations, we focus on density, clustering coefficient, and centralization which are ones of the most commonly used measures to capture topological properties of the social network structure. While network density, expressed as a proportion of the maximum possible number of links, captures the density of the whole network, the clustering coefficient captures the degree to which the overall network contains localized pockets of dense connectivity. Centralization reflects the extent to which connections are concentrated in a small number of nodes rather than distributed equally among all nodes. We explore how these SNA measures affect the performance of CF performance and how they interact to each other. Our experiments used sales transaction data from H department store, one of the well?known department stores in Korea. Total 396 data set were sampled to construct various types of social networks. The dependant variable measuring process consists of three steps; analysis of customer similarities, construction of a social network, and analysis of social network patterns. We used UCINET 6.0 for SNA. The experiments conducted the 3-way ANOVA which employs three SNA measures as dependant variables, and the recommendation accuracy measured by F1-measure as an independent variable. The experiments report that 1) each of three SNA measures affects the recommendation accuracy, 2) the density's effect to the performance overrides those of clustering coefficient and centralization (i.e., CF adoption is not a good decision if the density is low), and 3) however though the density is low, the performance of CF is comparatively good when the clustering coefficient is low. We expect that these experiment results help firms decide whether CF recommender system is adoptable for their business domain with certain transaction data characteristics.

Assessment of Public Awareness on Invasive Alien Species of Freshwater Ecosystem Using Conservation Culturomics (보전문화체학 접근방식을 통한 생태계교란 생물인 담수 외래종의 대중인식 평가)

  • Park, Woong-Bae;Do, Yuno
    • Journal of Wetlands Research
    • /
    • 제23권4호
    • /
    • pp.364-371
    • /
    • 2021
  • Public awareness of alien species can vary by generation, period, or specific events associated with these species. An understanding of public awareness is important for the management of alien species because differences in public awareness can affect the establishment and implementation of management plans. We analyzed digital texts on social media platforms, news articles, and internet search volumes used in conservation culturomics to understand public interest and sentiment regarding alien freshwater species. The number of tweets, number of news articles, and relative search volume to 11 freshwater alien species were extracted to determine public interest. Additionally, the trend over time, seasonal variability, and repetition period of these data were confirmed. We also calculated the sentiment score and analyzed public sentiment in the collected data using sentiment analysis based on text mining techniques. The American bullfrog, nutria, bluegill, and largemouth bass drew relatively more public interest than other species. Some species showed repeated patterns in the number of Twitter posts, media coverage, and internet searches found according to the specified periods. The text mining analysis results showed negative sentiments from most people regarding alien freshwater species. Particularly, negative sentiments increased over the years after alien species were designated as ecologically disturbing species.

Analysis of Use Behavior of Urban Park Users Expressing Depression on Social Media Using Text Mining Technique (텍스트 마이닝 기법을 활용한 SNS 상에서 우울감을 언급한 도시공원 이용자의 이용행태 분석)

  • Oh, Jiyeon;Nam, Seongwoo;Lee, Peter Sang-Hoon
    • The Journal of the Korea Contents Association
    • /
    • 제22권6호
    • /
    • pp.319-328
    • /
    • 2022
  • The purpose of this study was to investigate the relationship between depression due to the COVID-19 pandemic and park use behaviors using on line posts. During the period of the pandemic prevention activities, text data containing both 'park' and 'depression' were collected from blogs and cafes in the search engine of Naver and Daum, then analyzed using Text Mining and Social Network techniques. As a result, the main usage behaviors of park users who mentioned depression were 'look', 'stroll(walk)' and 'eat'. Other types of behaviors were connected centering around 'look', one of the communication behaviors. Also, from CONCOR analysis, as the cluster referred from communication behavior and dynamic behavior was formed as a single behavior type, it was considered park users with depression perceived the park as the space for communication and physical activities. As the spread of COVID-19 caused the restriction of communication activities, the users might consider parks as one of the solutions. In addition, it was considered that passive usage behaviors have prevailed rather than active ones due to the depression. Resulting outcomes would be useful to plan helpful urban park for citizens. It is necessary to further analyze the park use behavior of users in relation to the period of before/after the COVID-19 pandemic and the existence/nonexistence of depression.

A Study on the Acceptance Factors of the Capital Market Sentiment Index (자본시장 심리지수의 수용요인에 관한 연구)

  • Kim, Suk-Hwan;Kang, Hyoung-Goo
    • Journal of Intelligence and Information Systems
    • /
    • 제26권3호
    • /
    • pp.1-36
    • /
    • 2020
  • This study is to reveal the acceptance factors of the Market Sentiment Index (MSI) created by reflecting the investor sentiment extracted by processing unstructured big data. The research model was established by exploring exogenous variables based on the rational behavior theory and applying the Technology Acceptance Model (TAM). The acceptance of MSI provided to investors in the stock market was found to be influenced by the exogenous variables presented in this study. The results of causal analysis are as follows. First, self-efficacy, investment opportunities, Innovativeness, and perceived cost significantly affect perceived ease of use. Second, Diversity of services and perceived benefits have a statistically significant impact on perceived usefulness. Third, Perceived ease of use and perceived usefulness have a statistically significant effect on attitude to use. Fourth, Attitude to use statistically significantly influences the intention to use, and the investment opportunities as an independent variable affects the intention to use. Fifth, the intention to use statistically significantly affects the final dependent variable, the intention to use continuously. The mediating effect between the independent and dependent variables of the research model is as follows. First, The indirect effect on the causal route from diversity of services to continuous use intention was 0.1491, which was statistically significant at the significance level of 1%. Second, The indirect effect on the causal route from perceived benefit to continuous use intention was 0.1281, which was statistically significant at the significance level of 1%. The results of the multi-group analysis are as follows. First, for groups with and without stock investment experience, multi-group analysis was not possible because the measurement uniformity between the two groups was not secured. Second, the analysis result of the difference in the effect of independent variables of male and female groups on the intention to use continuously, where measurement uniformity was secured between the two groups, In the causal route from usage attitude to usage intention, women are higher than men. And in the causal route from use intention to continuous use intention, males were very high and showed statistically significant difference at significance level 5%.