• Title/Summary/Keyword: subject-cluster

Search Result 161, Processing Time 0.025 seconds

A Study on Analysis of Research Data Repository in Humanities and Social Sciences (re3data를 기반으로 한 인문사회 RDR 연구)

  • Cho, Jane;Park, Jong-Do
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.2
    • /
    • pp.69-87
    • /
    • 2019
  • As the discussions on sharing research data prevail by the chance of the inauguration of the International Open Data Charter, research support organizations in the United States, the United Kingdom, and Japan are encouraging researchers to deposit their findings in a credible repository. Humanities and social sciences field, in which research data sharing culture and storage infrastructure are immature compared to life science and natural science, also needs to establish and operate a reliable storage infrastructure to guarantee the continuous access and utilization of data. This study analyzed the overall operational status of 305 subject repositories registered in re3data for the humanities and social sciences and clustered them according to the operational level using 5 indicators. As a result, 70% of the population were identified as universal clusters, and 20% of the excellent cluster was found to have the largest number of linguistic fields and the German-operated. In addition, this study confirmed through correspondence analysis that there is a relation between the sub-theme fields of humanities and social sciences and the types of data to be archived. The history and art domians are related to images, and social studies are related to statistical data. Linguistics has also been analyzed to be related to audio, plain text, and code.

Sell-sumer: The New Typology of Influencers and Sales Strategy in Social Media (셀슈머(Sell-sumer)로 진화한 인플루언서의 새로운 유형과 소셜미디어에서의 세일즈 전략)

  • Shin, Hajin;Kim, Sulim;Hong, Manny;Hwang, Bom Nym;Yang, Hee-Dong
    • Knowledge Management Research
    • /
    • v.22 no.4
    • /
    • pp.217-235
    • /
    • 2021
  • As 49% of the world's population uses social media platforms, communication and content sharing within social media are becoming more active than ever. In this environmental base, the one-person media market grew rapidly and formed public opinion, creating a new trend called sell-sumer. This study defined new types of influencers by product category by analyzing the subject concentration of the commercial/non-commercial keywords of influencers and the impact of the ratio of commercial postings on sales. It is hoped that influencers working within social media will be helpful to new sales strategies that are transformed into sell-sumers. The method of this study classifies influencers' commercial/non-commercial posts using Python, performs text mining using KoNLPy, and calculates similarity between FastText-based words. As a result, it has been confirmed that the higher the keyword theme concentration of the influencer's commercial posting, the higher the sales. In addition, it was confirmed through the cluster analysis that the influencer types for each product category were classified into four types and that there was a significant difference between groups according to sales. In other words, the implications of this study may suggest empirical solutions of social media sales strategies for influencers working on social media and marketers who want to use them as marketing tools.

Analysis of Regional Economic Ripple Effects of Port Logistics Industry in Gwangyang City - Focusing on Exogenous Specified Input-Output Model - (광양시 항만물류산업의 지역경제 파급효과 분석 - 외생화 산업연관모형을 중심으로 -)

  • Kim, Min-Seong;Na, Ju-Mong
    • Journal of Korea Port Economic Association
    • /
    • v.39 no.2
    • /
    • pp.77-95
    • /
    • 2023
  • The regional infrastructure industries of Gwangyang City, the subject of this study, are Gwangyang Port and Gwangyang Steel Mill. Therefore, it is necessary to analyze the regional economic ripple effects of the port logistics industry in Gwangyang City. In this study, a multi-stage approach using the RW and the LQ methodology using the national input-output tables in 2015 and 2019 is used to prepare the regional interindustry analysis chart in Gwangyang City, and an exogenous demand induction model that reclassified the port logistics industry was applied. Through this, the purpose of this study was to provide policy implications by figuring out the regional economic ripple effects of the port logistics industry quantitatively in Gwangyang City. As a result of the analysis, the industries with high production inducement effect and forward/backward linkage effect of the port logistics industry in Gwangyang City were analyzed as manufacturing, transportation, land and air logistics sectors. And the industries in which the added value inducement effect and the employment inducement effect were analyzed as an industry related to the service industry. Therefore, it is necessary to prepare support measures to foster the port logistics industry as a way to promote these industries and revitalize the local economy of Gwangyang City. To this end, it is desirable to improve policies and systems for the vitalization of the Gwangyang port maritime cluster and provide various policy support for the port logistics industry in Gwangyang City. This study is meaningful in suggesting policy implications for the regional economy of Gwangyang City based on the results of exogenous analysis of the port logistics industry in small and medium-sized cities. However, It seems that further studies related to this will be needed in the future.

Development and Analysis of COMS AMV Target Tracking Algorithm using Gaussian Cluster Analysis (가우시안 군집분석을 이용한 천리안 위성의 대기운동벡터 표적추적 알고리듬 개발 및 분석)

  • Oh, Yurim;Kim, Jae Hwan;Park, Hyungmin;Baek, Kanghyun
    • Korean Journal of Remote Sensing
    • /
    • v.31 no.6
    • /
    • pp.531-548
    • /
    • 2015
  • Atmospheric Motion Vector (AMV) from satellite images have shown Slow Speed Bias (SSB) in comparison with rawinsonde. The causes of SSB are originated from tracking, selection, and height assignment error, which is known to be the leading error. However, recent works have shown that height assignment error cannot be fully explained the cause of SSB. This paper attempts a new approach to examine the possibility of SSB reduction of COMS AMV by using a new target tracking algorithm. Tracking error can be caused by averaging of various wind patterns within a target and changing of cloud shape in searching process over time. To overcome this problem, Gaussian Mixture Model (GMM) has been adopted to extract the coldest cluster as target since the shape of such target is less subject to transformation. Then, an image filtering scheme is applied to weigh more on the selected coldest pixels than the other, which makes it easy to track the target. When AMV derived from our algorithm with sum of squared distance method and current COMS are compared with rawindsonde, our products show noticeable improvement over COMS products in mean wind speed by an increase of $2.7ms^{-1}$ and SSB reduction by 29%. However, the statistics regarding the bias show negative impact for mid/low level with our algorithm, and the number of vectors are reduced by 40% relative to COMS. Therefore, further study is required to improve accuracy for mid/low level winds and increase the number of AMV vectors.

Analysis on the Trends of Studies Related to the National Competency Standard in Korea throughout the Semantic Network Analysis (언어네트워크 분석을 적용한 국가직무능력표준(NCS) 연구 동향 분석)

  • Lim, Yun-Jin;Son, Da-Mi
    • 대한공업교육학회지
    • /
    • v.41 no.2
    • /
    • pp.48-68
    • /
    • 2016
  • This study was conducted to identify the NCS-related research trends, Keywords, the Keywords Networks and the extension of the Keywords using the sementic network analysis and to seek for the development plans about NCS. For this, the study searched 345 the papers, with the National Competency Standards or NCS as a key word, among master's theses, dissertations and scholarly journals that RISS provides, and selected a total of 345 papers. Annual frequency analysis of the selected papers was carried out, and Semantic Network Analysis was carried out for 68 key words which can be seen as key terms of the terms shown by the subject. The method of analysis were KrKwic software, UCINET6.0 and NetDraw. The study results were as follows: First, NCS-related research increased gradually after starting in 2002, and has been accomplishing a significant growth since 2014. Second, as a result of analysis of keyword network, 'NCS, development, curriculum, analysis, application, job, university, education,' etc. appeared as priority key words. Third, as a result of sub-cluster analysis of NCS-related research, it was classified into four clusters, which could be seen as a research related to a specific strategy for realization of NCS's purpose, an exploratory research on improvement in core competency and exploration of college students' possibility related to employment using NCS, an operational research for junior college-centered curriculum and reorganization of the specialized subject, and an analysis of demand and perception of a high school-level vocational education curriculum. Fourth, the connection forming process among key words of domestic study results about NCS was expanding in the form of 'job${\rightarrow}$job ability${\rightarrow}$NCS${\rightarrow}$education${\rightarrow}$process, curriculum${\rightarrow}$development, university${\rightarrow}$analysis, utilization${\rightarrow}$qualification, application, improvement${\rightarrow}$plan, operation, industry${\rightarrow}$design${\rightarrow}$evaluation.'

Effects of Private Security Guards' Job Stress on Organizational Commitment and Turnover Intention: focused on mediating effects of job burnout (민간경비원의 직무스트레스가 조직몰입 및 이직의도에 미치는 영향: 직무소진의 매개효과를 중심으로)

  • Cho, Cheol-Kyu;Kim, Sang-Jin
    • Convergence Security Journal
    • /
    • v.15 no.3_2
    • /
    • pp.31-42
    • /
    • 2015
  • This study aims to discuss how job stress of private security guards would influence organizational commitment and turnover intention, and it basically looks into mediating effects of job burnout to understand the former's effects on the latter. In order to conduct the analysis, the study selected private security guards working for security agencies located in Seoul as a research subject, and carried out a survey targeting 700 of those security guards who had been gathered by a random cluster sampling method. The survey was conducted for about four months from May of 2014 to September of the same year and with 24 samples that had not been returned or that had been observed to have some outliers excluded, a total of 676 samples were applied as final data. The study used SPSSWIN 18.0 Statistical Package for analyzing the data, and hypotheses were confirmed via a Frequency Analysis, Factor analysis, Cronbach's Alpha, Person's Correlation Analysis, regression analysis and a path analysis. Findings of the analysis reported that emotional exhaustion has partially mediating effects on relations among role conflict, role overload and organizational commitment and that role ambiguity is not significantly connected. In addition, as for a relation of role conflict and turnover intention, emotional exhaustion was turned out to have a full mediating effect on the relation. The study did not notice any significant connection between emotional exhaustion and role ambiguity. Add to that, in terms of a relation between role overload and turnover intention, emotional exhaustion appeared to have a partial mediating effect on the relation which helped a relevant hypothesis to be partly adopted. Regarding a relation of job stress with organizational commitment, according to results of a path analysis on dehumanization, dehumanization does not significantly affect a relation between role ambiguity and organizational commitment and as for role conflict and role overload, the study confirmed that they have a partially mediating effect on this relation of dehumanization with organizational commitment. The study learned then that dehumanization does not have a significant influence on a relation between role ambiguity and turnover intention. However, the study figured out that when it comes to a relation of role conflict and role overload, dehumanization has a partially mediating effect on the relation and as a consequence, a relevant hypothesis was adopted in part.

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • According to the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected document is tokenized and structured to convert the original document into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of analysis. Until recently, text mining-related studies have been focused on the application of the second steps, such as document classification, clustering, and topic modeling. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have actively been studied to improve the quality of analysis results by preserving the meaning of words and documents in the process of representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, Unstructured text should be preceded by a structuring task that transforms the original document into a form that the computer can understand before analysis. It is called "Embedding" that arbitrary objects are mapped to a specific dimension space while maintaining algebraic properties for structuring the text data. Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents in various aspects. Particularly, with the demand for analysis of document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec which extends word2Vec and embeds each document into one vector is most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document. This causes a limit that the document vector is affected by not only core words but also miscellaneous words. Additionally, the traditional document embedding schemes usually map each document into a single corresponding vector. Therefore, it is difficult to represent a complex document with multiple subjects into a single vector accurately using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. In the case of a document without keywords, this method can be applied after extract keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. all text in a document is tokenized and each token is represented as a vector having N-dimensional real value through word embedding. After that, to overcome the limitations of the traditional document embedding method that is affected by not only the core word but also the miscellaneous words, vectors corresponding to the keywords of each document are extracted and make up sets of keyword vector for each document. Next, clustering is conducted on a set of keywords for each document to identify multiple subjects included in the document. Finally, a Multi-vector is generated from vectors of keywords constituting each cluster. The experiments for 3.147 academic papers revealed that the single vector-based traditional approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.

Analysis of News Agenda Using Text mining and Semantic Network Analysis: Focused on COVID-19 Emotions (텍스트 마이닝과 의미 네트워크 분석을 활용한 뉴스 의제 분석: 코로나 19 관련 감정을 중심으로)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.47-64
    • /
    • 2021
  • The global spread of COVID-19 around the world has not only affected many parts of our daily life but also has a huge impact on many areas, including the economy and society. As the number of confirmed cases and deaths increases, medical staff and the public are said to be experiencing psychological problems such as anxiety, depression, and stress. The collective tragedy that accompanies the epidemic raises fear and anxiety, which is known to cause enormous disruptions to the behavior and psychological well-being of many. Long-term negative emotions can reduce people's immunity and destroy their physical balance, so it is essential to understand the psychological state of COVID-19. This study suggests a method of monitoring medial news reflecting current days which requires striving not only for physical but also for psychological quarantine in the prolonged COVID-19 situation. Moreover, it is presented how an easier method of analyzing social media networks applies to those cases. The aim of this study is to assist health policymakers in fast and complex decision-making processes. News plays a major role in setting the policy agenda. Among various major media, news headlines are considered important in the field of communication science as a summary of the core content that the media wants to convey to the audiences who read it. News data used in this study was easily collected using "Bigkinds" that is created by integrating big data technology. With the collected news data, keywords were classified through text mining, and the relationship between words was visualized through semantic network analysis between keywords. Using the KrKwic program, a Korean semantic network analysis tool, text mining was performed and the frequency of words was calculated to easily identify keywords. The frequency of words appearing in keywords of articles related to COVID-19 emotions was checked and visualized in word cloud 'China', 'anxiety', 'situation', 'mind', 'social', and 'health' appeared high in relation to the emotions of COVID-19. In addition, UCINET, a specialized social network analysis program, was used to analyze connection centrality and cluster analysis, and a method of visualizing a graph using Net Draw was performed. As a result of analyzing the connection centrality between each data, it was found that the most central keywords in the keyword-centric network were 'psychology', 'COVID-19', 'blue', and 'anxiety'. The network of frequency of co-occurrence among the keywords appearing in the headlines of the news was visualized as a graph. The thickness of the line on the graph is proportional to the frequency of co-occurrence, and if the frequency of two words appearing at the same time is high, it is indicated by a thick line. It can be seen that the 'COVID-blue' pair is displayed in the boldest, and the 'COVID-emotion' and 'COVID-anxiety' pairs are displayed with a relatively thick line. 'Blue' related to COVID-19 is a word that means depression, and it was confirmed that COVID-19 and depression are keywords that should be of interest now. The research methodology used in this study has the convenience of being able to quickly measure social phenomena and changes while reducing costs. In this study, by analyzing news headlines, we were able to identify people's feelings and perceptions on issues related to COVID-19 depression, and identify the main agendas to be analyzed by deriving important keywords. By presenting and visualizing the subject and important keywords related to the COVID-19 emotion at a time, medical policy managers will be able to be provided a variety of perspectives when identifying and researching the regarding phenomenon. It is expected that it can help to use it as basic data for support, treatment and service development for psychological quarantine issues related to COVID-19.

A Study on the Status of Startups and Their Nurturing Plans: Focusing on Startups in Seongnam City (스타트업 실태 및 육성방안에 관한 연구: 성남시 스타트업을 중심으로)

  • Han, Kyu-Dong;Jeon, Byung-Hoon
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.17 no.5
    • /
    • pp.67-80
    • /
    • 2022
  • This study was conducted to derive policy measures such as fostering and supporting by examining the actual conditions of domestic startups. The subject of this study was the start-ups located in Seongnam-si, where Pangyo Techno Valley, which is the highest-level innovation cluster in Korea and is evaluated as a start-up mecca. Startups were defined as startups under 7 years old based on new technologies such as IT, BT, and CT, and the subjects of the study were selected. This can be seen as a step forward from previous research in that it embodies the concept of a startup that was previously abstract in a quantitatively measurable way. As a result of the analysis, about 94% of startups are distributed in the so-called "Death Valley" growth stage, and startups above scale-up, which means full-scale growth beyond BEP, account for about 6%. appeared to be occupied. He cited the problem of start-up funds as the biggest difficulty in the early stages of startups, and cited the loan evaluation method that prioritizes sales or collateral in raising funds as the biggest problem. In addition, start-ups rated the access to private investment capital such as VC, AC, and angel investors at a low level compared to policy funds, which are public funds. Most startups showed a lot of interest in overseas expansion, and they chose matching overseas investors such as overseas VCs as the biggest support for overseas expansion. The overall competitiveness in the overseas market was 49.6 points, which is less than 50 points out of 100, indicating that the overall competitiveness was somewhat inferior. It was analyzed that public support and investment in overseas sales channels (sales channels, distribution networks, etc.) should be prioritized along with enhancement of technological competitiveness in order for domestic startups to increase their competitiveness in overseas markets as well as in the domestic market.

Term Mapping Methodology between Everyday Words and Legal Terms for Law Information Search System (법령정보 검색을 위한 생활용어와 법률용어 간의 대응관계 탐색 방법론)

  • Kim, Ji Hyun;Lee, Jong-Seo;Lee, Myungjin;Kim, Wooju;Hong, June Seok
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.137-152
    • /
    • 2012
  • In the generation of Web 2.0, as many users start to make lots of web contents called user created contents by themselves, the World Wide Web is overflowing by countless information. Therefore, it becomes the key to find out meaningful information among lots of resources. Nowadays, the information retrieval is the most important thing throughout the whole field and several types of search services are developed and widely used in various fields to retrieve information that user really wants. Especially, the legal information search is one of the indispensable services in order to provide people with their convenience through searching the law necessary to their present situation as a channel getting knowledge about it. The Office of Legislation in Korea provides the Korean Law Information portal service to search the law information such as legislation, administrative rule, and judicial precedent from 2009, so people can conveniently find information related to the law. However, this service has limitation because the recent technology for search engine basically returns documents depending on whether the query is included in it or not as a search result. Therefore, it is really difficult to retrieve information related the law for general users who are not familiar with legal terms in the search engine using simple matching of keywords in spite of those kinds of efforts of the Office of Legislation in Korea, because there is a huge divergence between everyday words and legal terms which are especially from Chinese words. Generally, people try to access the law information using everyday words, so they have a difficulty to get the result that they exactly want. In this paper, we propose a term mapping methodology between everyday words and legal terms for general users who don't have sufficient background about legal terms, and we develop a search service that can provide the search results of law information from everyday words. This will be able to search the law information accurately without the knowledge of legal terminology. In other words, our research goal is to make a law information search system that general users are able to retrieval the law information with everyday words. First, this paper takes advantage of tags of internet blogs using the concept for collective intelligence to find out the term mapping relationship between everyday words and legal terms. In order to achieve our goal, we collect tags related to an everyday word from web blog posts. Generally, people add a non-hierarchical keyword or term like a synonym, especially called tag, in order to describe, classify, and manage their posts when they make any post in the internet blog. Second, the collected tags are clustered through the cluster analysis method, K-means. Then, we find a mapping relationship between an everyday word and a legal term using our estimation measure to select the fittest one that can match with an everyday word. Selected legal terms are given the definite relationship, and the relations between everyday words and legal terms are described using SKOS that is an ontology to describe the knowledge related to thesauri, classification schemes, taxonomies, and subject-heading. Thus, based on proposed mapping and searching methodologies, our legal information search system finds out a legal term mapped with user query and retrieves law information using a matched legal term, if users try to retrieve law information using an everyday word. Therefore, from our research, users can get exact results even if they do not have the knowledge related to legal terms. As a result of our research, we expect that general users who don't have professional legal background can conveniently and efficiently retrieve the legal information using everyday words.