• Title/Summary/Keyword: 텍스트 연구

Search Result 3,492, Processing Time 0.035 seconds

A Study on the Research Trends in Library & Information Science in Korea using Topic Modeling (토픽모델링을 활용한 국내 문헌정보학 연구동향 분석)

  • Park, Ja-Hyun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.1
    • /
    • pp.7-32
    • /
    • 2013
  • The goal of the present study is to identify the topic trend in the field of library and information science in Korea. To this end, we collected titles and s of the papers published in four major journals such as Journal of the Korean Society for information Management, Journal of the Korean Society for Library and Information Science, Journal of Korean Library and Information Science Society, and Journal of the Korean BIBLIA Society for library and Information Science during 1970 and 2012. After that, we applied the well-received topic modeling technique, Latent Dirichlet Allocation(LDA), to the collected data sets. The research findings of the study are as follows: 1) Comparison of the extracted topics by LDA with the subject headings of library and information science shows that there are several distinct sub-research domains strongly tied with the field. Those include library and society in the domain of "introduction to library and information science," professionalism, library and information policy in the domain of "library system," library evaluation in the domain of "library management," collection development and management, information service in the domain of "library service," services by library type, user training/information literacy, service evaluation, classification/cataloging/meta-data in the domain of "document organization," bibliometrics/digital libraries/user study/internet/expert system/information retrieval/information system in the domain of "information science," antique documents in the domain of "bibliography," books/publications in the domain of "publication," and archival study. The results indicate that among these sub-domains, information science and library services are two most focused domains. Second, we observe that there is the growing trend in the research topics such as service and evaluation by library type, internet, and meta-data, but the research topics such as book, classification, and cataloging reveal the declining trend. Third, analysis by journal show that in Journal of the Korean Society for information Management, information science related topics appear more frequently than library science related topics whereas library science related topics are more popular in the other three journals studied in this paper.

A Study on the Essence and Tendency of Modern Manager (현대 경영자로서의 본질과 성향 연구)

  • Yeom, Bae-Hoon;Kim, Hyunsoo
    • Journal of Service Research and Studies
    • /
    • v.10 no.3
    • /
    • pp.23-42
    • /
    • 2020
  • This study conceptualized the essence and propensity of modern management in service age, based on philosophy, and developed items to evaluate the conceptualized content. It was carried out as a new study to deepen the study of management philosophy and management theory by the new management framework. In order to establish the philosophical foundation of the modern management, the essence of the modern management was conceptualized based on the fundamental ideas of the East and West, and then an evaluation item was developed to put the essence and propensity of the modern management into practical use through analytical and empirical methods. After analyzing the representative ideas of mankind, it was derived that the Book of Change has the qualification as a philosophical model that can derive the essence of modern management. The Book of Change explains the reasoning of the world in the structure of two opposing parties, such as Taiji or Yin and Yang, and the process of acknowledging the contradictions within each opposing party and overcoming the contradictions through change is the central idea. Because you can see. After conducting a conceptual study, through empirical research, the essence and propensity of a modern manager should be conceptualized. The concept of essence and empirical study of the modern management using the leading role was conducted in two stages. First, a qualitative study using repetitive comparative analysis (CCM), focus group interview (FGI), and text mining was conducted to derive the essential and propensity conceptualization items that modern managers should possess. In addition, a quantitative study using factor analysis to develop sample items and develop measurement items through literature review and FGI was conducted to derive the essential concept of the modern management. Finally, the essence of modern management was derived: learning, preparation, challenge, inclusion, trust, morality, and sacrifice. In the future, it is necessary to conduct empirical research on the effectiveness of the essence of modern management for global and Korean representative companies.

A Study on the Effect of the Document Summarization Technique on the Fake News Detection Model (문서 요약 기법이 가짜 뉴스 탐지 모형에 미치는 영향에 관한 연구)

  • Shim, Jae-Seung;Won, Ha-Ram;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.201-220
    • /
    • 2019
  • Fake news has emerged as a significant issue over the last few years, igniting discussions and research on how to solve this problem. In particular, studies on automated fact-checking and fake news detection using artificial intelligence and text analysis techniques have drawn attention. Fake news detection research entails a form of document classification; thus, document classification techniques have been widely used in this type of research. However, document summarization techniques have been inconspicuous in this field. At the same time, automatic news summarization services have become popular, and a recent study found that the use of news summarized through abstractive summarization has strengthened the predictive performance of fake news detection models. Therefore, the need to study the integration of document summarization technology in the domestic news data environment has become evident. In order to examine the effect of extractive summarization on the fake news detection model, we first summarized news articles through extractive summarization. Second, we created a summarized news-based detection model. Finally, we compared our model with the full-text-based detection model. The study found that BPN(Back Propagation Neural Network) and SVM(Support Vector Machine) did not exhibit a large difference in performance; however, for DT(Decision Tree), the full-text-based model demonstrated a somewhat better performance. In the case of LR(Logistic Regression), our model exhibited the superior performance. Nonetheless, the results did not show a statistically significant difference between our model and the full-text-based model. Therefore, when the summary is applied, at least the core information of the fake news is preserved, and the LR-based model can confirm the possibility of performance improvement. This study features an experimental application of extractive summarization in fake news detection research by employing various machine-learning algorithms. The study's limitations are, essentially, the relatively small amount of data and the lack of comparison between various summarization technologies. Therefore, an in-depth analysis that applies various analytical techniques to a larger data volume would be helpful in the future.

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.

A Study Meaning Analysis and Interpretation of Body Sign, Kiki Smith - On Pee Body - (키키 스미스 작품에서 신체기호의 의미 분석과 해석 - 를 중심으로 -)

  • Kim, Sung-Hee
    • Journal of Science of Art and Design
    • /
    • v.10
    • /
    • pp.5-50
    • /
    • 2006
  • The terminology "human body" simply means a physical body but also more often, as an object in art works, carries symbolic concepts incorporating the whole history of human lives. Human body has been employed as an artistic object capturing physical body, delivering artist's idea expressing life indicators from different standpoints of times and places. This point of view about human body in art works has in fact rather short history since 1960's when modern thinking paradigm focusing upon rationality and reasoning has begun declining and on the contrary when the body used to be the servant of the mind and soul for a long time has begun attracting artist's attention as a real entity from the viewpoint of dichotomy. During the 1960's, frequent performances in Pop art and of Fluxus showed that the human body has been an important media for artistic communication after importance of body performances had been raised in Action painting in 1940's. The human body became a more determined media in body art works that had got into stride after Yves Kline's conceptual works applying body and its traces. These kinds of art works have continued and consolidated into the Feminism came into blossom in 1980's and into fragmentated and disembodied body art trend in 1990's. Through development of trends in body works, human body now might well be regarded as a clue provide from individual identity with implication over the world. This thesis is to analyse in semiotic way main works of Kiki Smith who is a representative artist devoting to Feminism and proposing extended significance of human body. In the analysis process of works done by two great artists with histrorical background of art trend in order to find and open an significance horizon of human body, semiotics and bodism are therefore perceived as pertinent and applied as basic tools. The first stage of analysis is to get the significances emerged in between expression part and contextual parts, which are separated structually from the most basic level. The study deals with body works furthermore in the way of structual cohesion of the expression and the context from the view of A J. Greimas' Structural Semantics and tried to build up a basic frame for the extended significances of human body. This thesis is, on the other hand, to attempt to contribute for extension of disembodied and fragmentated body discussed in the structural semantic frame earlier by Julia Kriesteva who delivers abjection concepts and phenomenology of Maurice Merleau-Ponty who enables to overview relationship between the body and the world from the viewpoint of Bodism, further into interpretation level. The other works are Kiki smith's that showed epics about death in mid-1980's, detailed humbleness of vulnerable human body exposed to dichotomy and fragmentation in 1990's and religion and mythology incorporating wouln healing in 2000's and henceforth. Through the analysis of Kiki Smith's representative work 'Pee body', it is verified and confirmed that fragmentated body showed beyond boundary gap of the human body and ultimately tends to imply human healing owing to divine maternity. Bodily symbols in Kiki Smith's are extended to the universal world to imply human life and death on the one hand and religion and mythology of human wound and divine healing one the other hand. This thesis through these process and results of analysis is in a broad context, to emphasize that human body as objectified text has a key indicator role to understand world as well as semiotic extension in art works in late 20th century so that we might confirm bodily symbol as a cultural context constitutes a section of contemporary visual arts.

  • PDF

A Case Study of Configuration Strategy and Context in Everyday Artifacts - Concentrated on analysis by Creativity Template Theory and Artifact Context Model - (일상 디자인산물의 구성배치 전략과 맥락에 관한 연구 - 창조성템플릿이론과 산물맥락모델을 이용한 분석을 중심으로 -)

  • Jin Sun-Tai
    • Archives of design research
    • /
    • v.19 no.4 s.66
    • /
    • pp.41-50
    • /
    • 2006
  • It is generally regarded a design system in post-industrial society, which products designed by in-house designers or design consultancy are manufactured in factory and distributed in market for the consumer. Although it is treated an old design system in traditional society, the traces of vernacular design has been remaining in the state of adopted to the periodical needs in these days, also proving the attribute of design culture to constitute human's material environment as well as existing design systems. There were discovered various design artifacts in daily surroundings vary from the established design in several manners, user modifications or manufactures in everyday lives formalized them. It was approached a case study that analyze the changes of artifact configuration and designer/user context and creation process of the non-professional design artifacts, Creativity Template Theory and ACM(Artifact Context Model) have been utilized for the analysis model. From the analysis result, It assume that the everyday artifacts may be ordinary but extra-ordinary including particular ideas and identity represented by everyday designers or users. Beside these characteristics induce the potentiality that reflect on creative motives for the designers or a complementary artifact generator filling up with drawbacks in established design system. The everyday design domain, various explorations and alternatives are made, is seems to be another design practice domain dissimilar to the one in the industry-based design. Moreover it provides an more easily accessability for the approaching user-friendly design, user customization because they conduct the reliable modeling of consumer and end-user. Finally, based on the exploratory study regarding interpretation of context and configuration in the everyday artifacts, new approach for the design process and design education through more detailed cognitive modeling of everyday designers will be a further study.

  • PDF

Detection of Protein Subcellular Localization based on Syntactic Dependency Paths (구문 의존 경로에 기반한 단백질의 세포 내 위치 인식)

  • Kim, Mi-Young
    • The KIPS Transactions:PartB
    • /
    • v.15B no.4
    • /
    • pp.375-382
    • /
    • 2008
  • A protein's subcellular localization is considered an essential part of the description of its associated biomolecular phenomena. As the volume of biomolecular reports has increased, there has been a great deal of research on text mining to detect protein subcellular localization information in documents. It has been argued that linguistic information, especially syntactic information, is useful for identifying the subcellular localizations of proteins of interest. However, previous systems for detecting protein subcellular localization information used only shallow syntactic parsers, and showed poor performance. Thus, there remains a need to use a full syntactic parser and to apply deep linguistic knowledge to the analysis of text for protein subcellular localization information. In addition, we have attempted to use semantic information from the WordNet thesaurus. To improve performance in detecting protein subcellular localization information, this paper proposes a three-step method based on a full syntactic dependency parser and WordNet thesaurus. In the first step, we constructed syntactic dependency paths from each protein to its location candidate, and then converted the syntactic dependency paths into dependency trees. In the second step, we retrieved root information of the syntactic dependency trees. In the final step, we extracted syn-semantic patterns of protein subtrees and location subtrees. From the root and subtree nodes, we extracted syntactic category and syntactic direction as syntactic information, and synset offset of the WordNet thesaurus as semantic information. According to the root information and syn-semantic patterns of subtrees from the training data, we extracted (protein, localization) pairs from the test sentences. Even with no biomolecular knowledge, our method showed reasonable performance in experimental results using Medline abstract data. Our proposed method gave an F-measure of 74.53% for training data and 58.90% for test data, significantly outperforming previous methods, by 12-25%.

Similar sub-Trajectory Retrieval Technique based on Grid for Video Data (비디오 데이타를 위한 그리드 기반의 유사 부분 궤적 검색 기법)

  • Lee, Ki-Young;Lim, Myung-Jae;Kim, Kyu-Ho;Kim, Joung-Joon
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.9 no.5
    • /
    • pp.183-189
    • /
    • 2009
  • Recently, PCS, PDA and mobile devices, such as the proliferation of spread, GPS (Global Positioning System) the use of, the rapid development of wireless network and a regular user even images, audio, video, multimedia data, such as increased use is for. In particular, video data among multimedia data, unlike the moving object, text or image data that contains information about the movements and changes in the space of time, depending on the kinds of changes that have sigongganjeok attributes. Spatial location of objects on the flow of time, changing according to the moving object (Moving Object) of the continuous movement trajectory of the meeting is called, from the user from the database that contains a given query trajectory and data trajectory similar to the finding of similar trajectory Search (Similar Sub-trajectory Retrieval) is called. To search for the trajectory, and these variations, and given the similar trajectory of the user query (Tolerance) in the search for a similar trajectory to approximate data matching (Approximate Matching) should be available. In addition, a large multimedia data from the database that you only want to be able to find a faster time-effective ways to search different from the existing research is required. To this end, in this paper effectively divided into a grid to search for the trajectory to the trajectory of moving objects, similar to the effective support of the search trajectory offers a new grid-based search techniques.

  • PDF

A Morphological Analysis Method of Predicting Place-Event Performance by Online News Titles (온라인 뉴스 제목 분석을 통한 특정 장소 이벤트 성과 예측을 위한 형태소 분석 방법)

  • Choi, Sukjae;Lee, Jaewoong;Kwon, Ohbyung
    • The Journal of Society for e-Business Studies
    • /
    • v.21 no.1
    • /
    • pp.15-32
    • /
    • 2016
  • Online news on the Internet, as published open data, contain facts or opinions about a specific affair and hence influences considerably on the decisions of the general publics who are interested in a particular issue. Therefore, we can predict the people's choices related with the issue by analyzing a large number of related internet news. This study aims to propose a text analysis methodto predict the outcomes of events that take place in a specific place. We used topics of the news articles because the topics contains more essential text than the news articles. Moreover, when it comes to mobile environment, people tend to rely more on the news topics before clicking into the news articles. We collected the titles of news articles and divided them into the learning and evaluation data set. Morphemes are extracted and their polarity values are identified with the learning data. Then we analyzed the sensitivity of the entire articles. As a result, the prediction success rate was 70.6% and it showed a clear difference with other analytical methods to compare. Derived prediction information will be helpful in determining the expected demand of goods when preparing the event.

Website Monitoring on the Behavior of Consumers for Educational Pet Insects (애완학습곤충 소비자의 행동 모니터링)

  • Kim, So Yun;Kim, Seong Hyun;Choi, Won Ho;Park, Jong Bin;Park, Hae Chul;Lee, Young Bo;Kim, Namjung
    • Korean journal of applied entomology
    • /
    • v.52 no.4
    • /
    • pp.335-340
    • /
    • 2013
  • As the market of educational pet insects is expanding, understanding the consumer needs became more crucial. To achieve the ideal analysis on the market, this research monitored the behavior of consumers. The posting on the blogs of consumers, who have visited insect museums and farms, or have bought insects were collected as data. Moreover, the informational contents, photographs and texts, were analyzed. The results showed that the family-unit visitors with elementary school lower graders were the main type of visitors for their children's education. The visiting areas were concentrated in Seoul and the Metropolitans of Gyeonggi province, and the visits were mostly occurred during their children's vacation period. The analysis of posted photographs showed the visitors' high interest in the hands-on program. According to the texts on visitors' blogs, especially, the largest number of visitors satisfied with the variety of program. It implies the necessity of development in diverse and differentiated hands-on program. Otherwise, the programs available to connect insects to other animals and plants should be introduced to reduce aversion against insects, which was reported as the strongest dissatisfaction. In conclusion, diversification on insect species and development in systematized hands-on program seem to be required for the continuous growth of educational pet insects market.