• Title/Summary/Keyword: 텍스트 연구

Search Result 3,492, Processing Time 0.034 seconds

A Study on the Performance Improvement of Rocchio Classifier with Term Weighting Methods (용어 가중치부여 기법을 이용한 로치오 분류기의 성능 향상에 관한 연구)

  • Kim, Pan-Jun
    • Journal of the Korean Society for information Management
    • /
    • v.25 no.1
    • /
    • pp.211-233
    • /
    • 2008
  • This study examines various weighting methods for improving the performance of automatic classification based on Rocchio algorithm on two collections(LISA, Reuters-21578). First, three factors for weighting are identified as document factor, document factor, category factor for each weighting schemes, the performance of each was investigated. Second, the performance of combined weighting methods between the single schemes were examined. As a result, for the single schemes based on each factor, category-factor-based schemes showed the best performance, document set-factor-based schemes the second, and document-factor-based schemes the worst. For the combined weighting schemes, the schemes(idf*cat) which combine document set factor with category factor show better performance than the combined schemes(tf*cat or ltf*cat) which combine document factor with category factor as well as the common schemes (tfidf or ltfidf) that combining document factor with document set factor. However, according to the results of comparing the single weighting schemes with combined weighting schemes in the view of the collections, while category-factor-based schemes(cat only) perform best on LISA, the combined schemes(idf*cat) which combine document set factor with category factor showed best performance on the Reuters-21578. Therefore for the practical application of the weighting methods, it needs careful consideration of the categories in a collection for automatic classification.

'Elderly image' Analysis Using Big Data and Social Networking Techniques (빅데이터와 사회연결망 기법을 이용한 '노인 이미지' 분석)

  • Han, Sun-Bo;Lee, Hyun-Sim
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.11
    • /
    • pp.253-263
    • /
    • 2016
  • We analyzed the social issue 'image of the elderly' using Big Data and Social Network Analysis. First, we analyzed the words extracted by the text mining technique by inputting the keyword 'elderly'. As a result of analysis, the image of the elderly viewed through media such as cafes, blogs, etc. Representing the trend of the public was using the word 'Senior' the most. The image of the elderly is expressed using the word having the highest frequency in the top 10, "The elderly are 'Senior' people who are respected by society, they are organized to earn money, to earn their qualifications, to health, and to 'Seniors' who desire to work healthy up to 100 years old". The purpose of this study is to differentiate from the existing analysis method by analyzing the macro-level image of the elderly including the social discourse by collecting vast amount of data and analyzing it with the social networking technique. When the image of the elderly that the public perceives is positively expressed as 'Senior', it can be said that the direction of the current elderly policy is evaluated as a desirable direction. On the other hand, it was able to feel the 'desire' of the public who wanted to be evaluated. Therefore, the policy direction of the elderly to be applied in the future should be the policy that enables the elderly to be perceived as 'Necessary existence' in society by taking on social roles. In addition, we proposed to implement the policy of the elderly that reflects priorities such as job creation, welfare, and alienation that can activity and maintain health.

Design of Heterogeneous Content Linkage Method by Analyzing Genbank (Genbank 분석을 통한 이종의 콘텐츠 연계 방안 설계)

  • Ahn, Bu-Young;Lee, Myung-Sun;Kim, Ji-Young;Oh, Chung-Shick
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.6
    • /
    • pp.49-54
    • /
    • 2010
  • As information on gene sequences is not only diverse but also extremely huge in volume, high-performance computer and information technology techniques are required to build and analyze gene sequence databases. This has given rise to the discipline of bioinformatics, a field of research where computers are utilized to collect, to manage, to save, to evaluate, and to analyze biological data. In line with such continued development in bioinformatics, the Korea Institute of Science and Technology Information (KISTI) has built an infrastructure for the biological information, based on the information technology, and provided the information for researchers of bioscience. This paper analyzes the reference fields of Genbank, the most frequently used gene database by the global researchers among the life information databases, and proposes the interface method to NDSL which is the science and technology information integrated service provided by KISTI. For these, after collecting Genbank data from NCBI FTP site, we rebuilt the database by separating Genbank text files into the basic gene data and the reference data. So new tables are generated through extracting the paper and patent information from Genbank reference fields. Then we suggest the method of connection with the paper DB and the patent DB operated by KISTI.

실시간 MP3 파일 검색 엔진을 위한 지원 시스템의 설계와 구현

  • 김우진;최문기
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2000.04a
    • /
    • pp.307-316
    • /
    • 2000
  • MP3(MPEG 1 layer 3) 파일 형식(file format)은 최근 높은 압축율과 뛰어난 음질 복원 능력으로 주목을 받고 있다. 실제로 MP3의 압축율은 CD의 약 50분의 1 정도이고 음질은 CD 음질을 동일한 수준으로 유지할 수 있다.한편, 이러한 MP3의 장점 때문에 web을 통해 MP3 파일을 찾으려는 수요는 폭발적으로 증가하고 있지만 기존의 검색 엔진들이 가지고 있는 프로세스는 급속하게 update되고 있는 MP3 컨텐츠에 효과적으로 대응하지 못하고 있는 실정이다. 특히, 기존의 검색 엔진들은 미디어 파일을 위한 검색이 아닌 문자 기반의 검색 기능을 위해 개발되어 MP3 검색에는 부적절하거나, 파일 중심이 아닌 사이트 중심의 링크 변동에 대하여 수동적인 업데이트만을 수행하여 빠른 변화에 능동적으로 대응하기 어려운 경우가 많다.현재 미디어 파일을 위한 검색 엔진들은 여럿 서비스 중이지만, 텍스트 중심의 탐색 방법을 사용하고, 정기적인 DB update 방법에 관해서도 문자 기반의 검색 엔진과 동일한 방법을 사용하고 있다. 또한, 국내에서는 web 서비스를 위한 미디어 파일 탐색 알고리즘과 지능형 탐색 방법에 등에 관한 연구 역시 거의 전무한 상태이다.본 논문은 MP3 파일 전문 검색을 위한 지능형 프로세스를 설계와 구현 결과에 관한 것으로, 기존의 미디어 검색 엔진들이 가지는 문제점을 지적하고 보다 효율적이고 능동적인 미디어 파일 탐색을 위한 방법을 제시한다. 특히, MP3 파일에 대한 미디어 파일 검증 알고리즘과 verification method을 제안하고, 이러한 메커니즘에 따라 구현된 지능형 robot과 spider 등으로 구성된, 신뢰성 있고 지능적인 MP3 검색 엔진 지원 시스템의 설계와 구현 결과 그리고 성능 등을 종합적으로 요약한다.실어증 환자들은 화시적 대명사를 조응적 대명사보다 더 잘 처리하는 동일한 결과를 보였다. 이러한 실험 결과들은 실어증 환자들이 뇌손상으로 인해 문법적 언어처리에는 어려움을 보이지만 비언어적인, 세상 지식과 관련된 화시적 대명사의 처리는 가능할 것이라는 가설을 뒷받침 해준다. 또한 이러한 실험 결과를 통해 대명사의 기능적인 측면에서 화시와 조응의 처리가 구분되어 있음을 보여준다.l mechanism is concentrate on only the reaction zone. As strain rate and CO2 quantity increase, NO production is remarkably augmented.our 10%를 대용한 것이 무첨가한 것보다 많이 단단해졌음을 알 수 있었다. 혼합중의 반죽의 조사형 전자현미경 관찰로 amarans flour로 대체한 gluten이 단단해졌음을 알수 있었다. 유화제 stearly 칼슘, 혹은 hemicellulase를 amarans 10% 대체한 밀가루에 첨가하면 확연히 비용적을 증대시킬 수 있다는 사실을 알 수 있었다. quinoa는 명아주과 Chenopodium에 속하고 페루, 볼리비아 등의 고산지에서 재배 되어지는 것을 시료로 사용하였다. quinoa 분말은 중량의 5-20%을 quinoa를 대체하고 더욱이 분말중량에 대하여 0-200ppm의 lipase를 lipid(밀가루의 2-3배)에 대하여 품질개량제로서 이용했다. 그 결과 quinoa 대량 7.5%에서 비용적, gas cell이 가장 긍정적 결과를 산출했고 반죽의 조직구조가 강화되었다. 또 quinoa 대체에 의해 전분-지질 복합제의 흡열량이 증대된 것으로부터 전분-지질복합제의 형성 촉진이 시사되었다.이것으로 인하여 호화억제에 의한 노화 방지효과가 기대되었지만

  • PDF

Complexity Metrics for Analysis Classes in the Unified Software Development Process (Unified Process의 분석 클래스에 대한 복잡도 척도)

  • 김유경;박재년
    • The KIPS Transactions:PartD
    • /
    • v.8D no.1
    • /
    • pp.71-80
    • /
    • 2001
  • Object-Oriented (OO) methodology to use the concept like encapsulation, inheritance, polymorphism, and message passing demands metrics that are different from structured methodology. There are many studies for OO software metrics such as program complexity or design metrics. But the metrics for the analysis class need to decrease the complexity in the analysis phase so that greatly reduce the effort and the cost of system development. In this paper, we propose new metrics to measure the complexity of analysis classes which draw out in the analysis phase based on Unified Process. By the collaboration complexity, is denoted by CC, we mean the maximum number of the collaborations can be achieved with each of the collaborator and detennine the potential complexity. And the interface complexity, is denoted by IC, shows the difficulty related to understand the interface of collaborators each other. We prove mathematically that the suggested metrics satisfy OO characteristics such as class size and inheritance. And we verify it theoretically for Weyuker' s nine properties. Moreover, we show the computation results for analysis classes of the system which automatically respond to questions of the it's user using the text mining technique. As we compared CC and IC to CBO and WMC, the complexity can be represented by CC and IC more than CBO and WMC. We expect to develop the cost-effective OO software by reviewing the complexity of analysis classes in the first stage of SDLC (Software Development Life Cycle).

  • PDF

A Study on a Real Time Presentation Method for Playing of a Multimedia mail on Internet (인터넷상의 동영상 메일을 재생하기 위한 실시간 연출 기법 연구)

  • Im, Yeong-Hwan;Lee, Seon-Hye
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.4
    • /
    • pp.877-890
    • /
    • 1999
  • In this paper, a multimedia mail including video, sound, graphic data has been proposed as the next generation mail of the text based mail. In order to develop the multimedia mail, the most outstanding problem is the fact that the multimedia data are too huge to send them to the receiving end directly. The fact of big data may cause many problems in both transferring and storing the data of the multimedia mail. Our main idea is to separate between a control program for the multimedia presentation and multimedia data. Since the size of a control program is as small as a plain text mail, it has no problem to send it attached to the internet mail to the receiver directly. Instead, the big multimedia data themselves may remain on the sender's computer or be sent to a designated server so that the data may be transferred to the receiver only when the receiver activates the play of the multimedia mail. In this scheme, our research focus is paced on the buffer management and the thread scheduling for the real time play of the multimedia mail on internet. Another problem is to provide an easy way of editing a multimedia presentation for an ordinary people having no programming knowledge. For the purposed, VIP(Visual Interface Player) has been used and the results or multimedia mail implemented on LAN has been described.

  • PDF

A study of expressing social agenda in feature film (Focusing on the Coen brother's film "A big lebowski (1998)) (상업 영화 속 사회의제 표현에 대한 분석 (코엔형제의 영화 "위대한 레보스키(1998)"를 중심으로))

  • Lee, Tae-hoon
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.399-406
    • /
    • 2017
  • Contrary to the fact that the old films contain artistic and include contemporary literature, religion, and philosophy, latest films are produced with focusing on external interesting composition and sensational scene. A good movie emotionally express the directors' topic message exuding from an interesting story, and empathize with the social agenda which shows a sharp look of the directors' on contemporary social aspect. In the movies of the Coen brothers, it seems like an entertainment movie as typical black comedy genre through irony and happening, but in fact, it inserts a lot of social problems in the film to show that they cynically express their social agenda from a contemplative view. In their movie "The Big Lebowski (1998)", it seems like they are creating comical content through the main characters' unaffected attitude. However, it is director's excellent director of the sub-text that expresses American social issues such as Vietnam war, post-modernism and an obscurantist policy and au fond the comedy about the historical facts of mass production of social maladjustment into black comedy. We expect to contribute to make a step forward in the Korea film industry by analyzing such movies that has the cultural power of influence.

The Effects of Contextual Error-Correction Feedback on Learners' Academic Achievement io Web Courseware for Learning Productivity S/W (Productivity S/W 학습용 웹 코스웨어에서 상황맥락적 오류교정 패드백이 학업성취도에 미치는 영향)

  • Kim, Do-Yun;Bae, Young-Kwon;Baek, Jang-Hyeon;Lee, Tae-Wuk
    • The Journal of Korean Association of Computer Education
    • /
    • v.7 no.1
    • /
    • pp.141-149
    • /
    • 2004
  • Today there are many Web courseware systems for formative evaluation and feedback. Formative evaluation and feedback provided according to users' response in most Web courseware systems, however, are simple texts showing only whether correct or wrong, correct answers, relevant information, etc., far deviated from actual context. Thus such a system may weaken the corrective function of feedback and, as a result, reduce learners' understanding of contents and the possibility of learning transfer. In addition, according to the learning theory of constructivism, learning is influenced by the situation, in which it happens, and knowledge is learned and transferred differently depending on the context in which it is learned. In the background, this study designed and implemented a contextual error-correction feedback system that can provide feedback in a context closely related and similar to the relevant situation according to the response of learners when formative evaluation is carried out in Web courseware. In addition, it applied 'correction/correct-answer-providing feedback', 'relevant information providing feedback' and 'contextual error-correction feedback' to Web courseware for learning actual productivity S/W and verified if 'contextual error-correction feedback' is more effective than other two types of feedback for learners' academic achievement.

  • PDF

A Critical Analysis of and Its Implications ("나꼼수현상"이 그려내는 문화정치의 명암: 권력-대항적인 정치시사콘텐츠의 함의를 맥락화하기)

  • Lee, Kee-Hyeung;Lee, Young-Joo;Hwang, Kyong-Ah;Chae, Zi-Yeon;Cheon, Hye-Young;Kwon, Sook-Young
    • Korean journal of communication and information
    • /
    • v.58
    • /
    • pp.74-105
    • /
    • 2012
  • $I$ $am$ $a$ $Weasel$ > is a radically different communicative form in several ways. It innovatively utilizes podcast, a kind of internet radio format while dealing actively with thorny political issues and scandals in much direct and challenging fashion. Also this program adopts politically-charged parody, sharp critique of current socio-political issues, as well as lively dialogues through which the program provides both acute political awareness and entertainment. As a new kind of talk show and an alternative media form, this program has gained much popularity and attention since its appearance. Considering the fact that the journalistic fields and public spheres are in disarray through the government intervention and wrought with fierce partisanship and political polarization, the role of this program needs to be examined both cautiously and contextually. This study aims to shed some lights on the multifaceted and much contentious role of $I$ $am$ $a$ $Weasel$ > through a textual reading and discourse analysis, as well as email interviews.

  • PDF

A Rhetoric of Naming in Korean Newspapers: A Socio-Constructive Meaning of the 'Split of National Opinion' As an Ultimate Term (한국 신문 속 명명하기의 수사학: 승부수 언어(ultimate term)로서의 '국론 분열'의 사회구성적 의미)

  • NamGung, Eun-Jeong;Shin, Seong-Gene;Lee, In-Hee
    • Korean journal of communication and information
    • /
    • v.43
    • /
    • pp.314-358
    • /
    • 2008
  • This study examined how the meaning of news stories covering the split of national opinion was constructed in the media to represent social conflicts. To clarify the function of the term 'split of national opinion' as an ultimate term, this study examined the meaning of the term in the context of both text and society. Ten newspapers were included in the content analysis. The frequency of words used for the purpose of metaphor and equivalent in describing the split of national opinion was calculated to determine their meaning in the textual context. The frequency of incidents and subjects involved in allegedly causing the split of national opinion was calculated to determine their meaning in the social context. The results of this study are summarized as follows: First, the term 'split of national opinion' was coined by the newspapers as a metaphor of disease, disaster, and cost. The attitudes or the ways in which the split of national opinion was dealt with were generally negative and passive. Second, the term 'split of national opinion' was dealt with an equivalent status of such terms as national policy, national loss, societal problems, and ideology. Third, each newspaper reported that the split of national opinion had been caused by certain subjects, which indicates that each newspaper had its own position of viewing who was the key player in splitting the national opinion. The implication was also discussed that the use of the ultimate term would incur the unbalance of power between participants and the existing players, which would make individuals or groups who were involved in the social actions excluded and make the newspapers exercise the rhetorical power as news media.

  • PDF