• 제목/요약/키워드: 출현빈도

Search Result 993, Processing Time 0.037 seconds

An Efficient Algorithm for Similarity Search in Large Biosequence Database (대용량 유전체를 위한 효율적인 유사성 검색 알고리즘)

  • Jeong, In-Seon;Park, Kyoung-Wook;Lim, Hyeong-Seok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.2
    • /
    • pp.1073-1076
    • /
    • 2005
  • Since the size of biosequence database grows exponentially every year, it becomes impractical to use Smith-Waterman algorithm for exact sequence similarity search. For fast sequence similarity search, researchers have been proposed heuristic methods that use the frequency of characters in subsequences. These methods have the defect that different sequences are treated as the same sequence. Because of using only the frequency of characters, the accuracy of these methods are lower than Smith-Waterman algorithm. In this paper, we propose an algorithm which processes query efficiently by indexing the frequency of characters including the positional information of characters in subsequences. The experiments show that our algorithm improve the accuracy of sequence similarity search approximately 5${\sim}$20% than heuristic algorithms using only the frequency of characters.

  • PDF

Correlation Analysis of Social Sentiment and Stock Prices (사회적 감성과 주가의 상관성 분석)

  • Yun, Hongwon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.7
    • /
    • pp.1593-1598
    • /
    • 2015
  • In this paper, we analyze the correlation between social sentiment and stock prices. Polarity analysis is conducted for the stock prices plunging and soaring duration. And it is performed for its prior period. Using these results, we analyze the relationship between the social sentiment and stock prices. We collected the past data of Dow Jones Industrial Average and detected the period of plunging and soaring. On the basis of the detected time, the New York Times articles are collected and polarity analysis is conducted. Frequency of negative terms is decreased and it of positive terms is increased during the stock prices soaring. There is a little difference between the frequency of negative and positive terms in the previous stock prices plunging or soaring. According to the correlation analysis, it shows a positive correlation between social sentiment and stock prices in the period of plunging and soaring. A significant correlation is not appeared in the previous stock prices plunging or soaring.

The Current Situation of Otter (Lutra lutra) Inhabitation and Conservation Measures in the Bukhan River (북한강수계 수달(Lutra lutra)의 서식실태 및 보호방안)

  • Kang, Jung Hoon;Nam, Taek Woo;Kwon, Kyung Ja;Jung, Sang Yong;Son, Jang Ik;Lee, Seung Hoon;Park, Young Mi;Han, Sung Yong
    • Korean Journal of Heritage: History & Science
    • /
    • v.44 no.2
    • /
    • pp.46-57
    • /
    • 2011
  • The aim of this study was to examine the current situation of otter inhabitation and conservation and to collect basic information for establishing appropriate policies. We conducted the study around the Bukhan river from April to October 2009, mostly focusing on otter distribution, feeding habits, threats, and conservation measures. We divided the study area into 2 sectors: the dam area and the stream. We found 39 spraint sites in the dam area and 70 in the stream area. A significant difference was observed in the number of spraint sites between the upper stream and the lower stream. To evaluate the feeding habit, we collected and analyzed the frequency and bulk occurrence of the spraints. Among the prey items, fish were the most numerous (36.99%) followed by amphibians (17.22%). Fish showed the highest bulk occurrence in the dam area, and the bulk occurrence of amphibians and insects seemed to increase in the stream area. However, the bulk occurrence in the dam area seemed to be lower than that in the stream area (ANOVA, F = 3.99, p < 0.05). Fyke nets and abandoned fishing nets were found to be the most threatening factors. Further research on the systematic management of otters and the use of stop grids is required for better conservation of otters.

Frequency of grammar items for Korean substitution of /u/ for /o/ in the word-final position (어말 위치 /ㅗ/의 /ㅜ/ 대체 현상에 대한 문법 항목별 출현빈도 연구)

  • Yoon, Eunkyung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.33-42
    • /
    • 2020
  • This study identified the substitution of /u/ for /o/ (e.g., pyəllo [pyəllu]) in Korean based on the speech corpus as a function of grammar items. Korean /o/ and /u/ share the vowel feature [+rounded], but are distinguished in terms of tongue height. However, researchers have reported that the merger of Korean /o/ and /u/ is in progress, making them indistinguishable. Thus, in this study, the frequency of the phonetic manifestation /u/ of the underlying form of /o/ for each grammar item was calculated in The Korean Corpus of Spontaneous Speech (Seoul Corpus 2015) which is a large corpus from a total of 40 speakers from Seoul or Gyeonggi-do. It was then confirmed that linking endings, particles, and adverbs ending with /o/ in the word-final position were substituted for /u/ approximately 50% of the stimuli, whereas, in nominal items, they were replaced at a frequency of less than 5%. The high rates of substitution were the special particle "-do[du]" (59.6%) and the linking ending "-go[gu]" (43.5%) among high-frequency items. Observing Korean pronunciation in real life provides deep insight into its theoretical implications in terms of speech recognition.

The Analysis of the Weather Characteristics by Source Region of the Asian Dust Observed in South Korea (한국에 출현한 황사의 발원지별 기상 특성 분석)

  • Kim, Sunyoung;Lee, Seungho
    • Journal of the Korean Geographical Society
    • /
    • v.48 no.2
    • /
    • pp.167-183
    • /
    • 2013
  • This paper aimed to investigate the Asian dust source region and climatic condition of source region by the case of Asian dust in south Korea. In order to analyze the weather condition of source region, observed the Asian dust days data and weather data in China were used. The Asian dust days originating from inner-Mongolia were the most frequent. The Asian dust days originating from all the source regions except Loess plateau were increased recently and occurred over the country. In case of Loess plateau, the frequency of the Asian dust days in 1960s was the highest and only the southern region of the south Korea was mostly affected. The relationship between the Asian dust days of Korea and climatic factors of spring and April of source region was significant. The relationship between the Asian dust days originating from the inner Mongolia and sea level pressure of April and relative humidity of spring was negative. The Asian dust days from Gobi had positive relationship with wind gust days and negative relationship with sea level pressure in April. The Asian dust days from Manchuria had negative relationship with precipitation and sea level pressure in April. The Asian dust days from Loess plateau had positive relationship with maximum wind speed and negative relationship with sea level pressure in April.

  • PDF

Step-by-step Approach for Effective Korean Unknown Word Recognition (한국어 미등록어 인식을 위한 단계별 접근방법)

  • Park, So-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.369-372
    • /
    • 2009
  • Recently, newspapers as well as web documents include many newly coined words such as "mid"(meaning "American drama" since "mi" means "America" in Korean and "d" refers to the "d" of drama) and "anseup"(meaning "pathetic" since "an" and "seup" literally mean eyeballs and moist respectively). However, these words cause a Korean analyzing system's performance to decrease. In order to recognize these unknown word automatically, this paper propose a step-by-step approach consisting of an unknown noun recognition phase based on full text analysis, an unknown verb recognition phase based on web document frequency, and an unknown noun recognition phase based on web document frequency. The proposed approach includes the phase based on full text analysis to recognize accurately the unknown words occurred once and again in a document. Also, the proposed approach includes two phases based on web document frequency to recognize broadly the unknown words occurred once in the document. Besides, the proposed model divides between an unknown noun recognition phase and an unknown verb recognition phase to recognize various unknown words. Experimental results shows that the proposed approach improves precision 1.01% and recall 8.50% as compared with a previous approach.

  • PDF

Co-occurrence Based Drug-disease Relationship Inference with Genes as Mediators (유전자를 중간 매개로 고려한 동시발생 기반의 약물-질병 관계 추론)

  • Shin, Sangwon;Sin, Yeeun;Jang, Giup;Yoo, Youngmi
    • The Journal of Korean Institute of Information Technology
    • /
    • v.16 no.11
    • /
    • pp.1-9
    • /
    • 2018
  • Drug repositioning is to discover new uses of drugs. Text mining derives knowledge from unstructured text. We propose a method to predict new drug-disease relationships by taking into account the rate of frequency of genes simultaneously measured in disease-gene and gene-drug. Co-occurrence of drug-gene and gene-disease in the biological literature is counted and calculate the rate of the gene for each drug and disease. Weights of drug-disease relationships are calculated using the average of the rates of genes that are measured and used to measure the accuracy for each disease. In measuring drug-disease relationships, a more accurate identification of relationships was shown by measuring the frequency on a sentence and considering multiple relationships than existing method.

Study on the Sister Chromatid Exchange Inducibility in Chinese Hamster Don Cell by Metal Compounds in Work Enviroment

  • Seo, Kwang-Seok;Lee, Chong-Sam
    • Journal of Environmental Health Sciences
    • /
    • v.22 no.1
    • /
    • pp.91-98
    • /
    • 1996
  • 산업장이나 생활환경에서 접하기 쉬운 수용성 염화물을 중심으로 19개 원소 24종의 금속화합물이 Chinese Hamster Don 세포에 있어서의 sister chromatid exchange(SCE) 출현빈도에 미치는 영향을 조사하였다. Chinese Hamster Don 세포에 대한 자매염색분체 교환출현빈도의 증가가 $CrO_3, K_2CrO_4, K_2Cr_2O_7, MnCl_2, K_2SeO_3, CH_3HgCl$ (p<0.01), $CoCl_2, Na_2HAsO_4, HgCl_2$ (p<0.05) 9종의 금속화합물에서 나타났으며, dose-response relationships이 현저한 금속화합물은 6가 크로화합물과 $K_2SeO_3$이었다.

  • PDF

Seasonal Changes of the Phytoplankton and the Periphyton Community at the Suer Stream in Kwangyang (전남 광양의 수어천 수역에 있어서 식물플랑크톤과 부착조류 군집의 계절적 변화)

  • Yoon, Sook-Kyung;Lee, Kyung
    • Korean Journal of Ecology and Environment
    • /
    • v.33 no.1 s.89
    • /
    • pp.38-50
    • /
    • 2000
  • Seasonal changes of the phytoplankton and the periphyton community were investigated from August 1998 to April 1999 at five stations at the Suer stream in Kwangyang. A total of 112 species of phytoplankton were identified. Of those, the diatoms were present at all stations but the green algae, the bluegreen algae, and the dioflagellates were present at Station 4 and Station 5 more frequently than the other stations. The phytoplankton standing crops varied from 10,100 cells/1 at Station 4 in April 1999 to 1,489,100 cells/1 at Station 4 in October 1998. The seasonal variation patterns of phytoplankton standing crops were different among stations as well as the pattern of presence. The dominant species were as follows: Achnanthes minutissima, Aulacoseira distans v. alpigena, Cocconeis placentula v. lineata, Cymbella minuta, C. silesiaca, Fragilaria arcus v. recta, Peridinium cinctum, Rhizosofenia longiseta, Synedra rumpens and filamentous algae. Of those, Achnanthes minutissima, Rhizosolenia longiseta, Synedra rumpens and filamentous algae showed the highest rate of occupation in the phytoplankton standing crops during the investigated periods. A total of 99 species of periphyton were identified. Among those, the diatoms of the periphyton community were observed frequently rather than those of the phytoplankton community. The ecological indicator values showed ${\bate}$-mesosaprobous in saprobity and was close to eutraphentic in trophic state. There were no considerable differences between the ecological indicator values by planktonic diatoms and periphytic diatoms.

  • PDF

Effect of Expectancy and Strategy on Emotional Information Processing (정서자극에 대한 빈도와 예상이 주의에 미치는 효과)

  • Choi, Moon-Gee;Nam, Ki-Chun
    • Korean Journal of Cognitive Science
    • /
    • v.17 no.3
    • /
    • pp.177-190
    • /
    • 2006
  • Present study was investigated to measure the influence of expectancy on emotional information processing. For inducing an expectancy for emotional stimulus, participants conducted three same blocks in which negative face was presented for prime in 75% of total trials(Group1) or three same blocks in which positive face was presented in 75% of total trials(Group2). We compared the means of RTs of two blocks conducted after and before these induction blocks. Results exhibited that participants in Group 1 allocated more attention after expectancy induction. This indicate that in normal population, the top-down processing like expectancy can influence emotional processing pattern related to negative information.

  • PDF