Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)

  • Bae, Jung-Hwan;Son, Ji-Eun;Song, Min
    • Journal of Intelligence and Information Systems / v.19 no.3 / pp.141-156 / 2013
  • Social media is a representative form of Web 2.0 that shapes users' information behavior by allowing them to produce their own content without any expert skills. In particular, as a new communication medium, it has a profound impact on social change by enabling users to share their opinions and thoughts with both the masses and their acquaintances. Social media data plays a significant role in the emerging Big Data arena. A variety of research areas, such as social network analysis and opinion mining, have therefore paid attention to discovering meaningful information in the vast amounts of data buried in social media. Social media has recently become a main focus of the fields of Information Retrieval and Text Mining because it not only produces massive unstructured textual data in real time but also serves as an influential channel for opinion leading. However, most previous studies have adopted broad-brush and limited approaches, which have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system that captures trends by processing large streaming Twitter datasets in real time. The system offers term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to track changes in topical trends, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets mentioning the candidates' names and the election from Twitter in Korea (http://www.twitter.com/) over one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects societal trends effectively. The system also retrieves the list of terms that co-occur with given query terms. We compare the results of term co-occurrence retrieval using the names of the influential candidates, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. General terms related to the presidential election, such as 'Presidential Election', 'Proclamation in Support', and 'Public opinion poll', appear frequently. The results also show specific terms that differentiate each candidate, such as 'Park Jung Hee' and 'Yuk Young Su' for the query 'Geun Hae Park', 'a single candidacy agreement' and 'Time of voting extension' for the query 'Jae In Moon', and 'a single candidacy agreement' and 'down contract' for the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows the topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic shows one of two types of patterns, a rising tendency or a falling tendency, depending on the change in its probability distribution. To determine the relationship between topic trends on Twitter and social issues in the real world, we compare topic trends with related news articles. We find that Twitter can track an issue faster than other media such as newspapers. The user network on Twitter differs from those of other social media because of the distinctive way relationships are formed on Twitter: users can form relationships by exchanging mentions. We visualize and analyze mention-based networks of 136,754 users, using the three candidates' names, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. The results show that Twitter users mention all candidates' names regardless of their political tendencies. This case study shows that Twitter could be an effective tool for detecting and predicting dynamic changes in social issues, and that mention-based user networks, which are unique to Twitter, could reveal different aspects of user behavior.
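
The term co-occurrence retrieval described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the system counts co-occurrence, so this assumes the simplest reading: a term co-occurs with the query if both appear in the same tweet. The function name `cooccurring_terms` and the toy tweets are hypothetical and are not data from the study.

```python
from collections import Counter


def cooccurring_terms(tweets, query, top_n=5):
    """Return the terms that most often share a tweet with `query`.

    `tweets` is a list of token lists (assumed to be pre-tokenized).
    Each term is counted at most once per tweet.
    """
    counts = Counter()
    for tokens in tweets:
        unique = set(tokens)
        if query in unique:
            counts.update(unique - {query})
    return counts.most_common(top_n)


# Hypothetical toy tweets, already tokenized (not data from the study).
tweets = [
    ["moon", "election", "poll"],
    ["moon", "election", "candidacy"],
    ["park", "election", "poll"],
    ["ahn", "candidacy"],
]

print(cooccurring_terms(tweets, "moon"))
```

Counting each term once per tweet keeps a single repetitive tweet from dominating the ranking; a production system would more likely weight terms, for example by TF-IDF.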

Linkage Map and Quantitative Trait Loci(QTL) on Pig Chromosome 6 (돼지 염색체 6번의 연관지도 및 양적형질 유전자좌위 탐색)

  • Lee, H.Y.;Choi, B.H.;Kim, T.H.;Park, E.W.;Yoon, D.H.;Lee, H.K.;Jeon, G.J.;Cheong, I.C.;Hong, K.C.
    • Journal of Animal Science and Technology / v.45 no.6 / pp.939-948 / 2003
  • The objective of this study was to identify quantitative trait loci (QTL) for economically important traits such as growth, carcass, and meat quality on pig chromosome 6. A three-generation resource population was constructed from a cross between Korean native boars and Landrace sows. A total of 240 F$_2$ animals were produced by intercrossing 10 boars and 31 sows of the F$_1$ generation. Phenotypic data including body weight at 3 weeks, backfat thickness, muscle pH, shear force, and crude protein level were collected from the F$_2$ animals. Animals including grandparents (F$_0$), parents (F$_1$), and offspring (F$_2$) were genotyped for 29 microsatellite markers and a PCR-RFLP marker on chromosome 6. Linkage analysis was performed using CRI-MAP software version 2.4 (Green et al., 1990) with the FIXED option to obtain the map distances. The total length of the SSC6 linkage map estimated in this study was 169.3 cM, and the average distance between adjacent markers was 6.05 cM. For QTL mapping, we used the F$_2$ QTL Analysis Servlet of QTL Express, a web-based QTL mapping tool (http://qtl.cap.ed.ac.uk). Five QTLs were detected at the 5% chromosome-wide level for body weight at 3 weeks of age, shear force, meat pH at 24 hours after slaughter, backfat thickness, and crude protein level on SSC6.
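
The two map figures in this abstract can be cross-checked with simple arithmetic. This is only a consistency check, assuming the average spacing is the total map length divided by the number of intervals between adjacent mapped markers. The implied marker count is an inference from the reported numbers, not something the abstract states.

```python
total_length_cm = 169.3  # total SSC6 map length reported in the abstract
avg_spacing_cm = 6.05    # reported average distance between adjacent markers

# N markers placed on a linear map define N - 1 intervals.
implied_intervals = total_length_cm / avg_spacing_cm
implied_markers = round(implied_intervals) + 1

print(f"{implied_intervals:.2f} intervals -> about {implied_markers} mapped markers")
```

The result is close to 28 intervals, or about 29 markers on the final map, slightly fewer than the 30 markers genotyped (29 microsatellites plus one PCR-RFLP marker). The abstract does not explain the difference.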

Compilation of 104 Experimental Theses on the Antitumor and Immuno-activating therapies of Oriental Medicine (한의학의 항종양 면역치료에 관한 연구 -1990년 이후 발표된 실험논문을 중심으로-)

  • Kang Yeon Yee;Kim Tai Im;Park Jong Ho;Kim Sung Hoon;Park Jong Dai;Kim Dong Hee
    • Journal of Physiology & Pathology in Korean Medicine / v.17 no.1 / pp.1-24 / 2003
  • This study compiled 104 experimental theses on antitumor and immuno-activating therapies published from February 1990 through February 2002. Master's and doctoral theses were classified by school, degree, materials, effects, experimental methods for antitumor and immunoactivity, and results. The following results were obtained: 1. Classified by school, 34.6% of the theses were presented by Daejeon University, 29.8% by Kyung-hee University, and 11.5% by Won-kwang University. Of all theses, 51.0% were for the doctoral degree and 43.3% were for the master's degree. All three universities have their own cancer centers. 2. Classified by herbal material, complex prescriptions accounted for 60.3%, single herbs for 24.8%, and herbal acupuncture for 14.2%. In keeping with the key principles of traditional medicine, complex prescriptions were studied much more thoroughly than single-herb prescriptions. The results showed that complex prescriptions had both antitumor and immuno-activating activity, which might reflect multi-activation mechanisms arising from their complex components. 3. Classified by the efficacy of the herbs examined, among single herbs, invigorating the spleen and supplementing accounted for 35.5%, expelling toxin and cooling for 29.0%, and activating blood flow and removing blood stasis for 12.9%. In herbal acupuncture, invigorating the spleen and supplementing accounted for 52.9% and expelling toxin and cooling for 29.4%. In complex prescriptions, pathogen-free status accounted for 41.9%, strengthening healthy qi to eliminate pathogens for 35.5%, and strengthening healthy qi for 22.6%. It is presumed that antitumor and immunoactivating therapy based on syndrome differentiation is the best way to develop oriental oncology. 4. Classified by antitumor experiment, cytotoxic effect accounted for 48.1%, survival time for 48.1%, and change in tumor size for 42.3%. Survival rate was not necessarily correlated with cytotoxicity. These data reflect the characteristic, holistic nature of oriental medicine, which is based on BRM (biological response modifier). 5. Classified by immunoactivating experiment, hemolysin titer accounted for 51.0%, hemagglutinin titer for 46.2%, and NK cell activity for 44.2%. In future studies, an effort to elucidate the specific molecular and cellular mechanisms of cytokine production in the body will be crucial. 6. Classified by antitumor activity, 50% of the data were evaluated as good, 24.0% as excellent, and 15.5% as having no effect. In the evaluation of immuno-activating activity, 35.9% were excellent and 18.0% showed a little effect. The index point described here may help in using experimental data for clinical trials. Changes in index points with varying dosage indicate the importance of oriental medical theory for prescription. 7. Across 167 materials, the IIP (immunoactivating index point, mean: 3.12±0.07) was significantly higher than the AIP (antitumor index point, mean: 2.83±0.07). These data demonstrate that the effect of herbal medicine on tumors depends more on immunoactivating activity than on antitumor activity. This further implies that the development of herbal antitumor drugs must be preceded by a mechanistic understanding of the immunoactivating effect. 8. After searching MEDLINE via the NCBI web site for tumor- and herb-related articles, we conclude that most studies focus primarily on biomolecular mechanisms and/or pathways. Henceforth, we need to define the biomolecular mechanisms and/or pathways affected by herbs or complex prescriptions. 9. Therefore, the most important task for oriental medical oncology is to connect experimental results with clinical trials. For the public application of herbal therapy to cancer, it is critical to present the data to the mass media. 10. To develop the relationship between experimental results and clinical trials, each university's cancer clinic must have a long-range plan linked to the university laboratories and, at the same time, a regular consortium for this relationship is imperative. 11. After all these efforts, a new type of herbal medicine for cancer therapy that addresses long-term administration and safety problems must be developed. It would then be expected that anti-tumor herbal acupuncture can improve clinical symptoms and quality of life (QOL) for cancer patients. 12. Finally, an oriental medical cancer center must be established within the NCC (National Cancer Center) or a government agency to develop an internationally competitive oriental medical oncology.

Characteristics of Fish Fauna in the Lower Geum River and Identification of Trophic Guilds using Stable Isotopes Analysis (금강하류의 어류상 및 안정동위원소 분석을 이용한 섭식길드 파악)

  • Yoon, Ju-Duk;Park, Sang-Hyeon;Chang, Kwang-Hyeon;Choi, Jong-Yun;Joo, Gea-Jae;Nam, Gui-Sook;Yoon, Johee;Jang, Min-Ho
    • Korean Journal of Environmental Biology / v.33 no.1 / pp.34-44 / 2015
  • Fish fauna, differences in stable isotope ratios between freshwater and seawater, and trophic guilds of freshwater fishes were investigated in the lower Geum River. The study was conducted in 2011, and the study area covered about 30 km, from 20 km upstream to 10 km downstream of the Geum River estuary barrage. Only freshwater fishes were used for analyzing trophic guilds, and discriminant function analysis (DFA) was used to reclassify trophic guilds based on stable isotope ratios. Fish fauna in the freshwater and seawater areas were entirely different from each other, but a small number of migratory species, such as Coilia nasus and Chelon haematocheilus, occurred in both areas. Other species were collected in only one of the two areas because they lack the physiological ability to adapt to different salinity concentrations. Stable isotope ratios in the two areas differed considerably because of food sources: estuary and seawater fishes consumed food of marine origin, whereas freshwater fishes consumed food of freshwater and terrestrial origin. Some migratory species showed reversed stable isotope ratios: even though they were collected in freshwater, they showed seawater stable isotope ratios. This reflects the ecological characteristics of each species. Trophic guilds of freshwater fishes reclassified by DFA differed slightly from those in the literature. However, because this result is related to ontogenetic shifts of species, more studies are needed to determine the trophic guilds accurately. Stable isotope ratios can vary among regions, seasons, and ontogenetic stages, so these aspects should always be considered when interpreting results.

GWB: An integrated software system for Managing and Analyzing Genomic Sequences (GWB: 유전자 서열 데이터의 관리와 분석을 위한 통합 소프트웨어 시스템)

  • Kim In-Cheol;Jin Hoon
    • Journal of Internet Computing and Services / v.5 no.5 / pp.1-15 / 2004
  • In this paper, we explain the design and implementation of GWB (Gene WorkBench), a web-based, integrated system for efficiently managing and analyzing genomic sequences. Most existing software systems that handle genomic sequences rarely provide both management and analysis facilities. The analysis programs also tend to be unit programs that include only one or a few of the required functions. Moreover, these programs are widely distributed over the Internet and require different execution environments. Because a great deal of manual and conversion work is required to use these programs together, many life science researchers suffer great inconvenience. In order to overcome the problems of existing systems and provide a more convenient one that effectively supports genomic research, this paper integrates both management and analysis facilities into a single system called GWB. The most important issues in the design of GWB are how to integrate many different analysis programs into a single software system and how to provide the data or databases of different formats required to run these programs. To address these issues, GWB integrates different analysis programs by using common input/output interfaces called wrappers, suggests a common format for genomic sequence data, organizes local databases consisting of a relational database and an indexed sequential file, and provides facilities for converting data among several well-known formats and exporting local databases into XML files.
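
The wrapper idea in this abstract, where every analysis program sits behind one common input/output interface and a shared sequence format, can be sketched as follows. This is not GWB's actual design or API: `SequenceRecord`, `parse_fasta`, the `WRAPPERS` registry, and the two toy analyses are hypothetical names chosen for illustration, with FASTA standing in for one of the "well-known formats".

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class SequenceRecord:
    """Hypothetical common format that every wrapped tool consumes."""
    identifier: str
    sequence: str


def parse_fasta(text: str) -> List[SequenceRecord]:
    """Convert FASTA text into the common record format."""
    records = []
    for block in text.strip().split(">")[1:]:
        header, *lines = block.splitlines()
        records.append(SequenceRecord(header.split()[0], "".join(lines)))
    return records


def gc_content(record: SequenceRecord) -> float:
    """Toy analysis: fraction of G and C bases."""
    seq = record.sequence.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def seq_length(record: SequenceRecord) -> float:
    """Toy analysis: sequence length."""
    return float(len(record.sequence))


# Each wrapper exposes the same signature, so new tools plug in uniformly.
WRAPPERS: Dict[str, Callable[[SequenceRecord], float]] = {
    "gc_content": gc_content,
    "length": seq_length,
}


def run_all(records: List[SequenceRecord]) -> Dict[str, Dict[str, float]]:
    """Run every registered wrapper on every record."""
    return {
        r.identifier: {name: tool(r) for name, tool in WRAPPERS.items()}
        for r in records
    }


fasta = ">seq1 demo\nGGCC\nAATT\n>seq2\nAT\n"
print(run_all(parse_fasta(fasta)))
```

Because every tool takes the same record type, adding an analysis means registering one more function instead of writing a new format converter for each pair of programs.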

Occurrence and Molecular Phylogenetic Characteristics of Benthic Sand-dwelling Dinoflagellates in the Intertidal Flat of Dongho, West Coast of Korea (서해안 동호 사질 조간대에 서식하는 저서성 와편모류의 출현양상 및 분자계통학적 특성)

  • KIM, SUNJU;YOON, JIHAE;PARK, MYUNG GIL
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY / v.20 no.3 / pp.141-150 / 2015
  • Dinoflagellates are ubiquitous and important primary producers in the oceans. They have diverse trophic modes, i.e., phototrophic, heterotrophic, and mixotrophic, and thereby play an important ecological role in the marine microbial food web. While many studies have focused on planktonic dinoflagellates in pelagic ecosystems, benthic, sand-dwelling dinoflagellates that inhabit the intertidal zone have been very poorly documented worldwide. We investigated the biodiversity, occurrence, and molecular phylogeny of benthic, sand-dwelling dinoflagellates from the intertidal flat of Dongho, west coast of Korea, monthly during low tide from November 2012 to February 2014. About 27 species of 13 genera in the orders Gonyaulacales, Gymnodiniales, Peridiniales, and Prorocentrales were identified, of which members of the genus Amphidinium constituted a major part of the sand-dwelling dinoflagellates in this area. A total of 34 isolates from 16 species of sand-dwelling dinoflagellates were obtained from Dongho, Mohang, Gamami, and Songho on the west coast and Hyupjae in Jeju, Korea; their 28S rDNA sequences were successfully amplified and used for molecular phylogenetic analyses. In the 28S rDNA phylogeny, Amphidinium species diverged across three major clusters within the order Gymnodiniales and formed a polyphyletic group. Based on the unambiguously aligned partial 28S rDNA sequences including the variable D2 region, the genotypes of the Korean strains of Amphidinium mootonorum differed greatly from that of the Canadian strain, with a pairwise nucleotide difference of 19.2%, suggesting that further ultrastructural studies may provide additional characters to clearly separate these genotypes. Two potentially toxic species, Amphidinium carterae and A. operculatum, appeared occasionally during this study. The quantitative assessment and toxicity of these species should be addressed in the future.

An analysis of the Domestic Interior Materials as the Ecological Design Aspects (친환경측면에서 본 국내 실내건축자재의 현황 조사 및 분석)

  • Chun Jin-Hie;Kim Jung-Ah
    • Archives of design research / v.19 no.4 s.66 / pp.133-144 / 2006
  • According to the latest report by the Customer Protection Board, people who have moved into newly constructed buildings are complaining of unidentified pains and asking for more careful selection of construction materials to prevent such problems. It is internationally recognized today that ecological materials can be a significant factor in users' health, environmental protection, and industrial competitiveness. This study examined the eco-design aspects of each interior material through web site searches, in order to help customers learn about and make proper use of eco materials. The results are as follows. 1. The domestic industry is pushing the release of new eco items that focus on lower VOC emissions or the addition of functional components as part of its marketing strategy. However, it is recommended that companies understand the significance of the life cycle and produce eco-concept materials. 2. The reliable standards for choosing domestic materials are the EL, HB, and GR marks. It is desirable to enhance recycling technologies and expand the customer base for sustainable consumption, since few recycled items have been developed. 3. Sourcing is a vulnerable point with respect to the concept of environment-friendly materials. Therefore, manufacturers should design products that are easy to knock down and produce items using recycled materials instead of new raw materials. Solutions for recovering energy from burning materials should also be studied. 4. A guidebook or manual with correct information about eco-materials is required to promote sustainable production and consumption. 5. Many manufacturers emphasize ecological materials to customers, but some of them hinder customers' proper selection by promoting even unverified items as environment-friendly.

Characteristic of Seasonal Dynamics of Planktonic Ciliates at Four Major Ports (Busan, Ulsan, Gwangyang and Incheon), Korea (한국의 4개 주요항만(부산, 울산, 광양, 인천)에 분포하는 섬모충 플랑크톤의 계절동태 특성)

  • Yang, Seung-Woo;Lee, Joon-Baek;Kim, Young-Ok
    • Korean Journal of Environmental Biology / v.36 no.2 / pp.217-231 / 2018
  • Planktonic ciliates play an important role in the food web of marine ecosystems and also serve as bio-indicators of invasive species introduced via ballast water or by shifts in ocean currents due to climate change. This study sought evidence for the introduction of such invasive species using ciliate plankton in four major international ports of Korea. We surveyed the seasonal species composition of planktonic ciliates at Busan, Ulsan, Gwangyang, and Incheon ports from February 2007 to November 2008. A total of 45 ciliate species belonging to 15 genera were identified during the study period: 33 species occurred at Busan, 31 at Gwangyang, 30 at Ulsan, and 18 at Incheon. The abundance of naked ciliates ranged from 566 to 65,151 cells L$^{-1}$ and that of tintinnids from 10 to 5,973 cells L$^{-1}$. Based on vector species of ciliates reported from Coos Bay, Oregon, 13 vector species of tintinnids were identified: Eutintinnus lususundae, E. tubulosus, Favella ehrenbergii, F. taraikaensis, Helicostomella subulata, Stenosemella nivalis, Tintinnopsis ampla, T. beroidea, T. cylindrica, T. directa, T. lohmanni, T. radix, and T. rapa. All vector species occurred at Gwangyang port. Most tintinnids were neritic species throughout the survey, while warm-water species occurred only for short periods at Busan, Ulsan, and Gwangyang ports, which might be affected seasonally by the Tsushima Warm Current.

Research Direction for Functional Foods Safety (건강기능식품 안전관리 연구방향)

  • Jung, Ki-Hwa
    • Journal of Food Hygiene and Safety / v.25 no.4 / pp.410-417 / 2010
  • Various functional foods, marketed on their health and functional effects, are distributed in the market. Because these products come in the form of foods, tablets, and capsules, they are likely to be mistaken for drugs. In addition, non-experts may sell them as foods or use them for therapy. Efforts have been made to create health food regulations and build a regulatory system to improve the current status of functional foods, but these have not yet been communicated to consumers. As a result, problems such as circulating functional foods for therapy or adding illegal medicinal substances to such products have persisted, and internet media have made them worse. The causes of this problem can be categorized as relating to (1) the product itself and (2) its use, but in either case one possible cause is a lack of communication with consumers. Potential problems that can be caused by functional foods include illegal substances, hazardous substances, allergic reactions, considerations when administered to patients, drug interactions, ingredients with purity or concentrations too low to be detected, products with metabolic activations, health risks from over- or under-dosing of vitamins and minerals, and products with alkaloids (Journal of Health Science, 56, Supplement (2010)). Side effects related to functional foods have been increasing because under-qualified functional food companies exaggerate functionality for marketing purposes. KFDA has been informing consumers about these issues through its web pages, but there is still room for improvement in promoting the proper use of functional foods and avoiding drug interactions. Specifically, institutionalizing the collection of information on approved products and their side effects, establishing reevaluation systems, and standardizing preclinical and clinical tests are becoming urgent. In addition, to provide crucial information, unified database systems that seamlessly aggregate heterogeneous data from different domains, with user interfaces enabling effective one-stop search, are essential.

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

  • Jeong, Hanjo;Park, Byeonghwa
    • Journal of Intelligence and Information Systems / v.21 no.1 / pp.1-13 / 2015
  • As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has been conducted. Social media on the Internet generate unstructured or semi-structured data every second, often written in the natural languages we use in daily life. Many words in human languages have multiple meanings or senses. As a result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, which can return results far from users' intentions. Even though a lot of progress has been made in recent years in improving search engines so that they provide users with appropriate results, there is still much room for improvement. Word sense disambiguation can play a very important role in natural language processing and is considered one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-based, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method that automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries, avoiding expensive sense-tagging processes. It tests the effectiveness of the method with a Naïve Bayes model, a supervised learning algorithm, using the Korean standard unabridged dictionary and the Sejong Corpus. The Korean standard unabridged dictionary has approximately 57,000 sentences. The Sejong Corpus has about 790,000 sentences tagged with both part-of-speech and senses. For the experiment, the Korean standard unabridged dictionary and the Sejong Corpus were evaluated both in combination and as separate entities using cross-validation. Only nouns, the target subjects in word sense disambiguation, were selected. 93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. The Sejong Corpus was easily merged with the Korean standard unabridged dictionary because the Sejong Corpus was tagged based on sense indices defined by the dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added to the named entity dictionary of a Korean morphological analyzer. Using the extended named entity dictionary, term vectors were extracted from the input sentences. Given the extracted term vector and the sense vector model built during the pre-processing stage, the sense-tagged terms were determined by vector space model based word sense disambiguation. In addition, this study shows the effectiveness of the corpus merged from examples in the Korean standard unabridged dictionary and the Sejong Corpus. The experiment shows that better precision and recall are obtained with the merged corpus. This study suggests that the method can practically enhance the performance of internet search engines and help in understanding the meaning of a sentence more accurately in natural language processing pertinent to search engines, opinion mining, and text mining. The Naïve Bayes classifier used in this study is a supervised learning algorithm based on Bayes' theorem. It assumes that all senses are independent. Even though this assumption is not realistic and ignores the correlation between attributes, the Naïve Bayes classifier is widely used because of its simplicity, and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research needs to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence. Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.
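
The core idea of this abstract, training a Naïve Bayes sense classifier on dictionary usage examples instead of a hand-tagged corpus, can be sketched in a few lines. This is a generic multinomial Naïve Bayes with add-one (Laplace) smoothing, not the authors' implementation. The sense labels, the toy English examples, and the function names `train` and `disambiguate` are hypothetical stand-ins for entries from the Korean standard unabridged dictionary.

```python
import math
from collections import Counter, defaultdict


def train(examples):
    """Build count tables from (sense, context_tokens) pairs.

    In the setting described above, each pair would come from a
    dictionary usage example for one sense of the target noun.
    """
    sense_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for sense, tokens in examples:
        sense_counts[sense] += 1
        word_counts[sense].update(tokens)
        vocab.update(tokens)
    return sense_counts, word_counts, vocab


def disambiguate(model, context):
    """Pick the sense with the highest log posterior for `context`."""
    sense_counts, word_counts, vocab = model
    total = sum(sense_counts.values())
    best_sense, best_score = None, -math.inf
    for sense, n in sense_counts.items():
        score = math.log(n / total)  # log prior of the sense
        denom = sum(word_counts[sense].values()) + len(vocab)
        for tok in context:
            # Add-one smoothing keeps unseen words from zeroing the score.
            score += math.log((word_counts[sense][tok] + 1) / denom)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


# Hypothetical dictionary-style examples for the ambiguous noun "bank".
examples = [
    ("bank/finance", ["deposit", "money", "account"]),
    ("bank/river", ["river", "water", "fishing"]),
]
model = train(examples)
print(disambiguate(model, ["money", "deposit"]))
```

Each context word contributes independently to the score, which is the simplifying assumption the abstract notes is unrealistic but often effective in practice.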