• Title/Summary/Keyword: Web-System


The Brassica rapa Tissue-specific EST Database (배추의 조직 특이적 발현유전자 데이터베이스)

  • Yu, Hee-Ju;Park, Sin-Gi;Oh, Mi-Jin;Hwang, Hyun-Ju;Kim, Nam-Shin;Chung, Hee;Sohn, Seong-Han;Park, Beom-Seok;Mun, Jeong-Hwan
    • Horticultural Science & Technology / v.29 no.6 / pp.633-640 / 2011
  • Brassica rapa is an A-genome model species for Brassica crop genetics, genomics, and breeding. With the sequencing of the B. rapa genome complete, functional analysis of the genome is the next task. Expressed sequence tags (ESTs) are fundamental resources supporting annotation and functional analysis of the genome, including identification of tissue-specific genes and promoters. As of July 2011, 147,217 ESTs from 39 cDNA libraries of B. rapa had been reported in the public database. However, little information can be retrieved from the sequences because organized databases are lacking. To leverage the sequence information and maximize the use of publicly available EST collections, the Brassica rapa tissue-specific EST database (BrTED) was developed. BrTED includes sequence information for 23,962 unigenes assembled with the StackPack program. The unigene set is used as the query unit for various analyses, such as BLAST against the TAIR gene models, functional annotation using MIPS and UniProt, gene ontology analysis, and prediction of tissue-specific unigene sets based on statistical tests. The database is composed of two main units: an EST sequence processing and information retrieval unit, and a tissue-specific expression profile analysis unit. Information and data in both units are tightly interconnected through a web-based browsing system. In an RT-PCR evaluation of 29 selected unigene sets, amplicons were successfully amplified from the target tissues of B. rapa. BrTED allows users to identify and analyze the expression of genes of interest and aids efforts to interpret the B. rapa genome through functional genomics. In addition, it can serve as a public resource providing reference information for studying the genus Brassica and other closely related cruciferous crops.

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.24 no.3 / pp.21-44 / 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have produced massive amounts of text data. These data are produced and distributed through various media platforms such as the World Wide Web, Internet news feeds, microblogs, and social media. However, this enormous amount of easily obtained information lacks organization. This problem has drawn the interest of many researchers seeking to manage such volumes of information, and it also calls for professionals capable of classifying relevant information; hence text classification was introduced. Text classification is a challenging task in modern data analysis, in which a text document must be assigned to one or more predefined categories or classes. Various techniques are available in this field, such as K-Nearest Neighbor, Naïve Bayes, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, when dealing with huge amounts of text data, model performance and accuracy become a challenge. The performance of a text classification model can vary with the type of words used in the corpus and the type of features created for classification. Most previous attempts have proposed a new algorithm or modified an existing one, and this line of research can be said to have reached its limits for further improvement. In this study, rather than proposing a new algorithm or modifying an existing one, we focus on finding a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of the training data on which the classifier is built. Real-world datasets usually contain noise, and this noisy data can affect the decisions made by classifiers built from them.
In this study, we consider that data from different domains, that is, heterogeneous data, may have noise-like characteristics that can be utilized in the classification process. Machine learning algorithms build a classifier on the assumption that the characteristics of the training data and the target data are the same or very similar. However, in the case of unstructured data such as text, the features are determined by the vocabulary included in the documents. If the viewpoints of the training data and the target data differ, the features may also differ between the two. In this study, we attempt to improve classification accuracy by strengthening the robustness of the document classifier through artificially injecting noise into the process of constructing it. Data coming from various kinds of sources are likely to be formatted differently. This causes difficulties for traditional machine learning algorithms, because they were not developed to recognize different types of data representation at one time and to combine them in the same generalization. Therefore, in order to utilize heterogeneous data in the learning process of the document classifier, we apply semi-supervised learning. However, unlabeled data may degrade the performance of the document classifier. We therefore propose a method called the Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA), which selects only the documents that contribute to improving the accuracy of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules are selected and applied for the final decision making.
In this paper, three different types of real-world data sources were used: news, Twitter, and blogs.

End-use analysis of household water by metering (가정용수의 용도별 사용량 조사 및 원단위 분석)

  • Kim, Hwa-Soo;Lee, Doo-Jin;Kim, Ju-Whan;Kim, Jung-Hyun;Jung, Kwan-Soo
    • Proceedings of the Korea Water Resources Association Conference / 2008.05a / pp.869-877 / 2008
  • The purpose of this study is to investigate, by metering, the trends and patterns of the various kinds of water use in households in Korea. Water use components are classified as toilet, washbowl, bathing, laundry, kitchen, etc. Flow meters were installed in 146 households selected by sampling from all around Korea. The data were gathered by a web-based data collection system from 2002 to 2006, together with pre-investigated data such as occupation, income, number of family members, housing type, age, floor area, water-saving devices, education, etc. Reliable data were selected by the upper fence method for each observed water use component, and statistical characteristics were estimated for each residential type to determine liters per capita per day ($\ell\mathrm{pcd}$). Estimated indoor domestic water use ranges from $150\,\ell\mathrm{pcd}$ to $169\,\ell\mathrm{pcd}$ depending on housing type, in the order of high-rise apartment, multi-house, and single house. Ranked by amount consumed, the toilet ($38.5\,\ell\mathrm{pcd}$) is first, followed by laundry ($30.8\,\ell\mathrm{pcd}$), kitchen ($28.4\,\ell\mathrm{pcd}$), bathtub ($24.7\,\ell\mathrm{pcd}$), and washbowl ($15.4\,\ell\mathrm{pcd}$). The results are compared with water uses in the U.K. and the U.S. As lifestyles have become more Western, the pattern of water use in Korea tends to resemble that of the U.S. Compared with the survey results of Bradley (1985), total use has increased by 30 liters with economic development, and a slight change in the water use pattern can be found. In particular, in the 1985 survey toilet water accounted for almost half of total water use and laundry water was the lowest at 11%. This study, however, shows that toilet water has decreased to 39 liters (28%) owing to the spread of water-saving devices and conservation campaigns.
It is supposed that the spread of large washing machines has reduced hand laundering and increased laundry water use. The unit water use of each household end use can be applied as a design factor for water and wastewater facilities, and it can serve as input for water demand forecasting and conservation policy.
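
The abstract does not spell out its "upper fence method". The following is a minimal sketch that assumes the common Tukey definition (third quartile plus 1.5 times the interquartile range) and then averages the retained readings to get a per-capita daily figure. The function names and the sample readings are hypothetical and are not taken from the study.

```python
from statistics import mean, quantiles


def upper_fence(values, k=1.5):
    """Tukey's upper fence: Q3 + k * IQR (k = 1.5 is the usual convention)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 + k * (q3 - q1)


def mean_lpcd(daily_litres_per_person):
    """Mean litres per capita per day after dropping readings above the fence."""
    fence = upper_fence(daily_litres_per_person)
    kept = [v for v in daily_litres_per_person if v <= fence]
    return mean(kept), fence


# Hypothetical toilet readings (litres per person per day);
# 120.0 is a deliberate outlier that the fence should reject.
readings = [35.0, 38.0, 40.0, 37.0, 39.0, 41.0, 36.0, 120.0]
avg, fence = mean_lpcd(readings)
print(f"upper fence = {fence}, mean of retained readings = {avg}")
```

Only the upper fence is applied here, matching the wording of the abstract; a lower fence could be added in the same way if unusually low readings also need screening.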

A Study on the Characteristics of Jobs in Academic Libraries According to Different Generations (대학도서관 업무의 시대별 변천에 따른 특성 연구)

  • Cho, Chul-Hyun
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.26 no.1 / pp.135-170 / 2015
  • This study aimed to investigate the transition of academic libraries' jobs by developing a model based on the shift of library generations (Library 1.0, Library 2.0, and Library 3.0) corresponding to the shift of web generations, and to explore the generational characteristics of library duties. The research proceeded in three sequential phases: a literature review of the different library generations; job analyses of academic libraries in South Korea and the U.S.A.; and the Delphi technique. The research findings were as follows. First, 170 duties continued from Library 1.0 to Library 3.0. Fifty-eight duties continued from Library 2.0 to Library 3.0, whereas three duties continued from Library 1.0 to Library 2.0. In addition, three distinctive duties existed only in Library 1.0, and one unique duty existed only in Library 2.0. Library 3.0 generated 25 new duties. Second, considering the general characteristics that cover specific parts of individual duties, there was a significant increase in the importance, difficulty, and frequency of library administration throughout the three generations. In terms of the importance, difficulty, and frequency of collection development and management, there was a significant increase only from Library 2.0 to Library 3.0. For information organization, there was a significant decrease in importance from Library 1.0 to Library 2.0. In addition, there was a significant decrease in frequency, and there was no significant difference in difficulty throughout the three generations. In the case of information service, while there was a significant increase in importance across the three generations, there was a significant increase in difficulty only from Library 1.0 to Library 2.0. However, there was no generational difference in frequency.
With respect to information system development and management, there was a significant increase in importance and frequency throughout the three generations, but there was no significant difference in difficulty among the three generations.

A Study on the Strategic Use of an IMC Planning Model for the Distribution Industry (유통업 IMC 기획모델의 전략적 활용에 관한 연구)

  • Mo, Sun-Jong;Song, In-Am
    • Journal of Global Scholars of Marketing Science / v.18 no.2 / pp.113-145 / 2008
  • Marketing in the distribution industry continues to evolve with changes in customers, the competitive environment, and the internal marketing environment. Integrated marketing communication (IMC) activities are required to make market-oriented activities more efficient. In this study, IMC is defined as "a notion that a market-oriented business integrates marketing communication means, conducting and evaluating marketing activities with consistent messages in order to communicate with customers based on databases." An IMC planning model for improving marketing efficiency in the distribution industry was derived from a pilot study. The model may be broken down into the following phases: IMC goal setting, situational analysis (customer analysis, competition analysis, and company analysis), customer data analysis, contact management, budgeting, the establishment of an IMC strategy, the IMC mix and execution, an evaluation system, and feedback. In consideration of the characteristics of the distribution industry, this study was accompanied by a vocational study of the IMC means employed, in particular, by department stores and other distributors: advertising, sales promotion, sales promotion advertising, direct marketing, public relations, personal selling, the Internet, mobile, visual merchandising, and word of mouth. In addition, this study covered the correlation among variables such as the IMC activities of distributors, the process of forming customers' brand attitudes, brand loyalty, and repurchase intention. This research should enhance the utilization of IMC. Analyzing customers' brand attitudes toward the IMC activities of distributors requires simultaneous consideration of how those attitudes are linked to purchase and of customers' attitudes toward both distributors and stores.
The formation of brand loyalty and repurchase intention is related to the integration of marketing communication and the maintenance of consistency in content, which requires integrated brand communication (IBC) strategies. IBC is a concept of using IMC means to manage the brand in a continuing and consistent manner and of measuring their effect; it is a process that establishes an enterprise-level brand identity and maximizes brand loyalty and repurchase intention by integrating IMC means. For the empirical analysis, an online questionnaire survey was conducted among department store customers in their 20s to 50s who reside in the Seoul or Gyeonggi areas and have made purchases at department stores. The research model consisted of four theoretical variables, IMC activities, IMC attitudes, brand loyalty, and repurchase intention, on which a pilot study was conducted. A number of hypotheses were constructed on the relations between IMC activities and IMC attitudes, between IMC attitudes and repurchase intention, and between brand loyalty and repurchase intention. The tests of the hypotheses may be summarized as follows. First, the test of the hypothesis concerning the relation between IMC attitudes and IMC activities (advertising, sales promotion, direct marketing, public relations, personal selling, the Web, mobile, visual merchandising, and word of mouth) indicates that advertising, sales promotion, direct marketing, public relations, personal selling, mobile, visual merchandising, and word of mouth have a significant impact on IMC attitudes. Beyond confirming the finding of previous studies that word of mouth, advertising, personal selling, and sales promotion play particularly important roles, a notable finding of this study is that visual merchandising performed by department stores has a very significant impact on IMC attitudes.
It is also noteworthy that the Internet marketing activities of department stores are not shown to have a significant impact on IMC attitudes. Second, the test of the hypothesis on the relation between IMC attitudes and brand loyalty attests that IMC attitudes in the distribution industry significantly affect brand loyalty. Third, the test of the hypothesis concerning the relation between IMC attitudes and repurchase intention confirms that IMC attitudes in the distribution industry significantly affect repurchase intention. Fourth, the test of the hypothesis concerning the relation between brand loyalty and repurchase intention indicates that brand loyalty significantly affects repurchase intention. Taken together, these findings point to the conclusion that IMC activities in the distribution industry do affect IMC attitudes, brand loyalty, and repurchase intention.

Effect of Highly Concentrated Turbid Water on the Water Quality and Periphytic Diatom Community in Artificial Channel (인공수로에서 고농도 탁수가 수질 및 부착 규조류 군집에 미치는 영향)

  • Yoon, Sung-Ae;You, Kyung-A;Park, Ji-Hyoung;Kim, Baik-Ho;Hwang, Soon-Jin
    • Korean Journal of Ecology and Environment / v.44 no.1 / pp.75-84 / 2011
  • We examined the effect of turbid water on the periphytic diatom community in an artificial stream system. The artificial stream was constructed of transparent acrylic and composed of four channels. Each channel ($20\,\mathrm{cm} \times 200\,\mathrm{cm} \times 40\,\mathrm{cm}$) was supplied continuously with eutrophic lake water. To allow diatoms to colonize and grow freely, artificial substrates made of commercial glass slides soaked in 1% agar were installed. Prior to introducing turbid water, the artificial stream was operated with lake water for 6 days to permit the propagation of the diatom community on the substrates. Turbid water, prepared with sediment sieved through a $\varphi\,64\,\mu\mathrm{m}$ mesh at $2\,\mathrm{g\,L^{-1}}$ (final concentration, 300 NTU), was provided daily for 50 minutes. The experiment was conducted for 7 days under controlled conditions of light ($50$ to $80\,\mu\mathrm{mol\,m^{-2}\,s^{-1}}$, light:dark = 24:0), temperature ($10 \pm 1\,^{\circ}\mathrm{C}$), and flow rate ($0.31\,\mathrm{cm\,s^{-1}}$). Sampling and analysis of water quality and diatoms were conducted daily. Turbidity of the water varied between 162.2 and 173.2 NTU during the experiment. After the introduction of turbid water, DO, pH, and TN decreased, while SS and TP increased significantly. A total of 14 genera and 47 species of diatoms were observed on the artificial substrates during the experimental period. Of these, Navicula was the most dominant genus with 10 species, followed by Cymbella (6 species), Fragilaria (6 species), and Gomphonema (5 species). Achnanthes minutissima was the most dominant species (>70% of total frequency) in both control and treatment experiments. Diatom abundance increased for three days after the introduction of turbid water and then gradually decreased until the end of the experiment.
These results suggest that a frequent supply of highly concentrated turbid water significantly decreases the periphytic diatom community and retards the recovery of a stable food web within the stream.

Discussions about Expanded Fests of Cartoons and Multimedia Comics as Visual Culture: With a Focus on New Technologies (비주얼 컬처로서 만화영상의 확장된 장(場, fest)에 대한 논의: 뉴 테크놀로지를 중심으로)

  • Lee, Hwa-Ja;Kim, Se-Jong
    • Cartoon and Animation Studies / s.28 / pp.1-25 / 2012
  • The rapid digitalization of all aspects of society since 1990 led to the digitalization of cartoons. As the medium of cartoons moved from paper to the web, a powerful visual culture emerged. The encounter between cartoons and multimedia technologies has helped cartoons evolve into a video culture. Today cartoons are no longer a literate culture. It is critical to pay attention to cartoons as an "expanded fest" and as a visual and video culture with much broader significance. In this paper, the investigator set out to diagnose the current position of cartoons in the rapidly changing digital age and to discuss the future directions they should pursue. She thus discussed cases of change from 1990, when colleges began to provide specialized education in cartoons and animation, to the present day, when cartoon and Multimedia Comics fests exist alongside the digitalization of cartoons. The encounter between new technologies and cartoons broke down the conventional forms of cartoons. In particular, the appearance of large numbers of artists who made active use of new technologies in their works has facilitated changes to the content and forms of cartoons and the expansion of character uses. The development of high technology extends its influence beyond the artists' works to the roles of appreciators. Today readers actively voice their opinions about works, build fan bases, promote the works and artists they favor, and help them rise to stardom. As artist groups of various genres formed, the possibilities of new stories and texts and the appearance of diverse styles and world views have expanded the essence of cartoon texts and the overall cartoon system of culture, industry, education, institutions, and technology. It is expected that cartoons and Multimedia Comics will continue to serve as a messenger that reflects the culture of the next generation, mediates it, and communicates with it.
Today there is no longer a distinction between print and video cartoons. Cartoons will expand into every field through a wide range of forms and styles, given current developments such as installation concept cartoons, blockbuster digital videos, fancy items, and narrative-based characters at theme parks. It is therefore necessary to diversify cartoon and Multimedia Comics education. Today educators face the task of bringing up future generations of talent who can lead a culture of the overall senses grounded in literate and video culture, by incorporating humanities, social studies, and new technology education into their creative artistic abilities.

Correlation of Consumer Evaluation on Restaurants in Social Network System (SNS) with Food Hygiene (식품접객업소에 대한 사회관계망서비스(SNS) 상의 소비자 평가와 위생상태의 연관성 분석)

  • Kim, Kyungmi;Kim, Sejeong;Lee, Soomin;Lee, Jeeyeon;Lee, Heeyoung;Choi, Yukyung;Yoon, Yohan
    • Journal of the East Asian Society of Dietary Life / v.27 no.4 / pp.473-476 / 2017
  • Social network services (SNS) play an important role in the food service industry: consumers evaluate restaurants on SNS, and other consumers consult these reputations. It was assumed that restaurants with bad reputations could have poor food hygiene. Therefore, this study evaluated the relation between reputations on SNS and food hygiene. Restaurants were searched using web portals, and 12 restaurants (six with good and six with bad reputations) were selected. Microbiological analysis (total aerobic bacteria, coliform, and Escherichia coli) was performed on main and side dishes. Detection frequencies for total aerobic bacteria did not differ between good and bad restaurants. However, bad restaurants had a higher detection frequency for coliform (70.8%; mean of 3.2 log CFU/g) than good restaurants (62.5%; mean of 2.3 log CFU/g). In addition, bad restaurants had a higher detection frequency for E. coli (25%; mean of 0.8 log CFU/g) than good restaurants (8.3%; mean of 0.5 log CFU/g). This result indicates that consumer reputations on SNS are related to food hygiene, and that the reputation data can be used for food hygiene inspection by food safety agencies.

Story-based Information Retrieval (스토리 기반의 정보 검색 연구)

  • You, Eun-Soon;Park, Seung-Bo
    • Journal of Intelligence and Information Systems / v.19 no.4 / pp.81-96 / 2013
  • Video information retrieval has become a very important issue because of the explosive increase in video data from Web content development. Meanwhile, content-based video analysis using visual features has been the main source for video information retrieval and browsing. Content in video can be represented with content-based analysis techniques, which can extract various features from audio-visual data such as frames, shots, colors, texture, or shape. Moreover, similarity between videos can be measured through content-based analysis. However, a movie, which is a typical type of video data, is organized by story as well as by audio-visual data. When content-based video analysis using only low-level audio-visual data is applied to information retrieval for movies, this causes a semantic gap between the significant information recognized by people and the information resulting from content-based analysis. The reason for this semantic gap is that the story line of a movie is high-level information, with relationships in the content that change as the movie progresses. Information retrieval related to the story line of a movie cannot be carried out by content-based analysis techniques alone. A formal model is needed that can determine relationships among movie contents, or track changes in meaning, in order to accurately retrieve the story information. Recently, story-based video analysis techniques using a social network concept have emerged for story information retrieval. These approaches represent a story by using the relationships between characters in a movie, but they have problems. First, they do not express dynamic changes in relationships between characters according to story development. Second, they miss profound information, such as emotions indicating the identities and psychological states of the characters. Emotion is essential to understanding a character's motivation, conflict, and resolution.
Third, they do not take account of the events and background that contribute to the story. This paper therefore reviews the importance and weaknesses of previous video analysis methods, ranging from content-based approaches to story analysis based on social networks. We also suggest necessary elements, such as character, background, and events, based on narrative structures introduced in the literature. First, we extract characters' emotional words from the script of the movie Pretty Woman by using the hierarchical attribute of WordNet, which is an extensive English thesaurus. WordNet offers relationships between words (e.g., synonyms, hypernyms, hyponyms, antonyms). We present a method to visualize the emotional pattern of a character over time. Second, a character's inner nature must be predetermined in order to model a character arc that can depict the character's growth and development. To this end, we analyze the amount of the character's dialogue in the script and track the character's inner nature using social network concepts, such as in-degree (incoming links) and out-degree (outgoing links). Additionally, we propose a method that can track a character's inner nature by tracing indices such as degree, in-degree, and out-degree of the character network in a movie through its progression. Finally, the spatial background where characters meet and where events take place is an important element in the story. We take advantage of the movie script to extract significant spatial background and suggest a scene map describing spatial arrangements and distances in the movie. Important places where main characters first meet or where they stay for long periods of time can be extracted through this scene map. In view of the aforementioned three elements (character, event, background), we extract a variety of information related to the story and evaluate the performance of the proposed method.
We can track story information extracted over time and detect a change in the character's emotion or inner nature, spatial movement, and conflicts and resolutions in the story.
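
The in-degree and out-degree tracking described in this abstract can be illustrated with a small sketch. It assumes, as a simplification not stated in the source, that each line of dialogue is one directed link from speaker to listener and that degrees accumulate scene by scene. The function name, the tuple layout, and the toy characters "A", "B", and "C" are all hypothetical.

```python
from collections import defaultdict


def degree_timeline(dialogues):
    """Track cumulative in-degree and out-degree per character.

    dialogues: iterable of (scene, speaker, listener) tuples in script order.
    Returns {character: [(scene, in_degree, out_degree), ...]}.
    """
    in_deg = defaultdict(int)
    out_deg = defaultdict(int)
    timeline = defaultdict(list)
    for scene, speaker, listener in dialogues:
        out_deg[speaker] += 1   # outgoing link: the speaker addresses someone
        in_deg[listener] += 1   # incoming link: the listener is addressed
        for who in (speaker, listener):
            timeline[who].append((scene, in_deg[who], out_deg[who]))
    return dict(timeline)


# Toy script with placeholder characters, not data from the paper.
script = [(1, "A", "B"), (1, "B", "A"), (2, "A", "B"), (3, "B", "C")]
timeline = degree_timeline(script)
for character, points in timeline.items():
    print(character, points)
```

Plotting each character's list against the scene number would give one possible view of how a character's position in the network shifts as the story progresses.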

Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality (지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법)

  • Choi, Sukjae;Lee, Jungwon;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems / v.23 no.3 / pp.119-138 / 2017
  • Recently, SNS has become an important channel for marketing as well as personal communication. However, cybercrime has also evolved with the development of information and communication technology, and illegal advertising is distributed on SNS in large quantities. As a result, personal information is lost and monetary damage occurs more frequently. In this study, we propose a method to analyze which sentences and documents posted to SNS are related to financial fraud. First, as a conceptual framework, we developed a matrix of the conceptual characteristics of cybercriminality on SNS and emergency management. We also suggested an emergency management process consisting of Pre-Cybercriminality (e.g., risk identification) and Post-Cybercriminality steps. Among these, we focus on risk identification in this paper. The main process consists of data collection, preprocessing, and analysis. First, we selected two words, 'daechul (loan)' and 'sachae (private loan)', as seed words and collected data containing these words from SNS such as Twitter. The collected data were given to two researchers, who decided whether they were related to cybercriminality, particularly financial fraud. We then selected some of the vocabulary as keywords if the words were nominals or symbols. With the selected keywords, we searched web materials such as Twitter, news, and blogs, and collected more than 820,000 articles. The collected articles were refined through preprocessing and made into learning data. The preprocessing is divided into a morphological analysis step, a stop word removal step, and a valid part-of-speech selection step. In the morphological analysis step, a complex sentence is transformed into morpheme units to enable mechanical analysis. In the stop word removal step, non-lexical elements such as numbers, punctuation marks, and double spaces are removed from the text.
In the valid part-of-speech selection step, only two kinds of elements, nouns and symbols, are considered. Since nouns refer to things, they express the intent of a message better than other parts of speech. Moreover, the more illegal the text is, the more frequently symbols are used. To turn the selected data into learning data, each item must be classified as legitimate or not, so each item is labeled 'legal' or 'illegal'. The processed data are then converted into a corpus and a Document-Term Matrix. Finally, the 'legal' and 'illegal' files were mixed and randomly divided into a learning data set and a test data set. In this study, we set the learning data at 70% and the test data at 30%. SVM was used as the discrimination algorithm. Since SVM requires gamma and cost values as its main parameters, we set gamma to 0.5 and cost to 10, based on the optimal value function. The cost is set higher than in general cases. To show the feasibility of the proposed idea, we compared the proposed method with MLE (Maximum Likelihood Estimation), Term Frequency, and Collective Intelligence methods. Overall accuracy was used as the metric. The overall accuracy of the proposed method was 92.41% for illegal loan advertisements and 77.75% for illegal visit sales, which is superior to that of Term Frequency, MLE, etc. Hence, the result suggests that the proposed method is valid and practically usable. In this paper, we propose a framework for managing crises caused by abnormalities in unstructured data sources such as SNS. We hope this study will contribute to academia by identifying what to consider when applying an SVM-like discrimination algorithm to text analysis. Moreover, the study will also contribute to practitioners in the fields of brand management and opinion mining.
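
Parts of this pipeline can be sketched with the standard library alone. The sketch below covers only the removal of numbers, punctuation, and repeated spaces, the construction of a document-term matrix, and the random 70/30 split. It omits morphological analysis and part-of-speech selection, and it strips the symbols that the paper keeps as features. All function names and sample strings are hypothetical.

```python
import random
import re
from collections import Counter


def clean(text):
    """Drop digits and punctuation, then collapse repeated whitespace."""
    text = re.sub(r"[0-9]+", " ", text)
    text = re.sub(r"[^\w\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip().lower()


def document_term_matrix(docs):
    """Return (vocabulary, rows), where rows[i][j] counts term j in document i."""
    tokens = [clean(d).split() for d in docs]
    vocab = sorted({t for doc in tokens for t in doc})
    rows = []
    for doc in tokens:
        counts = Counter(doc)
        rows.append([counts[t] for t in vocab])
    return vocab, rows


def split_70_30(items, seed=0):
    """Shuffle a copy of items and split it 70% learning / 30% test."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = (len(shuffled) * 7) // 10
    return shuffled[:cut], shuffled[cut:]


# Hypothetical messages, not data from the study.
docs = ["Loan 100% approved!!", "private  loan, loan"]
vocab, rows = document_term_matrix(docs)
learning, test = split_70_30(list(range(10)))
# The rows and their 'legal'/'illegal' labels would then be passed to an SVM
# (gamma 0.5 and cost 10 in the abstract); no SVM library is used here.
print(vocab, rows, len(learning), len(test))
```

A fixed seed keeps the split reproducible; the abstract does not say whether the authors fixed one.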