• Title/Summary/Keyword: response database

Chatbot Design Method Using Hybrid Word Vector Expression Model Based on Real Telemarketing Data

  • Zhang, Jie;Zhang, Jianing;Ma, Shuhao;Yang, Jie;Gui, Guan
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.4 / pp.1400-1418 / 2020
  • In commercial promotion, the chatbot is regarded as one of the significant applications of natural language processing (NLP). Conventional design methods use the bag-of-words (BOW) model alone, based on the Google database and other online corpora. For one thing, in the bag-of-words model the word vectors are independent of one another. Although this method suits discrete features, it does not help a machine understand continuous statements, because the connections between words are lost in the encoded word vectors. For another, existing methods are tested on state-of-the-art online corpora but are hard to apply to real applications such as telemarketing data. In this paper, we propose an improved chatbot design method that uses a hybrid of the bag-of-words model and the skip-gram model, based on real telemarketing data. Specifically, we first collect real data in the telemarketing field and perform data cleaning and data classification on the constructed corpus. Second, words are represented with the hybrid bag-of-words and skip-gram model. The skip-gram model maps synonyms to nearby points in the vector space. Because the correlation between words is expressed, the word vector carries more information, which makes up for the shortcomings of using the bag-of-words model alone. Third, we use the term frequency-inverse document frequency (TF-IDF) weighting method to increase the weight of key words, and then output the final word representation. Finally, the answer is produced by a hybrid of a retrieval model and a generative model. The retrieval model can accurately answer in-domain questions. The generative model supplements it by answering open-domain questions, where the final reply is produced by long short-term memory (LSTM) training and prediction. Experimental results show that the hybrid word vector expression model improves the accuracy of the response and that the whole system can communicate with humans.
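
The TF-IDF weighting step in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration: the three-sentence corpus, the two-dimensional `toy_embeddings` standing in for skip-gram vectors, and the smoothed IDF formula. The abstract does not say how the bag-of-words and skip-gram representations are actually combined, so this shows only TF-IDF-weighted averaging of word vectors.

```python
import math
from collections import Counter


def tfidf_weights(docs):
    """Return one {term: tf-idf weight} dict per tokenized, non-empty document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            # Relative term frequency times a smoothed IDF (an assumed variant).
            term: (count / len(doc)) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return weights


def weighted_sentence_vector(doc_weights, embeddings):
    """TF-IDF-weighted average of the word vectors available for one document."""
    dim = len(next(iter(embeddings.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for term, weight in doc_weights.items():
        if term in embeddings:  # terms without a vector are skipped
            for i, x in enumerate(embeddings[term]):
                total[i] += weight * x
            weight_sum += weight
    return [x / weight_sum for x in total] if weight_sum else total


# Hypothetical telemarketing-style utterances and toy 2-D word vectors.
docs = [["cancel", "my", "plan"], ["upgrade", "my", "plan"], ["my", "order"]]
toy_embeddings = {"cancel": [1.0, 0.0], "plan": [0.0, 1.0]}

weights = tfidf_weights(docs)
sentence_vector = weighted_sentence_vector(weights[0], toy_embeddings)
print(weights[0])
print(sentence_vector)
```

In the first utterance, "cancel" appears in fewer documents than "plan" or "my", so it receives the largest weight and pulls the sentence vector toward its own word vector.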

A gene expression database for the molecular pharmacology of cancer

  • Scherf, Uwe;Ross, Douglas-T.;Waltham, Mark;Smith, Lawrence-H.;Lee, Jae-K.;Tanbe, Lorraine;Kohn, Kurt-W.;Reinhold, William-C.;Mayers, Timothy-G.;Andrews, Darren-T.;Scudiero, Dominic-A.;Eisen, Michael-B.;Sausville, Edward-A.;Pommier, Yves;Botstein, David;Brown, Patrick-O.;Weinstein, John-N.
    • Proceedings of the Korean Society for Bioinformatics Conference / 2001.08a / pp.129-137 / 2001
  • We used cDNA microarrays to assess gene expression profiles in 60 human cancer cell lines used in a drug discovery screen by the National Cancer Institute. Using these data, we linked bioinformatics and chemoinformatics by correlating gene expression and drug activity patterns in the NCI60 lines. Clustering the cell lines on the basis of gene expression yielded relationships very different from those obtained by clustering the cell lines on the basis of their response to drugs. Gene-drug relationships for the clinical agents 5-fluorouracil and L-asparaginase exemplify how variations in the transcript levels of particular genes relate to mechanisms of drug sensitivity and resistance. This is the first study to integrate large databases on gene expression and molecular pharmacology.
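
Correlating a gene's expression pattern with a drug's activity pattern across the same cell lines can be sketched as follows. The Pearson coefficient is an assumption (the abstract does not name the coefficient used), and the values for `gene_a_expression` and the two drug profiles are made up for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical values, one entry per cell line (five lines instead of 60).
gene_a_expression = [1.0, 2.0, 3.0, 4.0, 5.0]
drug_x_activity = [2.0, 4.0, 6.0, 8.0, 10.0]   # rises with expression
drug_y_activity = [5.0, 4.0, 3.0, 2.0, 1.0]    # falls with expression

r_x = pearson(gene_a_expression, drug_x_activity)
r_y = pearson(gene_a_expression, drug_y_activity)
print(r_x, r_y)
```

Repeating this for every gene-drug pair gives a gene-by-drug correlation matrix, which is one plausible way to link the two kinds of data.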

Gene Expression Profiling in Diethylnitrosamine Treated Mouse Liver: From Pathological Data to Microarray Analysis (Diethylnitrosamine 처리 후 병리학적 결과를 기초로 한 마우스 간에서의 유전자 발현 분석)

  • Kim, Ji-Young;Yoon, Seok-Joo;Park, Han-Jin;Kim, Yong-Bum;Cho, Jae-Woo;Koh, Woo-Suk;Lee, Michael
    • Toxicological Research / v.23 no.1 / pp.55-63 / 2007
  • Diethylnitrosamine (DEN) is a nitrosamine compound that can induce a variety of liver lesions, including hepatic carcinoma, by forming DNA-carcinogen adducts. In the present study, microarray analyses were performed with the Affymetrix Murine Genome 430A Array in order to identify the gene expression profiles for DEN and to provide valuable information for the evaluation of potential hepatotoxicity. C57BL/6NCrj mice were orally administered DEN once at doses of 0, 3, 7 and 20 mg/kg. The liver from each animal was removed 2, 4, 8 and 24 h after administration. The histopathological analysis and serum biochemical analysis showed no significant difference in the DEN-treated groups compared to the control group. In contrast, the principal component analysis (PCA) profiles demonstrated that the normal gene expression profile in the control groups differed clearly from the expression profiles of the DEN-treated groups. Within groups, little variance was found between individuals. Student's t-test was performed on the results obtained from triplicate hybridizations to identify genes with statistically significant changes in expression. Statistical analysis revealed that 11 genes were significantly downregulated and 28 genes were upregulated in all three animals 2 h after treatment at 20 mg/kg. The upregulated group included genes encoding Gdf15, JunD1, and Mdm2, while genes including Sox6, Shmt2, and Slc6a6 were largely downregulated. Hierarchical clustering of gene expression also allowed the identification of functionally related clusters that encode proteins related to metabolism and the MAPK signaling pathway. Taken together, this study suggests that a match with a toxicant signature can assign a putative mechanism of action to a test compound, provided that a database containing response patterns to various toxic compounds is established.
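
As a rough illustration of a Student's t-test on triplicate measurements, the sketch below computes the pooled-variance two-sample t statistic for one gene. The two-sample form with equal variances is an assumption (the abstract does not specify the variant), and the expression values are invented.

```python
import math
from statistics import mean, variance


def student_t(sample_a, sample_b):
    """Pooled-variance two-sample t statistic and its degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a) + (n_b - 1) * variance(sample_b)) / dof
    t_stat = (mean(sample_a) - mean(sample_b)) / math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t_stat, dof


# Hypothetical log-expression values for one gene, three hybridizations each.
control = [1.0, 2.0, 3.0]
treated = [4.0, 5.0, 6.0]

t_stat, dof = student_t(treated, control)
print(t_stat, dof)
```

With 4 degrees of freedom, the two-sided 5% critical value is about 2.776, so a statistic above that would be called significant for this toy gene. A real microarray analysis would also need a correction for testing thousands of genes at once.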

Investigation of Minimum Number of Drop Levels and Test Points for FWD Network-Level Testing Protocol in Iowa Department of Transportation (아이오와 주 교통국의 FWD 네트워크 레벨 조사 프로토콜을 위한 최소 하중 재하 수와 조사지점 수의 결정)

  • Kim, Yong-Joo;Lee, Ho-Sin(David);Omundson, Jason S.
    • International Journal of Highway Engineering / v.12 no.4 / pp.39-46 / 2010
  • In 2007, the Iowa Department of Transportation (DOT) began running falling weight deflectometer (FWD) network-level testing along Iowa's highway and road systems to build a comprehensive database of deflection data and subsequent structural analysis, which are used for detecting pavement structure failure, estimating expected life, and calculating overlay requirements over a desired design life. Iowa's current FWD network-level testing protocol requires that pavements be tested at three drop levels, with an 8-deflection basin collected at each drop level. The number of test points is determined by the length of the tested pavement section. However, the current FWD network-level program can cover only about 20% of Iowa's highway and road systems annually. Therefore, the current FWD network-level test protocol should be simplified so that more than 20% of Iowa's highway and road systems can be tested annually. The main objective of this research is to investigate whether the number of drop levels and test points can be reduced to increase the testing production rate and reduce the cost of testing and traffic control without sacrificing the quality of the FWD data. Based on the limited FWD network-level test data from eighty-three composite pavement sections, there was no significant difference between the mean values of three different response parameters when the number of drop levels and test points was reduced from that of the current FWD network-level testing protocol. As a result, the production rate of FWD tests would increase and the cost of testing and traffic control would decrease without sacrificing the quality of the FWD data.

Greedy Heuristic Algorithm for the Optimal Location Allocation of Pickup Points: Application to the Metropolitan Seoul Subway System (Pickup Point 최적입지선정을 위한 Greedy Heuristic Algorithm 개발 및 적용: 서울 대도시권 지하철 시스템을 대상으로)

  • Park, Jong-Soo;Lee, Keum-Sook
    • Journal of the Economic Geographical Society of Korea / v.14 no.2 / pp.116-128 / 2011
  • Some subway passengers may want to pick up fresh vegetables purchased over the internet at a service facility within a station of the Metropolitan Seoul subway system on their way home, which raises the questions of which stations should be chosen to host service facilities and how many passengers can use them. This problem is well known as the pickup problem, and it can be solved on a traffic network whose traffic flows are identified from origin stations to destination stations. Since the flows of subway passengers can be found from the smart card transaction database of the Metropolitan Seoul smart card system, the pickup problem in the Metropolitan Seoul subway system is to select subway stations for the service facilities such that the captured passenger flows are maximized. In this paper, we formulate a model of the pickup problem on the Metropolitan Seoul subway system with subway passenger flows, and propose a fast heuristic algorithm that, in each step, selects the pickup station capturing the most passenger flow from an origin-destination matrix that represents the passenger flows. We apply the heuristic algorithm to select pickup stations from a large traffic network, the Metropolitan Seoul subway system, with about 400 subway stations and five million passenger transactions daily. We obtain the experimental results with fast response times and display the top 10 pickup stations on a subway guide map. In addition, a few supplementary experiments show that the resulting solution is nearly optimal.
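
The step-by-step selection described in this abstract can be sketched as a greedy loop over an origin-destination table. This is not the authors' algorithm: the capture rule (a flow counts as captured when its origin or destination is a chosen station), the station names, and the flow values are all assumptions, and the paper may well count intermediate stations along a route too.

```python
def greedy_pickup_stations(od_flows, k):
    """Pick up to k stations, each time taking the one that captures the most remaining flow.

    od_flows maps (origin, destination) pairs to passenger counts.
    """
    remaining = dict(od_flows)
    chosen = []
    captured_total = 0
    for _ in range(k):
        # Flow each station would newly capture among the not-yet-captured pairs.
        gain = {}
        for (origin, destination), flow in remaining.items():
            for station in {origin, destination}:
                gain[station] = gain.get(station, 0) + flow
        if not gain:
            break  # every flow is already captured
        # Sorting first makes ties resolve to the alphabetically first station.
        best = max(sorted(gain), key=gain.get)
        chosen.append(best)
        captured_total += gain[best]
        # Drop the flows that the new station captures.
        remaining = {pair: flow for pair, flow in remaining.items() if best not in pair}
    return chosen, captured_total


# Hypothetical origin-destination flows between four stations.
od_flows = {
    ("A", "B"): 50,
    ("A", "C"): 30,
    ("B", "C"): 20,
    ("C", "D"): 40,
    ("D", "A"): 15,
}

stations, captured = greedy_pickup_stations(od_flows, k=2)
print(stations, captured)
```

Removing captured flows after each pick is what keeps later picks from double-counting passengers already served by an earlier station.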

Systematizing and Improving of Spatial Environment Data for the Establishment of Spatial Environment Planning (공간환경계획 수립을 위한환경정보의 체계화와 개선방안)

  • Eum, Jeong-Hee;Choi, Hee-Sun;Lee, Gil-Sang
    • Journal of Environmental Policy / v.9 no.2 / pp.111-133 / 2010
  • Environmental conservation plans, notwithstanding their feasibility and potential utility in the construction of environment-friendly spaces, have long been perceived in practice as "declarative" and a "formality." Such perceptions are largely the result of the failure to provide spatial planning that is directly relevant to the development of the space in question, and to connect sufficiently with urban development plans. This demonstrates the need for ways to link disparate plans, i.e. to enact "spatial environment plans." In response to these issues, this study proposes the systematization of spatial environment data as a necessary prerequisite to the establishment of spatial environment plans, which would both provide linkages with other plans and ensure the applicability of environmental conservation plans. To this end, this study analyzed existing environmental data and then proposed systems for linking them with spatial environment plans. In this respect, the study examined spatial data systems and then classified the applicable spatial data according to each environmental medium. The study also produced spatial information and planning items that can be included in spatial environment plans for each of the nine environmental media, and then constructed a system that could link the existing spatial information system, current spatial environment data, and spatial environment management plans. Furthermore, the study proposed improvements in the construction of spatial environmental data to promote the use of spatial environment plans. The construction of a systematic spatial database, by facilitating the smooth establishment of spatial environment plans, can enhance and upgrade environmental conservation plans while strengthening linkages with related spatial plans.

A Study on Task Allocation of Parallel Spatial Joins using Fixed Grids (고정 그리드를 이용한 병렬 공간 조인의 태스크 할당에 관한 연구)

  • Kim, Jin-Deok;Seo, Yeong-Deok;Hong, Bong-Hui
    • The KIPS Transactions:PartD / v.8D no.4 / pp.347-360 / 2001
  • The most expensive operation in spatial databases is the spatial join, which computes a combined table in which each tuple consists of two tuples, one from each of the two input tables, that satisfy a spatial predicate. Although the execution time of sequential spatial join processing has been considerably improved so far, the response time is still not tolerable because it does not meet the requirements of interactive users. Parallel processing is usually an appropriate way to improve the performance of spatial join processing. However, as the number of processors increases, the efficiency of each processor decreases rapidly because of the disk bottleneck and the overhead of message passing. This paper proposes a task allocation method that eases the disk bottleneck caused by simultaneous access to the shared disk and minimizes message passing among processors. In order to evaluate the performance of the proposed method in terms of the number of disk accesses and the amount of message passing, we conduct experiments on two kinds of parallel spatial join algorithms. The experimental tests on a MIMD parallel machine with shared disks show that the proposed semi-dynamic task allocation method outperforms the static and dynamic task allocation methods.
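
How a fixed grid turns a spatial join into independent tasks can be sketched as follows. The rectangle format `(xmin, ymin, xmax, ymax)`, the cell size, and the sample data are assumptions, and the sketch stops at building per-cell tasks: the semi-dynamic allocation of those tasks to processors is not described in the abstract and is not reproduced here.

```python
from collections import defaultdict


def cells_for(rect, cell_size):
    """Yield every grid cell (column, row) that the rectangle overlaps."""
    xmin, ymin, xmax, ymax = rect
    for col in range(int(xmin // cell_size), int(xmax // cell_size) + 1):
        for row in range(int(ymin // cell_size), int(ymax // cell_size) + 1):
            yield (col, row)


def intersects(a, b):
    """Spatial predicate: do two axis-aligned rectangles overlap or touch?"""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def grid_spatial_join(r_rects, s_rects, cell_size):
    """Return the set of index pairs (i, j) where r_rects[i] intersects s_rects[j]."""
    r_cells, s_cells = defaultdict(list), defaultdict(list)
    for i, rect in enumerate(r_rects):
        for cell in cells_for(rect, cell_size):
            r_cells[cell].append(i)
    for j, rect in enumerate(s_rects):
        for cell in cells_for(rect, cell_size):
            s_cells[cell].append(j)
    results = set()  # a set removes pairs reported by more than one cell
    # Each cell holding objects from both inputs is one independent join task.
    for cell in r_cells.keys() & s_cells.keys():
        for i in r_cells[cell]:
            for j in s_cells[cell]:
                if intersects(r_rects[i], s_rects[j]):
                    results.add((i, j))
    return results


# Hypothetical bounding rectangles for the two input tables.
r_rects = [(0, 0, 2, 2), (5, 5, 6, 6), (3.2, 3.2, 5, 5)]
s_rects = [(1, 1, 3, 3), (8, 8, 9, 9), (3.5, 3.5, 4.5, 4.5)]

pairs = grid_spatial_join(r_rects, s_rects, cell_size=4)
print(sorted(pairs))
```

Objects that span several cells are replicated into each of them, which is why the result is collected in a set. In a parallel setting, the per-cell tasks are the units that a scheduler would hand out to processors.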

Evaluation of Greenhouse Gas Emission for Wooden House Using Simplified Life Cycle Assessment Tool (목조주택 온실가스 배출량 평가를 위한 간이 전과정평가 툴 개발)

  • Chang, Yoon-Seong;Kim, Sejong;Son, Whi-Lim;Jung, Soon-Chul;Shin, Hyun-Kyeong;Shim, Kug-Bo
    • Journal of the Korean Wood Science and Technology / v.45 no.5 / pp.650-660 / 2017
  • In this study, a simplified LCA (life cycle assessment) tool was developed to make LCA of timber construction more accessible and available. The results of the simplified LCA were compared with those of a commercial LCA program (SimaPro 7) to verify its applicability. When environmental impacts were evaluated with the life cycle inventory of all processes, the gap between the commercial LCA program and the simplified LCA tool for timber construction was about 1%. Therefore, the simplified LCA tool can analyse greenhouse gas emissions of timber construction, and its improved user convenience can expand the number of data sets for developing a database of timber construction in Korea. The greenhouse gas reduction effect of timber construction was about 53% of total emissions, offset up to the construction phase. The results of this study would support the decision-making process for expanding timber construction policy and showcase the environmental friendliness of timber construction. The tool is expected to contribute to the forestry sector's response to the new climate regime.

Consumer Characteristics Relating to Business Jacket Practices -Focus on Working Women in the U.S.- (미국 직장여성들의 비지니스 쟈켓 착용과 관련된 소비자 특성 분석)

  • Yoo, Seul-Hee
    • Journal of the Korean Society of Clothing and Textiles / v.30 no.12 s.159 / pp.1649-1660 / 2006
  • In the United States, professional dress codes for working women have changed over time since the 1970s. Considering the changes, from conservative and traditional business uniforms in the 1970s, through business casual in the late 1980s and 1990s, to the current revival of tailored business suits, this study investigated working women's business jacket practices and their association with personal, psycho-social, and physical characteristics. Working women's job satisfaction and corporate culture were also examined in relation to business jacket practices. Research data were collected by mailing surveys to 1,500 randomly selected working women in the United States. Of the 1,500 distributed questionnaires, a total of 312 were returned, of which 265 were deemed usable, yielding a 20.8% response rate. For data analysis, descriptive statistics, such as frequency, percentage distribution, mean scores, and standard deviations, and canonical correlation were used. The respondents ranged in age from 22 to 65. The mean age of the respondents was 44 years (SD=9.63). Most respondents were married (77.4%), working full-time (81.4%), career-oriented (77.2%), Caucasian (89.8%), had at least one child (78.9%), and had a professional job (75.9%). Working women's age, number of children, self-confidence in dressing, perceived importance of clothing, body frame size, and visibility to superiors and the public were positively associated with business jacket practices, while age of first child, family size, dress size, and job satisfaction were negatively associated with business jacket practices.

Characteristics of Input and Output of Scientific Research (국가별 과학연구 투입과 성과의 특성분석)

  • Park, Hyun-Woo;Kim, Kyung-Ho;Yeo, Woon-Dong
    • Journal of Korea Technology Innovation Society / v.12 no.3 / pp.471-498 / 2009
  • The ability to judge a country's scientific standing is vital for the governments and businesses that must decide scientific priorities and funding. In this paper, we analyze the output and outcomes of research investment over recent years, to measure the quality of scientific research on national scales and to set it in an international context. There are many ways to evaluate the quality of scientific research, but few have proved satisfactory. To measure the quantity and quality of science in different nations, we analyzed the numbers of published research papers and their citations. The number of citations per paper is a useful measure of the impact of a nation's research output. Essential data were acquired from the SCI database by Thomson Scientific, which indexes more than 8,000 journals, representing the most significant materials in science and engineering. The purpose of this paper is to evaluate and compare the output and outcomes among nations from a variety of viewpoints and criteria. One implication of the analysis is that sustainable economic development in highly competitive world markets requires direct engagement in the generation of knowledge. Even modest improvements in healthcare, clean water, sanitation, food, and transport need capabilities in engineering, technology, and medicine beyond many countries' reach. Nations exporting natural resources such as gold and oil can import technology and expertise, but only until these resources are exhausted. For them, sustainability should imply investment in alternative agricultural and technological capabilities through improvements in their skills base. A strong science base does not necessarily lead to wealth generation. However, strength in science has additional benefits for individual nations, and for the world as a whole.
