• Title/Summary/Keyword: Data Matching

Search Result 1,979, Processing Time 0.032 seconds

A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)

  • Yang, Yunseok;Lee, Hyun Jun;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.25-38
    • /
    • 2019
  • Selecting high-quality information that meets the interests and needs of users among the overflowing contents is becoming more important as the generation continues. In the flood of information, efforts to reflect the intention of the user in the search result better are being tried, rather than recognizing the information request as a simple string. Also, large IT companies such as Google and Microsoft focus on developing knowledge-based technologies including search engines which provide users with satisfaction and convenience. Especially, the finance is one of the fields expected to have the usefulness and potential of text data analysis because it's constantly generating new information, and the earlier the information is, the more valuable it is. Automatic knowledge extraction can be effective in areas where information flow is vast, such as financial sector, and new information continues to emerge. However, there are several practical difficulties faced by automatic knowledge extraction. First, there are difficulties in making corpus from different fields with same algorithm, and it is difficult to extract good quality triple. Second, it becomes more difficult to produce labeled text data by people if the extent and scope of knowledge increases and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, problem definition for automatic knowledge extraction is not easy because of ambiguous conceptual characteristics of knowledge. So, in order to overcome limits described above and improve the semantic performance of stock-related information searching, this study attempts to extract the knowledge entity by using neural tensor network and evaluate the performance of them. Different from other references, the purpose of this study is to extract knowledge entity which is related to individual stock items. Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous researches and to enhance the effectiveness of the model. From these processes, this study has the following three significances. First, A practical and simple automatic knowledge extraction method that can be applied. Second, the possibility of performance evaluation is presented through simple problem definition. Finally, the expressiveness of the knowledge increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and objective performance evaluation method are also presented. The empirical study to confirm the usefulness of the presented model, experts' reports about individual 30 stocks which are top 30 items based on frequency of publication from May 30, 2017 to May 21, 2018 are used. the total number of reports are 5,600, and 3,074 reports, which accounts about 55% of the total, is designated as a training set, and other 45% of reports are designated as a testing set. Before constructing the model, all reports of a training set are classified by stocks, and their entities are extracted using named entity recognition tool which is the KKMA. for each stocks, top 100 entities based on appearance frequency are selected, and become vectorized using one-hot encoding. After that, by using neural tensor network, the same number of score functions as stocks are trained. Thus, if a new entity from a testing set appears, we can try to calculate the score by putting it into every single score function, and the stock of the function with the highest score is predicted as the related item with the entity. To evaluate presented models, we confirm prediction power and determining whether the score functions are well constructed by calculating hit ratio for all reports of testing set. As a result of the empirical study, the presented model shows 69.3% hit accuracy for testing set which consists of 2,526 reports. this hit ratio is meaningfully high despite of some constraints for conducting research. Looking at the prediction performance of the model for each stocks, only 3 stocks, which are LG ELECTRONICS, KiaMtr, and Mando, show extremely low performance than average. this result maybe due to the interference effect with other similar items and generation of new knowledge. In this paper, we propose a methodology to find out key entities or their combinations which are necessary to search related information in accordance with the user's investment intention. Graph data is generated by using only the named entity recognition tool and applied to the neural tensor network without learning corpus or word vectors for the field. From the empirical test, we confirm the effectiveness of the presented model as described above. However, there also exist some limits and things to complement. Representatively, the phenomenon that the model performance is especially bad for only some stocks shows the need for further researches. Finally, through the empirical study, we confirmed that the learning method presented in this study can be used for the purpose of matching the new text information semantically with the related stocks.

Fusion of Gamma and Realistic Imaging (감마영상과 실사영상의 Fusion)

  • Kim, Yun-Cheol;Yu, Yeon-Uk;Seo, Young-Deok;Moon, Jong-Woon;Kim, Yeong-Seok;Won, Woo-Jae;Kim, Seok-Ki
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.14 no.1
    • /
    • pp.78-82
    • /
    • 2010
  • Purpose: Recently, South Korea has seen a rapidly increased incidence of both breast and thyroid cancers. As a result, the I-131 scan and lymphoscintigraphy have been performed more frequently. Although this type of diagnostic imaging is prominent in that visualizes pathological conditions, which is similar to previous nuclear diagnostic imaging techniques, there is not much anatomical information obtained. Accordingly, it has been used in different ways to help find anatomical locations by transmission scan, however the results were unsatisfactory. Therefore, this study aims to realize an imaging technique which shows more anatomical information through the fusion of gamma and realistic imaging. Materials and Methods: We analyzed the data from patients who were examined by the lymphoscintigraphy and I-131 additional scan by Symbia Gamma camera (SIEMENS) in the nuclear medicine department of the National Cancer Center from April to July of 2009. First, we scanned the same location in patients by using a miniature camera (R-2000) in hyVISION. Afterwards, we scanned by gamma camera. The data we obtained was evaluated based on the scanning that measures an agreement of gamma and realistic imaging by the Gamma Ray Tool fusion program. Results: The amount of radiation technicians and patients were exposed was generated during the production process of flood source and applied transmission scan. During this time, the radiation exposure dose of technicians was an average of 14.1743 ${\mu}Sv$, while the radiation exposure dose of patients averaged 0.9037 ${\mu}Sv$. We also confirmed this to matching gamma and realistic markers in fusion imaging. Conclusion: Therefore, we found that we could provide imaging with more anatomical information to clinical doctors by fusion of system of gamma and realistic imaging. This has allowed us to perform an easier method in which to reduce the work process. In addition, we found that the radiation exposure can be reduced from the flood source. Eventually, we hope that this will be applicable in other nuclear medicine studies. Therefore, in order to respect the privacy of patients, this procedure will be performed only after the patient has agreed to the procedure after being given a detailed explanation about the process itself and its advantages.

  • PDF

Study of the UAV for Application Plans and Landscape Analysis (UAV를 이용한 경관분석 및 활용방안에 관한 기초연구)

  • Kim, Seung-Min
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.32 no.3
    • /
    • pp.213-220
    • /
    • 2014
  • This is the study to conduct the topographical analysis using the orthophotographic data from the waypoint flight using the UAV and constructed the system required for the automatic waypoint flight using the multicopter.. The results of the waypoint photographing are as follows. First, result of the waypoint flight over the area of 9.3ha, take time photogrammetry took 40 minutes in total. The multicopter have maintained the certain flight altitude and a constant speed that the accurate photographing was conducted over the waypoint determined by the ground station. Then, the effect of the photogrammetry was checked. Second, attached a digital camera to the multicopter which is lightweight and low in cost compared to the general photogrammetric unmanned airplane and then used it to check its mobility and economy. In addition, the matching of the photo data, and production of DEM and DXF files made it possible to analyze the topography. Third, produced the high resolution orthophoto(2cm) for the inside of the river and found out that the analysis is possible for the changes in vegetation and topography around the river. Fourth, It would be used for the more in-depth research on landscape analysis such as terrain analysis and visibility analysis. This method may be widely used to analyze the various terrains in cities and rivers. It can also be used for the landscape control such as cultural remains and tourist sites as well as the control of the cultural and historical resources such as the visibility analysis for the construction of DSM.

Methodology for Issue-related R&D Keywords Packaging Using Text Mining (텍스트 마이닝 기반의 이슈 관련 R&D 키워드 패키징 방법론)

  • Hyun, Yoonjin;Shun, William Wong Xiu;Kim, Namgyu
    • Journal of Internet Computing and Services
    • /
    • v.16 no.2
    • /
    • pp.57-66
    • /
    • 2015
  • Considerable research efforts are being directed towards analyzing unstructured data such as text files and log files using commercial and noncommercial analytical tools. In particular, researchers are trying to extract meaningful knowledge through text mining in not only business but also many other areas such as politics, economics, and cultural studies. For instance, several studies have examined national pending issues by analyzing large volumes of text on various social issues. However, it is difficult to provide successful information services that can identify R&D documents on specific national pending issues. While users may specify certain keywords relating to national pending issues, they usually fail to retrieve appropriate R&D information primarily due to discrepancies between these terms and the corresponding terms actually used in the R&D documents. Thus, we need an intermediate logic to overcome these discrepancies, also to identify and package appropriate R&D information on specific national pending issues. To address this requirement, three methodologies are proposed in this study-a hybrid methodology for extracting and integrating keywords pertaining to national pending issues, a methodology for packaging R&D information that corresponds to national pending issues, and a methodology for constructing an associative issue network based on relevant R&D information. Data analysis techniques such as text mining, social network analysis, and association rules mining are utilized for establishing these methodologies. As the experiment result, the keyword enhancement rate by the proposed integration methodology reveals to be about 42.8%. For the second objective, three key analyses were conducted and a number of association rules between national pending issue keywords and R&D keywords were derived. The experiment regarding to the third objective, which is issue clustering based on R&D keywords is still in progress and expected to give tangible results in the future.

The Factors Associated with Dental Caries Experience and Oral Hygiens Status in Smoking Adolescents (흡연청소년의 치아우식경험도 및 구강위생 관련요인)

  • Shin, Seon-Haeng;Kim, Myung-Seok
    • Journal of dental hygiene science
    • /
    • v.9 no.5
    • /
    • pp.497-506
    • /
    • 2009
  • This study was conducted to estimate the dental caries experience, oral hygiene status and the factors influencing the dental disease in the smoking adolescents and to provide the baseline data for managing smokers efficiently. We recruited 156 smokers(male: 106, female: 50) in middle, high school students in 5 day Non-smoking program in seoul city and 176 non-smokers(male: 64, female: 112) by matching method for considering sex and age from June 1 to August 31 2009. Data on general characteristics, basic oral health care, smoking factors, self-efficiency, control of oral health, oral health promotion behavior, knowledge of oral health were collected by a questionnaire interview. DMFT index, DT index, MT index, FT index, Plaque index, Calculus index were calculated by the oral examination. The results of this study were as follows. 1. Dental clinic visit(p < 0.05), self-perception of oral health status(p < 0.001), oral health concern (p < 0.01) in non-smoker group were significantly higher than that of smoker group. 2. self-efficiency(p<0.05), oral health promotion behavior(p < 0.05) in non-smoker group were significantly higher than that of smoker group. 3. DT index, Plaque index, Calculus index in non-smoker group was significantly lower than that of smoker group(p < 0.0001). 4. The fewer smoke amount, the lower DT index(p < 0.05), Plaque index(p < 0.01), Calculus index(p < 0.001). 5. It was significant correlated among DT index and self-efficiency, oral health promotion behavior, control of oral health. 6. In multiple regression analysis, oral health promotion behavior, Plaque index was proved as a significant factors related with the degree of dental caries experience in smoking adolescents. In other word, the higher oral health promotion behavior, the lower Plaque index, the fewer DT index.

  • PDF

A Study on the Factors Affecting Health Promoting Lifestyles of Workers in the Small Scale Industries (소형 사업장 근로자들의 건강증진 생활양식에 영향을 미치는 요인)

  • Jang Yong-Nam;Lee Eun-Kyoung;Chong Myong-Soo;Jun Sun-Young;Kim Sang-Deok;Jeoung Jae-Yul;Jahng Doo-Sub;Song Yung-Sun;Lee Ki-Nam
    • Journal of Society of Preventive Korean Medicine
    • /
    • v.5 no.1
    • /
    • pp.10-30
    • /
    • 2001
  • Oriental medicine needs to be armed with theories on health-improvement concept under it and basic data matching its views, in order to participate in the health-improvement service in industrial work places. The Orient medicine health-improvement program defines factors that determine individuals' lifestyle, and provides information and technologies for workers to practice in life. To that end, this research compares and analyzes health-improvement concept and health care, defines relations between individuals' health state and their lifestyle as the basic data needed to perform health-improvement business for workers. 1. The subjects employed for this research is categorized into; by gender, males 52.1% and females 47.9% with no big difference between them; and by age, 20s, 6.1%, 30s. 33.9%, 40s, 34.1%, and 50s, 24.8% with 30-50 accounting for most of it. By marriage status, unmarried represents 7.1%, and married 79.1% with most of them married; by revenue, under one million won represents 3.0%, 1-2 million won 26.4%, 2-2.49 million won 11.2%, above 2.5 million won 11.2%, and 1-2.5 million won a majority. By living location, owned houses represents 65.4%, rented houses 14.7%, monthly-rented 9.5%; and by education, elementary and middle school represent 16.9%, high school and its dropouts 22.6%, and junior college and higher 51.6%, with high school and higher occupying most of the group. 2. By job, office workers and managerial workers represent 12.3%, part-timers 21.0%, manual workers 11.4%, jobless 0.6%, professionals 35.6%, service 0.6%, housewives 8.4%, and equipment/machinery operation/assemblers 10.1%. Of this, jobless and part-timers, totaling three, are dropped from this research. By years worked, 0-3.9 years represents 9.7%, 4-7.9 years 6.7%, 8-14.9 years 18.4%, above 15 years 28.7%, and no respondents 36.5%. 3. The degree of the subjects practicing life-improvement lifestyle, on a scale of 1 to 4, is an average of 2.69, personal relations 3.04, self-realization 2.92, stress management 2.76, nutritional state 2.73, responsibility for health 2.47, and athletic activities 2.18, with personal relations earning the highest points and athletic activities the lowest. As for factors influencing health-improvement lifestyle, there is no significant difference between gender, age, and marriage status. Meanwhile, there is significant difference between revenue, dwelling pattern, education level, etc. That is, higher income-bracket, owned houses, rented houses, monthly-rented houses, and higher-educated, in this order, show higher average in health-enhancement lifestyle. By job, housewives, manual workers, office workers, professionals, equipment/ machinery operation/ assemblers, and part-timers, in this order show higher points, while there is no difference with significance by years worked. 4. Factors that affect health-improvement lifestyle are shown below. Self-realization is influenced by age, marriage status, type of dwellings, and level of education; responsibility for health by type of dwellings; athletic activities by gender and age; nutrition by age, marriage status and type of dwellings; personal relations by marriage status; and stress management by type of dwellings. 5. Areas with high points by job show this: in self-realization, office workers, manual workers, housewives, professionals, equipment/ machinery operation/ assemblers, in this order, show difference with significance; in the area of responsibility for health, manual workers, housewives, equipment/ machinery operation/ assemblers, professionals, office workers and part-timers, in this order, do. In athletic activities, manual workers, housewives, office workers, professionals, equipment/ machinery operation/ assemblers, and part-timers, in this order, show difference with significance; in nutrition, housewives, office workers, manual workers, professionals, equipment/ machinery operation/ assemblers, and part-timers, in this order do; and in stress, housewives, office workers, manual workers, professionals, equipment/ machinery operation/ assemblers, part-timers, in this order do. By years worked, more years showed higher points in the area of responsibility for health and nutrition; in the area of athletic activities, above 15 years, 4-8 years, below 4 years and 8-14 years, in this order, show higher points; and no difference shows in realization, personal relation, and stress area. 6. To look at correlation between overall and divisional health-improvement practice degree, this researcher has analyzed it using Person's correlation coefficient. Self-realization, responsibility for health, athletic activities, nutrition, support for personal relations, and stress management show significant correlation with the sub-divisions, while all health-improvement lifestyle shows significant correlation with the six sub-divisions.

  • PDF

Evaluation of Web Service Similarity Assessment Methods (웹서비스 유사성 평가 방법들의 실험적 평가)

  • Hwang, You-Sub
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.4
    • /
    • pp.1-22
    • /
    • 2009
  • The World Wide Web is transitioning from being a mere collection of documents that contain useful information toward providing a collection of services that perform useful tasks. The emerging Web service technology has been envisioned as the next technological wave and is expected to play an important role in this recent transformation of the Web. By providing interoperable interface standards for application-to-application communication, Web services can be combined with component based software development to promote application interaction and integration both within and across enterprises. To make Web services for service-oriented computing operational, it is important that Web service repositories not only be well-structured but also provide efficient tools for developers to find reusable Web service components that meet their needs. As the potential of Web services for service-oriented computing is being widely recognized, the demand for effective Web service discovery mechanisms is concomitantly growing. A number of techniques for Web service discovery have been proposed, but the discovery challenge has not been satisfactorily addressed. Unfortunately, most existing solutions are either too rudimentary to be useful or too domain dependent to be generalizable. In this paper, we propose a Web service organizing framework that combines clustering techniques with string matching and leverages the semantics of the XML-based service specification in WSDL documents. We believe that this is one of the first attempts at applying data mining techniques in the Web service discovery domain. Our proposed approach has several appealing features : (1) It minimizes the requirement of prior knowledge from both service consumers and publishers; (2) It avoids exploiting domain dependent ontologies; and (3) It is able to visualize the semantic relationships among Web services. We have developed a prototype system based on the proposed framework using an unsupervised artificial neural network and empirically evaluated the proposed approach and tool using real Web service descriptions drawn from operational Web service registries. We report on some preliminary results demonstrating the efficacy of the proposed approach.

  • PDF

A Study on Effectiveness of the Hospital-based Home Nursing Care of the Early Discharged Surgical Patients and its Cost Analysis (조기퇴원 수술환자의 병원중심 가정간호 효과 및 비용분석에 관한 연구)

  • 박경숙;정연강
    • Journal of Korean Academy of Nursing
    • /
    • v.24 no.4
    • /
    • pp.545-556
    • /
    • 1994
  • Medical insurance and health care delivery system enabled Korean people to get the necessary medical service, but it caused increased needs for medical service, and resulted in the occurence of some problems such as a lack of manpower and medical facilities. In order to solve these problems, many countries, which already had medical insurance system had developed home care system and it has been regarded effective both in reducing costs and in increasing the rates of turnover of bed. Recently, Korea has included home nursing care in its health care delivery system, and some models of the hospital based home nursing care had been tried and its effects had been evaluated. So, author tried to run a home nursing care for the Cesarean section mothers and evaluate Its effects both in the mother's health and costs. This study was designed as a Quasi-experimental study. Subjects were thirty mothers who got Cesarean section operation in hospital in Seoul. Experimental group consisted of 15 volunteers, and control group were selected by means of matching technique. Data were gathered from February 1st to March 26th by two assistants who were trained by author. Experimental group were discharged on the 4th day after their operation, and got nursing care and assessment about their home three times on the 5th, 6th, and 7th day. Control group stayed in the hospital until 7th day as usual and were checked on the same day as above mentioned To evaluate the state of physiological recovery, vital signs, H.O.F, presence of edema in the legs, bathing, appetite, sleep, presence of pain or discomfort in the breasts, amount of lochia, color of lochia, defecation urination. To compare incidence of complication in experimental group with that in control group, specific assessment was done such variables as smell of lochia, presence of inflammation of operation wound, dizziness, and presence of immobilization in the extremities. The activities of daily living were checked Satisfaction of nursing were checked To calculate costs, author asked subjects to specify expenditure including hospital charge, traffic enpenses, and food expenses. The results were as fellows. 1. On effectiveness of home nursing careThere were n significant differences between experimental and control group in incidence of abnormal symptoms and any complication. The number of taking a bath [POD #5 P=0.001, #6 P=0.0003, #7 P=0.001] and the degree of appetite [POD #5 P=0.03, #6 P=0.02, #7 P=0.013] were significantly higher in experimental group than in control group. Contrary to author's expectation, the degree of the activities of daily living in experimental group was not higher than that of control group. All of the experimental group said they were satisfied with the home nursing care. 2. Cost analysis 1) Hospital charge of experimental group was lower than that of control group. [P=0.009] By taking home nursing care, average period of hospitalization was shortened to 3.1 days, and family members could save 22.8 hours. Total amount of money saved by early discharge was 3,443,093 Won. It is estimated that total amount of money saved by early discharge in a year will be 40,398,956 Won. 2) Home nursing care charge of 15 mothers was 1,781,633 Won. It is estimated that total amount of money Saved by it in a year Will be 20,904,493 Won. It was lower altogether than hospital charge of the three days which is 5th, 6th, 7th day of operation. The average cost of single home visit was calculated 10,940 Won. It took 87 minutes per round and it costed 1,017.3 Won. The average hour of home care was 39.0 minutes. 3) It is expected that early discharge can bring forth the increase of hospital income. On the condition that the rate of running bed is 100%, the expected increase of hospital income will be 202,374, 026 Won in a year. Suggestions for further study and nursing practice are as follows : 1. For the welfare of patients and the increased rates of running bed, home nursing care system should be included in the hospital nursing care system. 2. Studies to test effect of home nursing care on the patients with other diseases are needed. 3. Establishment of law on the practice of home nursing care is strongly recommended.

  • PDF

A Tool Box to Evaluate the Phased Array Coil Performance Using Retrospective 3D Coil Modeling (3차원 코일 모델링을 통해 위상배열코일 성능을 평가하기 위한 프로그램)

  • Perez, Marlon;Hernandez, Daniel;Michel, Eric;Cho, Min Hyoung;Lee, Soo Yeol
    • Investigative Magnetic Resonance Imaging
    • /
    • v.18 no.2
    • /
    • pp.107-119
    • /
    • 2014
  • Purpose : To efficiently evaluate phased array coil performance using a software tool box with which we can make visual comparison of the sensitivity of every coil element between the real experiment and EM simulation. Materials and Methods: We have developed a $C^{{+}{+}}$- and MATLAB-based software tool called Phased Array Coil Evaluator (PACE). PACE has the following functions: Building 3D models of the coil elements, importing the FDTD simulation results, and visualizing the coil sensitivity of each coil element on the ordinary Cartesian coordinate and the relative coil position coordinate. To build a 3D model of the phased array coil, we used an electromagnetic 3D tracker in a stylus form. After making the 3D model, we imported the 3D model into the FDTD electromagnetic field simulation tool. Results: An accurate comparison between the coil sensitivity simulation and real experiment on the tool box platform has been made through fine matching of the simulation and real experiment with aids of the 3D tracker. In the simulation and experiment, we used a 36-channel helmet-style phased array coil. At the 3D MRI data acquisition using the spoiled gradient echo sequence, we used the uniform cylindrical phantom that had the same geometry as the one in the FDTD simulation. In the tool box, we can conveniently choose the coil element of interest and we can compare the coil sensitivities element-by-element of the phased array coil. Conclusion: We expect the tool box can be greatly used for developing phased array coils of new geometry or for periodic maintenance of phased array coils in a more accurate and consistent manner.

A Study On Design of ZigBee Chip Communication Module for Remote Radiation Measurement (원격 방사선 측정을 위한 ZigBee 원칩형 통신 모듈 설계에 대한 연구)

  • Lee, Joo-Hyun;Lee, Seung-Ho
    • Journal of IKEEE
    • /
    • v.18 no.4
    • /
    • pp.552-558
    • /
    • 2014
  • This paper suggests how to design a ZigBee-chip-based communication module to remotely measure radiation level. The suggested communication module consists of two control processors for the chip as generally required to configure a ZigBee system, and one chip module to configure a ZigBee RF device. The ZigBee-chip-based communication module for remote radiation measurement consists of a wireless communication controller; sensor and high-voltage generator; charger and power supply circuit; wired communication part; and RF circuit and antenna. The wireless communication controller is to control wireless communication for ZigBee and to measure radiation level remotely. The sensor and high-voltage generator generates 500 V in two consecutive series to amplify and filter pulses of radiation detected by G-M Tube. The charger and power supply circuit part is to charge lithium-ion battery and supply power to one-chip processors. The wired communication part serves as a RS-485/422 interface to enable USB interface and wired remote communication for interfacing with PC and debugging. RF circuit and antenna applies an RLC passive component for chip antenna to configure BALUN and antenna impedance matching circuit, allowing wireless communication. After configuring the ZigBee-chip-based communication module, tests were conducted to measure radiation level remotely: data were successfully transmitted in 10-meter and 100-meter distances, measuring radiation level in a remote condition. The communication module allows an environment where radiation level can be remotely measured in an economically beneficial way as it not only consumes less electricity but also costs less. By securing linearity of a radiation measuring device and by minimizing the device itself, it is possible to set up an environment where radiation can be measured in a reliable manner, and radiation level is monitored real-time.