A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)
-
- Journal of Intelligence and Information Systems
- /
- v.25 no.2
- /
- pp.25-38
- /
- 2019
Selecting high-quality information that meets the interests and needs of users among the overflowing contents is becoming more important as the generation continues. In the flood of information, efforts to reflect the intention of the user in the search result better are being tried, rather than recognizing the information request as a simple string. Also, large IT companies such as Google and Microsoft focus on developing knowledge-based technologies including search engines which provide users with satisfaction and convenience. Especially, the finance is one of the fields expected to have the usefulness and potential of text data analysis because it's constantly generating new information, and the earlier the information is, the more valuable it is. Automatic knowledge extraction can be effective in areas where information flow is vast, such as financial sector, and new information continues to emerge. However, there are several practical difficulties faced by automatic knowledge extraction. First, there are difficulties in making corpus from different fields with same algorithm, and it is difficult to extract good quality triple. Second, it becomes more difficult to produce labeled text data by people if the extent and scope of knowledge increases and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, problem definition for automatic knowledge extraction is not easy because of ambiguous conceptual characteristics of knowledge. So, in order to overcome limits described above and improve the semantic performance of stock-related information searching, this study attempts to extract the knowledge entity by using neural tensor network and evaluate the performance of them. Different from other references, the purpose of this study is to extract knowledge entity which is related to individual stock items. Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous researches and to enhance the effectiveness of the model. From these processes, this study has the following three significances. First, A practical and simple automatic knowledge extraction method that can be applied. Second, the possibility of performance evaluation is presented through simple problem definition. Finally, the expressiveness of the knowledge increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and objective performance evaluation method are also presented. The empirical study to confirm the usefulness of the presented model, experts' reports about individual 30 stocks which are top 30 items based on frequency of publication from May 30, 2017 to May 21, 2018 are used. the total number of reports are 5,600, and 3,074 reports, which accounts about 55% of the total, is designated as a training set, and other 45% of reports are designated as a testing set. Before constructing the model, all reports of a training set are classified by stocks, and their entities are extracted using named entity recognition tool which is the KKMA. for each stocks, top 100 entities based on appearance frequency are selected, and become vectorized using one-hot encoding. After that, by using neural tensor network, the same number of score functions as stocks are trained. Thus, if a new entity from a testing set appears, we can try to calculate the score by putting it into every single score function, and the stock of the function with the highest score is predicted as the related item with the entity. To evaluate presented models, we confirm prediction power and determining whether the score functions are well constructed by calculating hit ratio for all reports of testing set. As a result of the empirical study, the presented model shows 69.3% hit accuracy for testing set which consists of 2,526 reports. this hit ratio is meaningfully high despite of some constraints for conducting research. Looking at the prediction performance of the model for each stocks, only 3 stocks, which are LG ELECTRONICS, KiaMtr, and Mando, show extremely low performance than average. this result maybe due to the interference effect with other similar items and generation of new knowledge. In this paper, we propose a methodology to find out key entities or their combinations which are necessary to search related information in accordance with the user's investment intention. Graph data is generated by using only the named entity recognition tool and applied to the neural tensor network without learning corpus or word vectors for the field. From the empirical test, we confirm the effectiveness of the presented model as described above. However, there also exist some limits and things to complement. Representatively, the phenomenon that the model performance is especially bad for only some stocks shows the need for further researches. Finally, through the empirical study, we confirmed that the learning method presented in this study can be used for the purpose of matching the new text information semantically with the related stocks.
Convolutional Neural Network (ConvNet) is one class of the powerful Deep Neural Network that can analyze and learn hierarchies of visual features. Originally, first neural network (Neocognitron) was introduced in the 80s. At that time, the neural network was not broadly used in both industry and academic field by cause of large-scale dataset shortage and low computational power. However, after a few decades later in 2012, Krizhevsky made a breakthrough on ILSVRC-12 visual recognition competition using Convolutional Neural Network. That breakthrough revived people interest in the neural network. The success of Convolutional Neural Network is achieved with two main factors. First of them is the emergence of advanced hardware (GPUs) for sufficient parallel computation. Second is the availability of large-scale datasets such as ImageNet (ILSVRC) dataset for training. Unfortunately, many new domains are bottlenecked by these factors. For most domains, it is difficult and requires lots of effort to gather large-scale dataset to train a ConvNet. Moreover, even if we have a large-scale dataset, training ConvNet from scratch is required expensive resource and time-consuming. These two obstacles can be solved by using transfer learning. Transfer learning is a method for transferring the knowledge from a source domain to new domain. There are two major Transfer learning cases. First one is ConvNet as fixed feature extractor, and the second one is Fine-tune the ConvNet on a new dataset. In the first case, using pre-trained ConvNet (such as on ImageNet) to compute feed-forward activations of the image into the ConvNet and extract activation features from specific layers. In the second case, replacing and retraining the ConvNet classifier on the new dataset, then fine-tune the weights of the pre-trained network with the backpropagation. In this paper, we focus on using multiple ConvNet layers as a fixed feature extractor only. However, applying features with high dimensional complexity that is directly extracted from multiple ConvNet layers is still a challenging problem. We observe that features extracted from multiple ConvNet layers address the different characteristics of the image which means better representation could be obtained by finding the optimal combination of multiple ConvNet layers. Based on that observation, we propose to employ multiple ConvNet layer representations for transfer learning instead of a single ConvNet layer representation. Overall, our primary pipeline has three steps. Firstly, images from target task are given as input to ConvNet, then that image will be feed-forwarded into pre-trained AlexNet, and the activation features from three fully connected convolutional layers are extracted. Secondly, activation features of three ConvNet layers are concatenated to obtain multiple ConvNet layers representation because it will gain more information about an image. When three fully connected layer features concatenated, the occurring image representation would have 9192 (4096+4096+1000) dimension features. However, features extracted from multiple ConvNet layers are redundant and noisy since they are extracted from the same ConvNet. Thus, a third step, we will use Principal Component Analysis (PCA) to select salient features before the training phase. When salient features are obtained, the classifier can classify image more accurately, and the performance of transfer learning can be improved. To evaluate proposed method, experiments are conducted in three standard datasets (Caltech-256, VOC07, and SUN397) to compare multiple ConvNet layer representations against single ConvNet layer representation by using PCA for feature selection and dimension reduction. Our experiments demonstrated the importance of feature selection for multiple ConvNet layer representation. Moreover, our proposed approach achieved 75.6% accuracy compared to 73.9% accuracy achieved by FC7 layer on the Caltech-256 dataset, 73.1% accuracy compared to 69.2% accuracy achieved by FC8 layer on the VOC07 dataset, 52.2% accuracy compared to 48.7% accuracy achieved by FC7 layer on the SUN397 dataset. We also showed that our proposed approach achieved superior performance, 2.8%, 2.1% and 3.1% accuracy improvement on Caltech-256, VOC07, and SUN397 dataset respectively compare to existing work.
Much has teed changed in the field of hospital administration in the It wake of the rapid development of sciences, techniques ana systematic hospital management. However, we still have a long way to go in organization, in the quality of hospital employees and hospital equipment and facilities, and in financial support in order to achieve proper hospital management. The above factors greatly effect the ability of hospitals to fulfill their obligation in patient care and nursing services. The purpose of this study is to determine the optimal methods of standardization and quality nursing so as to improve present nursing services through investigations and analyses of various problems concerning nursing administration. This study has been undertaken during the six month period from October 1971 to March 1972. The 41 comprehensive hospitals have been selected iron amongst the 139 in the whole country. These have been categorized according-to the specific purposes of their establishment, such as 7 university hospitals, 18 national or public hospitals, 12 religious hospitals and 4 enterprise ones. The following conclusions have been acquired thus far from information obtained through interviews with nursing directors who are in charge of the nursing administration in each hospital, and further investigations concerning the purposes of establishment, the organization, personnel arrangements, working conditions, practices of service, and budgets of the nursing service department. 1. The nursing administration along with its activities in this country has been uncritical1y adopted from that of the developed countries. It is necessary for us to re-establish a new medical and nursing system which is adequate for our social environments through continuous study and research. 2. The survey shows that the 7 university hospitals were chiefly concerned with education, medical care and research; the 18 national or public hospitals with medical care, public health and charity work; the 2 religious hospitals with medical care, charity and missionary works; and the 4 enterprise hospitals with public health, medical care and charity works. In general, the main purposes of the hospitals were those of charity organizations in the pursuit of medical care, education and public benefits. 3. The survey shows that in general hospital facilities rate 64 per cent and medical care 60 per-cent against a 100 per cent optimum basis in accordance with the medical treatment law and approved criteria for training hospitals. In these respects, university hospitals have achieved the highest standards, followed by religious ones, enterprise ones, and national or public ones in that order. 4. The ages of nursing directors range from 30 to 50. The level of education achieved by most of the directors is that of graduation from a nursing technical high school and a three year nursing junior college; a very few have graduated from college or have taken graduate courses. 5. As for the career tenure of nurses in the hospitals: one-third of the nurses, or 38 per cent, have worked less than one year; those in the category of one year to two represent 24 pet cent. This means that a total of 62 per cent of the career nurses have been practicing their profession for less than two years. Career nurses with over 5 years experience number only 16 per cent: therefore the efficiency of nursing services has been rated very low. 6. As for the standard of education of the nurses: 62 per cent of them have taken a three year course of nursing in junior colleges, and 22 per cent in nursing technical high schools. College graduate nurses come up to only 15 per cent; and those with graduate course only 0.4 per cent. This indicates that most of the nurses are front nursing technical high schools and three year nursing junior colleges. Accordingly, it is advisable that nursing services be divided according to their functions, such as professional, technical nurses and nurse's aides. 7. The survey also shows that the purpose of nursing service administration in the hospitals has been regulated in writing in 74 per cent of the hospitals and not regulated in writing in 26 per cent of the hospitals. The general purposes of nursing are as follows: patient care, assistance in medical care and education. The main purpose of these nursing services is to establish proper operational and personnel management which focus on in-service education. 8. The nursing service departments belong to the medical departments in almost 60 per cent of the hospitals. Even though the nursing service department is formally separated, about 24 per cent of the hospitals regard it as a functional unit in the medical department. Only 5 per cent of the hospitals keep the department as a separate one. To the contrary, approximately 12 per cent of the hospitals have not established a nursing service department at all but surbodinate it to the other department. In this respect, it is required that a new hospital organization be made to acknowledge the independent function of the nursing department. In 76 per cent of the hospitals they have advisory committees under the nursing department, such as a dormitory self·regulating committee, an in-service education committee and a nursing procedure and policy committee. 9. Personnel arrangement and working conditions of nurses 1) The ratio of nurses to patients is as follows: In university hospitals, 1 to 2.9 for hospitalized patients and 1 to 4.0 for out-patients; in religious hospitals, 1 to 2.3 for hospitalized patients and 1 to 5.4 for out-patients. Grouped together this indicates that one nurse covers 2.2 hospitalized patients and 4.3 out-patients on a daily basis. The current medical treatment law stipulates that one nurse should care for 2.5 hospitalized patients or 30.0 out-patients. Therefore the statistics indicate that nursing services are being peformed with an insufficient number of nurses to cover out-patients. The current law concerns the minimum number of nurses and disregards the required number of nurses for operation rooms, recovery rooms, delivery rooms, new-born baby rooms, central supply rooms and emergency rooms. Accordingly, tile medical treatment law has been requested to be amended. 2) The ratio of doctors to nurses: In university hospitals, the ratio is 1 to 1.1; in national of public hospitals, 1 to 0.8; in religious hospitals 1 to 0.5; and in private hospitals 1 to 0.7. The average ratio is 1 to 0.8; generally the ideal ratio is 3 to 1. Since the number of doctors working in hospitals has been recently increasing, the nursing services have consequently teen overloaded, sacrificing the services to the patients. 3) The ratio of nurses to clerical staff is 1 to 0.4. However, the ideal ratio is 5 to 1, that is, 1 to 0.2. This means that clerical personnel far outnumber the nursing staff. 4) The ratio of nurses to nurse's-aides; The average 2.5 to 1 indicates that most of the nursing service are delegated to nurse's-aides owing to the shortage of registered nurses. This is the main cause of the deterioration in the quality of nursing services. It is a real problem in the guest for better nursing services that certain hospitals employ a disproportionate number of nurse's-aides in order to meet financial requirements. 5) As for the working conditions, most of hospitals employ a three-shift day with 8 hours of duty each. However, certain hospitals still use two shifts a day. 6) As for the working environment, most of the hospitals lack welfare and hygienic facilities. 7) The salary basis is the highest in the private university hospitals, with enterprise hospitals next and religious hospitals and national or public ones lowest. 8) Method of employment is made through paper screening, and further that the appointment of nurses is conditional upon the favorable opinion of the nursing directors. 9) The unemployment ratio for one year in 1971 averaged 29 per cent. The reasons for unemployment indicate that the highest is because of marriage up to 40 per cent, and next is because of overseas employment. This high unemployment ratio further causes the deterioration of efficiency in nursing services and supplementary activities. The hospital authorities concerned should take this matter into a jeep consideration in order to reduce unemployment. 10) The importance of in-service education is well recognized and established. 1% has been noted that on the-job nurses. training has been most active, with nursing directors taking charge of the orientation programs of newly employed nurses. However, it is most necessary that a comprehensive study be made of instructors, contents and methods of education with a separate section for in-service education. 10. Nursing services'activities 1) Division of services and job descriptions are urgently required. 81 per rent of the hospitals keep written regulations of services in accordance with nursing service manuals. 19 per cent of the hospitals do not keep written regulations. Most of hospitals delegate to the nursing directors or certain supervisors the power of stipulating service regulations. In 21 per cent of the total hospitals they have policy committees, standardization committees and advisory committees to proceed with the stipulation of regulations. 2) Approximately 81 per cent of the hospitals have service channels in which directors, supervisors, head nurses and staff nurses perform their appropriate services according to the service plans and make up the service reports. In approximately 19 per cent of the hospitals the staff perform their nursing services without utilizing the above channels. 3) In the performance of nursing services, a ward manual is considered the most important one to be utilized in about 32 percent of hospitals. 25 per cent of hospitals indicate they use a kardex; 17 per cent use ward-rounding, and others take advantage of work sheets or coordination with other departments through conferences. 4) In about 78 per cent of hospitals they have records which indicate the status of personnel, and in 22 per cent they have not. 5) It has been advised that morale among nurses may be increased, ensuring more efficient services, by their being able to exchange opinions and views with each other. 6) The satisfactory performance of nursing services rely on the following factors to the degree indicated: approximately 32 per cent to the systematic nursing activities and services; 27 per cent to the head nurses ability for nursing diagnosis; 22 per cent to an effective supervisory system; 16 per cent to the hospital facilities and proper supply, and 3 per cent to effective in·service education. This means that nurses, supervisors, head nurses and directors play the most important roles in the performance of nursing services. 11. About 87 per cent of the hospitals do not have separate budgets for their nursing departments, and only 13 per cent of the hospitals have separate budgets. It is recommended that the planning and execution of the nursing administration be delegated to the pertinent administrators in order to bring about improved proved performances and activities in nursing services.