• Title/Summary/Keyword: Text data

Search Result 2,956, Processing Time 0.03 seconds

Achievement and satisfaction research of the undergraduate orchestra club activities - A convergent aspects of statistical method and opinion mining (오케스트라 동아리 참여 대학생의 성취도 및 만족도 조사 - 통계적 방법과 오피니언 마이닝의 융합적 측면)

  • Choi, Sui;Choi, Kyoungho
    • Journal of the Korea Convergence Society
    • /
    • v.6 no.4
    • /
    • pp.25-31
    • /
    • 2015
  • General student orchestra activity is known as desirable hobby for students of adolescent period, developing their creativity and sensitivity, give students sense of belonging and stability by resolving their social, emotional anxiety. Accordingly, his research investigated whether orchestra club activity also has similar effect on university students. As a result unlike of adolescent students, orchestra activity turned out to be not that helpful for the social, self-confidence improvement of university students, though achievement of the activity itself was high. Despite of the result, there exist positive factors; obviously the activity has positive factors analyzing through recognition analysis (opinion mining) using big data. Therefore national support is required also for the orchestra activity of the undergraduates.

The Scalability and the Strategy for EMR Database Encryption Techniques

  • Shin, David;Sahama, Tony;Kim, Steve Jung-Tae;Kim, Ji-Hong
    • Journal of information and communication convergence engineering
    • /
    • v.9 no.5
    • /
    • pp.577-582
    • /
    • 2011
  • EMR(Electronic Medical Record) is an emerging technology that is highly-blended between non-IT and IT area. One of methodology to link non-IT and IT area is to construct databases. Nowadays, it supports before and after-treatment for patients and should satisfy all stakeholders such as practitioners, nurses, researchers, administrators and financial department and so on. In accordance with the database maintenance, DAS (Data as Service) model is one solution for outsourcing. However, there are some scalability and strategy issues when we need to plan to use DAS model properly. We constructed three kinds of databases such as plain-text, MS built-in encryption which is in-house model and custom AES (Advanced Encryption Standard) - DAS model scaling from 5K to 2560K records. To perform custom AES-DAS better, we also devised Bucket Index using Bloom Filter. The simulation showed the response times arithmetically increased in the beginning but after a certain threshold, exponentially increased in the end. In conclusion, if the database model is close to in-house model, then vendor technology is a good way to perform and get query response times in a consistent manner. If the model is DAS model, it is easy to outsource the database, however, some technique like Bucket Index enhances its utilization. To get faster query response times, designing database such as consideration of the field type is also important. This study suggests cloud computing would be a next DAS model to satisfy the scalability and the security issues.

Self Introduction Essay Classification Using Doc2Vec for Efficient Job Matching (Doc2Vec 모형에 기반한 자기소개서 분류 모형 구축 및 실험)

  • Kim, Young Soo;Moon, Hyun Sil;Kim, Jae Kyeong
    • Journal of Information Technology Services
    • /
    • v.19 no.1
    • /
    • pp.103-112
    • /
    • 2020
  • Job seekers are making various efforts to find a good company and companies attempt to recruit good people. Job search activities through self-introduction essay are nowadays one of the most active processes. Companies spend time and cost to reviewing all of the numerous self-introduction essays of job seekers. Job seekers are also worried about the possibility of acceptance of their self-introduction essays by companies. This research builds a classification model and conducted an experiments to classify self-introduction essays into pass or fail using deep learning and decision tree techniques. Real world data were classified using stratified sampling to alleviate the data imbalance problem between passed self-introduction essays and failed essays. Documents were embedded using Doc2Vec method developed from existing Word2Vec, and they were classified using logistic regression analysis. The decision tree model was chosen as a benchmark model, and K-fold cross-validation was conducted for the performance evaluation. As a result of several experiments, the area under curve (AUC) value of PV-DM results better than that of other models of Doc2Vec, i.e., PV-DBOW and Concatenate. Furthmore PV-DM classifies passed essays as well as failed essays, while PV_DBOW can not classify passed essays even though it classifies well failed essays. In addition, the classification performance of the logistic regression model embedded using the PV-DM model is better than the decision tree-based classification model. The implication of the experimental results is that company can reduce the cost of recruiting good d job seekers. In addition, our suggested model can help job candidates for pre-evaluating their self-introduction essays.

Correction Method of Slit Modulation Transfer function on Digital Medical Imaging System (디지털 의료영상에서 슬릿법에 의한 Modulation Transfer Function의 보정방법)

  • Kim, Jung-Min;Jung, Hoi-Woun;Min, Jung-Whan;Im, Eon-Kyung
    • Journal of radiological science and technology
    • /
    • v.29 no.3
    • /
    • pp.133-139
    • /
    • 2006
  • By using CR image pixel data, We examined the way how to calculate the MTF and digital characteristic curve. It can be changed to the text-file(Excel) from a pixel data which was printed with a digital x-ray equipment. In this place, We described the way how to figure out and correct the sharpness of a digital images of the MTF from FUJITA. Excel program was utilized to calculate from radiography of slit. Digital characteristic curve, Line Spread Function, Discrete Fourier Transform, Fast Fourier Transform digital specification curve, were indicated in regular sequence. A big advantage of this method, It can be understood easily and you can get results without costly program and without full knowledge of computer language. It shows many different values by using different correction methods. Therefore we need to be handy with appropriate correction method and we should try many experiments to get a precise MTF figures.

  • PDF

The Current Status of Utilization and Demand on Cancer Information in the Faculties of Medical School in Korea (국내 의과대학 교수의 암정보 활용 현황과 요구도)

  • Lim, Min-Kyung;Park, Sook-Kyung;Yang, Jeong-Hee;Lee, Young-Sung
    • Journal of Preventive Medicine and Public Health
    • /
    • v.36 no.1
    • /
    • pp.39-46
    • /
    • 2003
  • Objectives : To investigate the availability and demand for overall cancer-related information, and to establish a basic plan for the construction of a cancer database and information system based on the research results from Korea. Methods : Postal and telephone surveys were carried out, between August 2001 and November 2001, of 323 affiliated faculty professors from medical universities and colleges in Korea. The data were analyzed with descriptive statistical methods, with regard to the present status and demand for health and cancer-related information. Results : Most (over 80%) subjects studied utilized the health-related information provided on Internet website from foreign countries, such as Medline, but similar comprehensive information system lacked in Korea. The construction of a cancer-related database of domestic research results was revealed to be in a great demand. Information on registration and statistics (52.8%), study results (48.5%) and study resources (37.4%) were the major ingredients required in the database. In constructing a database of the cancer-related research results, a full-text service, continuous updating of data, and the development of standardized user-friendly searching tool were regarded as the necessary components. The formulation of an information sharing system, regarding cancer-related clinical trials, was investigated as being quite feasible. Conclusion : This study demonstrated the great importance of cancer information systems, and much demand for an available cancer-related database based on Korean research results.

An Evaluation of Website Information Architecture for Old Adults: Focused on Organization and Labeling System (고령층을 위한 웹 사이트 정보 구조 평가: 조직화 체계와 레이블링 체계를 중심으로)

  • Seo, Jiwoong;Kim, Heesop
    • Journal of the Korean Society for information Management
    • /
    • v.33 no.1
    • /
    • pp.181-196
    • /
    • 2016
  • The objective of this study is to evaluate the organization system and the labeling system of information architecture of a website for the elderly. To achieve this aims, we selected a representative website, i.e., Naver, and the participants were conducted given three types of search tasks using their own information literacy skills and they were answered to the questionnaire and an additional interview, if necessary. A total of 74 valid data were collected through the experiment, and we analyzed the data using SPSS Ver. 20. It revealed that Naver received a positive evaluation in the organization system aspect, particularly its systematic subject categorization and chronological browsing mechanisms. Old adults were preferred the icon-based labeling than the text-based labeling system, and showed a significant difference among their academic backgrounds.

Modeling User Preference based on Bayesian Networks for Office Event Retrieval (사무실 이벤트 검색을 위한 베이지안 네트워크 기반 사용자 선호도 모델링)

  • Lim, Soo-Jung;Park, Han-Saem;Cho, Sung-Bae
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.14 no.6
    • /
    • pp.614-618
    • /
    • 2008
  • As the multimedia data increase a lot with the rapid development of the Internet, an efficient retrieval technique focusing on individual users is required based on the analyses of such data. However, user modeling services provided by recent web sites have the limitation of text-based page configurations and recommendation retrieval. In this paper, we construct the user preference model with a Bayesian network to apply the user modeling to video retrieval, and suggest a method which utilizes probability reasoning. To do this, context information is defined in a real office environment and the video scripts acquired from established cameras and annotated the context information manually are used. Personal information of the user, obtained from user input, is adopted for the evidence value of the constructed Bayesian Network, and user preference is inferred. The probability value, which is produced from the result of Bayesian Network reasoning, is used for retrieval, making the system return the retrieval result suitable for each user's preference. The usability test indicates that the satisfaction level of the selected results based on the proposed model is higher than general retrieval method.

Performance Evaluation of WAVE Communication System for the Next-Generation ITS (차세대 ITS를 위한 WAVE 통신 시스템 성능 평가)

  • Lee, Se-Yeun;Jeong, Han-Gyun;Shin, Dae-Kyo;Lim, Ki-Taeg;Lee, Myung-Ho
    • Journal of Advanced Navigation Technology
    • /
    • v.15 no.6
    • /
    • pp.1059-1067
    • /
    • 2011
  • Next-Generation ITS environment requires high-speed data packet transmission, security, authentication, and hand-over supportable for driving vehicle on road by installing RSEs and OBUs. Therefore, wireless communication technology for next-generation ITS services are advancing to 200km/h maximum speed supportable, 1km communication radius, minimum 10Mbps hish-speed datarate for multimedia data(such as text, images, movie clips and so on) supportable, high reliability. In this paper, we implemented WAVE communication system which based on IEEE 802.11p PHY/MAC and evaluated that the system meets next-generation ITS environments.

VOC Summarization and Classification based on Sentence Understanding (구문 의미 이해 기반의 VOC 요약 및 분류)

  • Kim, Moonjong;Lee, Jaean;Han, Kyouyeol;Ahn, Youngmin
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.1
    • /
    • pp.50-55
    • /
    • 2016
  • To attain an understanding of customers' opinions or demands regarding a companies' products or service, it is important to consider VOC (Voice of Customer) data; however, it is difficult to understand contexts from VOC because segmented and duplicate sentences and a variety of dialog contexts. In this article, POS (part of speech) and morphemes were selected as language resources due to their semantic importance regarding documents, and based on these, we defined an LSP (Lexico-Semantic-Pattern) to understand the structure and semantics of the sentences and extracted summary by key sentences; furthermore the LSP was introduced to connect the segmented sentences and remove any contextual repetition. We also defined the LSP by categories and classified the documents based on those categories that comprise the main sentences matched by LSP. In the experiment, we classified the VOC-data documents for the creation of a summarization before comparing the result with the previous methodologies.

Design and Implementation of IoT Terminal Equipment for Vessels using Thuraya Geo-stationary Orbit Satellite (Thuraya 정지궤도 위성을 이용한 선박용 IoT 단말 장치 설계 및 구현)

  • Jang, Won-Chang;Lee, Myung-Eui
    • Journal of Advanced Navigation Technology
    • /
    • v.24 no.2
    • /
    • pp.67-72
    • /
    • 2020
  • Satellite communication is not used by many people like mobile communication, but it is a necessary technology for public service and communication services, such as providing the Internet in military, disaster, remote education and medical services, island areas, and infrastructure vulnerable areas. However, on ships and aircraft, mobile communications requiring base stations are either unavailable or restricted in their use. In this paper, we used a Raspberry Pi board as the terminal device to communicate network through satellite modem and PPP protocol, and implemented two-way data link using the text message of the modem to connect to the Thuraya geo-stationary orbit network. In addition, I/O devices were connected to the controller of the terminal equipment to design and implement an IoT device system for ships that can remotely access the system under control and control I/Os and transmit measured data through various sensors.