• Title/Summary/Keyword: Automated analysis system


Web Site Keyword Selection Method by Considering Semantic Similarity Based on Word2Vec (Word2Vec 기반의 의미적 유사도를 고려한 웹사이트 키워드 선택 기법)

  • Lee, Donghun;Kim, Kwanho
    • The Journal of Society for e-Business Studies / v.23 no.2 / pp.83-96 / 2018
  • Extracting keywords that represent documents is important because the keywords can be used for automated services such as document search, classification, and recommendation, as well as for conveying document information quickly. However, keyword extraction based on the frequency of words appearing in web site documents, or on graph algorithms that use word co-occurrence, has two problems: the web page structure may contain various words unrelated to the topic, and the limited performance of the Korean tokenizer makes it difficult to extract semantic keywords. In this paper, we propose a method that selects candidate keywords based on semantic similarity, addressing both the failure to extract semantic keywords and the poor accuracy of Korean tokenizer analysis. Finally, a filtering process removes inconsistent keywords to yield the final semantic keywords. Experimental results on real web pages of small businesses show that the proposed method improves performance by 34.52% over the statistical similarity-based keyword selection technique. This confirms that considering semantic similarity between words and removing inconsistent keywords improves keyword extraction from documents.
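
The similarity-based filtering idea in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' method: the function names, the mean-similarity rule, the threshold, and the toy 3-dimensional vectors are all assumptions, and a real system would use vectors from a trained Word2Vec model.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def select_keywords(candidates, embeddings, threshold=0.5):
    """Keep candidates whose mean similarity to the other candidates reaches the threshold.

    The mean-similarity rule and the threshold are illustrative assumptions.
    """
    kept = []
    for word in candidates:
        others = [w for w in candidates if w != word]
        if not others:
            kept.append(word)
            continue
        mean_sim = sum(
            cosine_similarity(embeddings[word], embeddings[o]) for o in others
        ) / len(others)
        if mean_sim >= threshold:
            kept.append(word)
    return kept


# Made-up vectors for illustration only (not trained Word2Vec output).
toy_embeddings = {
    "coffee": [0.9, 0.1, 0.0],
    "espresso": [0.8, 0.2, 0.1],
    "bakery": [0.7, 0.3, 0.0],
    "login": [0.0, 0.1, 0.9],
}

selected = select_keywords(list(toy_embeddings), toy_embeddings)
print(selected)
```

With these toy vectors, the off-topic word "login" has a low mean similarity to the other candidates and is filtered out.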

Constructing Database for Drugs and its Application to Biological Sample by HPTLC and GC/MS (HPTLC와 GC/MS를 이용한 의약품의 데이타베이스화 및 생체시료에의 응용)

  • Yoo, Young-Chan;Park, Sung-Woo;Lim, Mie-Ae;Baeck, Seung-Kyung;Park, Seh-Youn;Lee, Ju-Seon;Lho, Dong-Seok
    • Analytical Science and Technology / v.13 no.2 / pp.136-150 / 2000
  • For the identification of unknown drugs in biological samples, we attempted a rapid, sensitive, and selective high performance thin layer chromatography (HPTLC) method using an automated TLC sampler and an ultraviolet (UV) scanner. We constructed an HPTLC database (DB) of 205 drugs using Rf values and UV spectra (scan 200-360 nm), as well as a gas chromatography/mass spectrometry (GC/MS) DB of 96 drugs using relative retention times (RRT) with respect to lidocaine and mass spectra. After extracting drugs from biological samples by solid phase extraction (Clean Screen ZSDAU020), we matched them against the HPTLC and GC/MS DBs. Drugs extracted from biological samples showed a good matching ratio against the HPTLC DB, and these drugs were confirmed by GC/MS. In conclusion, this DB system is considered a very useful method for screening unknown drugs in biological samples.
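
As a rough illustration of the Rf-based lookup described above: the Rf value is the distance travelled by the compound divided by the distance travelled by the solvent front, and a measured value can be compared against stored values within a tolerance. The database entries, the tolerance, and the function names below are hypothetical placeholders, not values from the paper.

```python
def rf_value(compound_distance_mm, solvent_front_mm):
    """Retention factor: compound migration distance over solvent front distance."""
    if solvent_front_mm <= 0:
        raise ValueError("solvent front distance must be positive")
    return compound_distance_mm / solvent_front_mm


def match_candidates(measured_rf, database, tolerance=0.03):
    """Return database names whose stored Rf lies within the tolerance, closest first."""
    hits = [
        (abs(stored_rf - measured_rf), name)
        for name, stored_rf in database.items()
        if abs(stored_rf - measured_rf) <= tolerance
    ]
    return [name for _, name in sorted(hits)]


# Placeholder entries; real Rf values depend on the plate and solvent system.
placeholder_db = {"drug_A": 0.42, "drug_B": 0.45, "drug_C": 0.80}

measured = rf_value(35.0, 80.0)
candidates = match_candidates(measured, placeholder_db)
print(measured, candidates)
```

In practice an Rf match only narrows the candidates; the abstract notes that UV spectra and GC/MS were used for matching and confirmation.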


Bioequivalence Test of Triflusal Capsules (트리플루살 캅셀의 생물학적 동등성 평가)

  • 박정숙;이미경;박경미;김진기;임수정;최성희;민경아;김종국
    • Biomolecules & Therapeutics / v.9 no.4 / pp.291-297 / 2001
  • The bioequivalence of two triflusal products was evaluated in 20 healthy volunteers following a single oral dose, according to the guidelines of the Korea Food and Drug Administration (KFDA). Trisal® capsule (Whanin Pharm. Corp., Korea) and Disgren® capsule (Myung-In Pharm. Corp., Korea) were used as the test product and the reference product, respectively. Both products contain 300 mg of triflusal. One capsule of the test or reference product was orally administered to the volunteers in a randomized two-period crossover study (2$\times$2 Latin square method). Blood samples were taken at predetermined time intervals for 4 hours, and triflusal was determined using semi-microbore HPLC equipped with an automated column switching system. The HPLC method was validated according to the Bioanalytical Method Validation guideline of the FDA before the plasma samples were analyzed. The pharmacokinetic parameters ($AUC_{0-4h}$, $C_{max}$, and $T_{max}$) were calculated, and ANOVA was used for statistical analysis of the parameters. In the assay validation, the limit of quantification of triflusal in human plasma was 50 ng/ml using 500 $\mu$l of plasma. The accuracy of the assay ranged from 97.76% to 116.51%, while the intra-day and inter-day coefficients of variation over the same concentration range were less than 15%. Average drug concentrations at the designated time intervals and the calculated pharmacokinetic parameters were not significantly different between the two products (p>0.05). The differences in mean $AUC_{0\rightarrow4hr}$, $C_{max}$, and $T_{max}$ between the two products (2.92, 4.39, and -2.44%, respectively) were less than 20%. The power (1-$\beta$) and treatment difference ($\Delta$) for $AUC_{0\rightarrow4hr}$ and $C_{max}$ were more than 0.8 and less than 0.2, respectively. Although the power for $T_{max}$ was under 0.8, $T_{max}$ of the two products was not significantly different (p>0.05). These results satisfied the criteria of the KFDA guideline for bioequivalence, indicating that the two triflusal products were bioequivalent.
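
The pharmacokinetic parameters named above can be computed from a concentration-time profile as sketched below. This assumes the linear trapezoidal rule for AUC and a (test - reference) / reference definition of percent difference; the abstract does not state which conventions were used, and the sample profile is invented for illustration.

```python
def pk_parameters(times_h, concentrations):
    """Return (AUC by the linear trapezoidal rule, Cmax, Tmax) for one profile."""
    if len(times_h) != len(concentrations) or len(times_h) < 2:
        raise ValueError("need at least two matched time/concentration points")
    auc = 0.0
    for i in range(1, len(times_h)):
        dt = times_h[i] - times_h[i - 1]
        auc += dt * (concentrations[i] + concentrations[i - 1]) / 2.0
    c_max = max(concentrations)
    t_max = times_h[concentrations.index(c_max)]  # time of the first maximum
    return auc, c_max, t_max


def percent_difference(test_mean, reference_mean):
    """Relative difference of the test mean from the reference mean, in percent (assumed definition)."""
    return (test_mean - reference_mean) / reference_mean * 100.0


# Invented profile: sampling times in hours, concentrations in arbitrary units.
times = [0.0, 0.5, 1.0, 2.0, 4.0]
conc = [0.0, 2.0, 4.0, 2.0, 1.0]

auc, c_max, t_max = pk_parameters(times, conc)
print(auc, c_max, t_max)
```

Under the criterion quoted in the abstract, the percent differences in mean AUC, Cmax, and Tmax between the two products would each need to stay below 20%.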


Evaluation of the Accuracy of IMERG at Multiple Temporal Scales (시간 해상도 변화에 따른 IMERG 정확도 평가)

  • KIM, Joo-Hun;CHOI, Yun-Seok;KIM, Kyung-Tak
    • Journal of the Korean Association of Geographic Information Studies / v.20 no.4 / pp.102-114 / 2017
  • This study assessed the accuracy of Global Precipitation Measurement (GPM) Integrated Multi-Satellite Retrievals for GPM (IMERG), a rainfall data source derived from satellite images, to evaluate its applicability in ungauged or inaccessible areas. The study area was the whole Korean peninsula, divided into six regions. Automated Surface Observing System (ASOS) rainfall data from the Korean Meteorological Administration and IMERG satellite rainfall data were used. Their average correlation coefficient was 0.46 at a 1-h temporal resolution and increased to 0.69 at a 24-h temporal resolution. The IMERG data underestimated rainfall totals relative to the ground gauges, and the bias decreased as the temporal resolution became coarser. The correlation coefficients for two rainfall events with relatively large rainfall amounts were 0.68 and 0.69 at a 1-h temporal resolution. Additionally, the spatial distributions of the ASOS and IMERG data were similar to each other. The results showed that the IMERG data were very useful for assessing the hydro-meteorological characteristics of ungauged or inaccessible areas. In a future study, the accuracy of satellite-derived rainfall data will be verified by expanding the analysis periods and applying various statistical techniques.
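
The comparison across temporal resolutions can be illustrated by summing hourly rainfall into longer windows and recomputing the correlation. The sketch below assumes the Pearson correlation coefficient and non-overlapping windows; the abstract does not specify either choice, and the function names are illustrative.

```python
import math


def aggregate(series, window):
    """Sum consecutive non-overlapping windows, dropping any incomplete trailing window."""
    n_windows = len(series) // window
    return [sum(series[i * window:(i + 1) * window]) for i in range(n_windows)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least two values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0.0 or var_y == 0.0:
        return float("nan")  # correlation is undefined for a constant series
    return cov / math.sqrt(var_x * var_y)


def correlation_at_resolution(gauge_hourly, satellite_hourly, hours):
    """Correlation between the two series after aggregating both to the given window."""
    return pearson(aggregate(gauge_hourly, hours), aggregate(satellite_hourly, hours))
```

For example, `correlation_at_resolution(gauge, satellite, 24)` would compare daily totals, where `gauge` and `satellite` are hourly rainfall lists covering the same period.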

Positive Random Forest based Robust Object Tracking (Positive Random Forest 기반의 강건한 객체 추적)

  • Cho, Yunsub;Jeong, Soowoong;Lee, Sangkeun
    • Journal of the Institute of Electronics and Information Engineers / v.52 no.6 / pp.107-116 / 2015
  • With the growth of digital devices, the proliferation of high-tech computers, and the availability of high-quality, inexpensive video cameras, the demand for automated video analysis is increasing, especially in the fields of intelligent monitoring systems, video compression, and robot vision. This is why object tracking in computer vision has come into the spotlight. Tracking is the process of locating a moving object over time using a camera. Accounting for the object's scale, rotation, and shape deformation is the most important consideration in robust object tracking. In this paper, we propose a robust object tracking scheme using Random Forest. Specifically, an object detection scheme based on region covariance and ZNCC (zero-mean normalized cross-correlation) is adopted to estimate the object location accurately. Next, the detected region is divided into five regions for random forest-based learning. The five regions are verified by the random forest, and the verified regions are put into the model pool. Finally, the input model is updated to correct the object location when the region does not contain the object. Experiments show that the proposed method locates objects more accurately than existing methods.
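
For reference, ZNCC between two equally sized patches subtracts each patch's mean and divides the cross-correlation by the product of the patches' deviations, which makes the score insensitive to uniform brightness and contrast changes. The sketch below uses plain nested lists; returning 0.0 for a flat patch is an assumption of this example, not something taken from the paper.

```python
import math


def zncc(patch_a, patch_b):
    """Zero-mean normalized cross-correlation of two equally sized 2-D patches, in [-1, 1]."""
    a = [value for row in patch_a for value in row]
    b = [value for row in patch_b for value in row]
    if len(a) != len(b) or not a:
        raise ValueError("patches must be non-empty and the same size")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    dev_a = [value - mean_a for value in a]
    dev_b = [value - mean_b for value in b]
    denom = math.sqrt(sum(x * x for x in dev_a) * sum(y * y for y in dev_b))
    if denom == 0.0:
        return 0.0  # a flat patch has no structure to correlate (assumed convention)
    return sum(x * y for x, y in zip(dev_a, dev_b)) / denom


template = [[10, 20], [30, 40]]
brighter = [[2 * value + 5 for value in row] for row in template]  # gain and offset change
inverted = [[-value for value in row] for row in template]

print(zncc(template, brighter), zncc(template, inverted))
```

A tracker would typically slide the template over candidate windows and keep the location with the highest score.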

Classification of Very High Concerns HRCT Images using Extended Bayesian Networks (확장 베이지안망을 적용한 고위험성 HRCT 영상 분류)

  • Lim, Chae-Gyun;Jung, Yong-Gyu
    • Journal of the Institute of Electronics Engineers of Korea CI / v.49 no.2 / pp.7-12 / 2012
  • Recently, the application of various data mining techniques, including decision trees, neural networks, and Bayesian networks, has been investigated in the medical field to process vast amounts of information efficiently. It has also become common to collect and use additional information such as MRI and HRCT images, beyond basic personal information, patient history, and family history, to improve diagnostic accuracy. In real-world situations, however, many variables affect the results, so the information obtainable through any particular data mining technique is fairly limited. Medical images can also contribute substantially to diagnosis, but because they involve a large share of subjective judgment, they are difficult for an automated system to handle. Given this complexity, multivariate probabilistic models based on Bayesian networks are relatively advantageous, and extended models that apply search algorithms such as K2 or TAN have been proposed. Because the performance of an extended Bayesian network depends significantly on the type of search algorithm applied, the performance and suitability of each technique need to be evaluated. In this paper, we apply extended Bayesian networks to disease diagnosis using the same data and measure how classification accuracy changes with search algorithms such as K2 and TAN. A 10-fold cross-validation experiment was performed to compare performance, and the analysis showed that high-risk data in HRCT images could be identified, making it possible to classify patients at high risk of disease onset.
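
The 10-fold cross-validation mentioned above partitions the data into ten folds and uses each fold once as the test set while training on the rest. The sketch below only generates the index splits; it assumes contiguous, unshuffled folds, and the abstract does not say how the folds were formed.

```python
def k_fold_indices(n_samples, k=10):
    """Return k (train_indices, test_indices) pairs covering every sample once as test data."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and the number of samples")
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds = []
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


folds = k_fold_indices(25, k=10)
for train, test in folds:
    # A classifier (for example, one per search algorithm) would be fit on `train`
    # and scored on `test`; the mean score over the folds is then compared.
    pass
print([len(test) for _, test in folds])
```

Using the same folds for every algorithm keeps the accuracy comparison between algorithms like K2 and TAN on equal footing.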

Dynamic ontology construction algorithm from Wikipedia and its application toward real-time nation image analysis (국가이미지 분석을 위한 위키피디아 실시간 동적 온톨로지 구축 알고리즘 및 적용)

  • Lee, Youngwhan
    • Journal of the Korean Data and Information Science Society / v.27 no.4 / pp.979-991 / 2016
  • Measuring nation images was a challenging task when offline surveys were the only option. Surveys were not only prohibitively expensive but also too time-consuming, and therefore ill-suited to a rapidly changing world. Although the demand for monitoring nation images in real time has kept increasing, no affordable and reliable way to measure them has been available to date. In this study, the researcher developed a semi-automatic ontology construction algorithm, named "double-crossing double keyword collection" (DCDKC), to measure nation images from Wikipedia in real time. The resulting ontology, WikiOnto, can be used to reflect dynamic image changes. An instance of WikiOnto was constructed by applying the algorithm to the three largest exporting countries in East Asia: Korea, Japan, and China. The numbers of page views for the words in this instance were then counted, and the counts for each country were compared with one another to examine whether they could serve as dynamic nation images. The results show how the images of the three countries changed over the study period, and confirm that DCDKC is well suited to a real-time nation-image monitoring system.

A Study on the Choice of Main Entry in German Cataloging Rules; a comparison with the title entry in the Orient (독일목록규칙의 기본기입선정에 관한 연구)

  • Kim Tae-soo
    • Journal of the Korean Society for Library and Information Science / v.21 / pp.61-101 / 1991
  • This study reviews the development of and changes in the main entry principle in German cataloging codes, with special emphasis on RAK. With regard to the functions of the catalog, the traditional title main entry in the Orient is compared with the author main entry in the West. The analysis confirms that RAK adopts various criteria for the choice of entries. For works whose title page names persons who played different roles, as well as for related works and works of mixed responsibility, determining the entry is a complex and time-consuming process, and the criteria have no absolute value. There are also various problems with corporate entries, including confirmation of the originator (Urheber), the choice of either the territorial authority concerned or corporate bodies as an entry depending on the nature of the publication, and the unique bibliographical situation of treaties. This means that the code lacks absolute criteria for selecting entries, which results in a main entry principle that has lost its significance for the purpose of cataloging. Moreover, with the emergence of the ISBD and the realization of automated cataloging, all entries are equal as access points. Treating them as equal would eliminate the need for the personal judgement required in the choice of main entry under the present code, and would bring uniformity and standardization to cataloging practice. As a direct approach to works, the title entry is a more developed finding device than the author entry in cataloging theory. Thus the introduction of a unit card system beginning with the title, as adopted in KCR3, would be desirable, and the complicated rules for the choice of entry could be dropped from cataloging codes. Most user studies show that catalog users place a higher value on the title entry as a finding device, and that each entry is equal as an access point through the unit entry. This means that choosing a given entry as the main entry is unnecessary in cataloging codes. The title entry would be a rather simple, standard, and direct approach to works. This study argues that the traditional title entry of Korea is superior in cataloging theory to the author main entry of the Western world. The recommendation, therefore, is that abandoning the author main entry in cataloging codes should be considered in the future.


Distribution Analysis of Land Surface Temperature about Seoul Using Landsat 8 Satellite Images and AWS Data (Landsat 8 위성영상과 AWS 데이터를 이용한 서울특별시의 지표면 온도 분포 분석)

  • Lee, Jong-Sin;Oh, Myoung-Kwan
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.1 / pp.434-439 / 2019
  • Recently, interest in urban temperature change and land surface temperature change has been increasing because of weather phenomena driven by global warming and the heat island effect caused by urbanization. In Korea, weather data such as temperature and precipitation have been collected since 1904, and in recent years there have been 96 ASOS stations and 494 AWS weather observation stations. However, because a ground network provides point data only at each installation site, meteorological data away from the measurement points must be estimated through interpolation. In this study, to improve the resolution of land surface temperature measurement, surface temperature was calculated from satellite images and its applicability was analyzed. For this purpose, Landsat 8 OLI TIRS satellite images of Seoul Metropolitan City were obtained for each season and converted to surface temperature by applying the NASA equation to the thermal bands. The ground measurement data were the temperatures measured by AWS. Since the AWS temperature data are station-based point data, they were interpolated by Kriging for comparison with the Landsat images. Comparing the satellite-derived surface temperature with the AWS temperature data on the basis of RMSE, the applicability of the Landsat satellite images ranked, by season, in the order of fall, winter, summer, and spring. The use of that attribute and AWS support starts at $2.11^{\circ}C$ and RMSE ${\pm}3.84^{\circ}C$, which reflects information from the extended NASA.
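
The RMSE comparison between satellite-derived and station-based temperatures reduces to the calculation sketched below. The function name and the sample values are illustrative only; in the study the AWS values were first interpolated by Kriging so that both datasets cover the same locations.

```python
import math


def rmse(estimated, observed):
    """Root mean square error between paired estimates and observations."""
    if len(estimated) != len(observed) or not estimated:
        raise ValueError("need two non-empty sequences of equal length")
    squared_errors = [(e - o) ** 2 for e, o in zip(estimated, observed)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Invented temperatures in degrees Celsius at matching locations.
satellite_lst = [21.0, 24.0, 19.0, 26.0]
station_temp = [20.0, 22.0, 20.0, 24.0]

print(rmse(satellite_lst, station_temp))
```

Computing this value separately for each seasonal image gives the per-season figures that the ranking above is based on.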

Comparison and Analysis of Drought Index based on MODIS Satellite Images and ASOS Data for Gyeonggi-Do (경기도 지역에 대한 MODIS 위성영상 및 지점자료기반 가뭄지수의 비교·분석)

  • Yu-Jin, KANG;Hung-Soo, KIM;Dong-Hyun, KIM;Won-Joon, WANG;Han-Eul, LEE;Min-Ho, SEO;Yun-Jae, CHOUNG
    • Journal of the Korean Association of Geographic Information Studies / v.25 no.4 / pp.1-18 / 2022
  • Currently, the Korea Meteorological Administration evaluates meteorological drought by region using SPI6 (standardized precipitation index 6), which is based on 6-month cumulative precipitation. However, SPI is calculated from precipitation alone at 69 weather stations, so it cannot accurately capture drought, which arises from a complex set of causes. The purpose of this study is therefore to calculate and compare, for Gyeonggi, the SPI, which considers only precipitation, and the SDCI (Scaled Drought Condition Index), which considers precipitation, vegetation index, and temperature. In addition, the advantages and disadvantages of the station data-based drought index and the satellite image-based drought index were identified from the comparison of SPI and SDCI. MODIS (MODerate resolution Imaging Spectroradiometer) satellite image data, ASOS (Automated Synoptic Observing System) data, and kriging were used to calculate the SDCI. SDCI1, SDCI3, and SDCI6 were calculated by applying precipitation durations of 1, 3, and 6 months, respectively, to 8 points in 2014. Unlike the SPI, the SDCI showed drought patterns beginning about two months earlier, and drought by city and county in Gyeonggi was clearly revealed. These results indicate that combining satellite image data with station data captures changes in the drought index more effectively and increases the possibility of predicting drought in wet areas as well as in existing dry areas.