• Title/Summary/Keyword: cloud-based

Search Result 2,620, Processing Time 0.032 seconds

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.

Analysis of media trends related to spent nuclear fuel treatment technology using text mining techniques (텍스트마이닝 기법을 활용한 사용후핵연료 건식처리기술 관련 언론 동향 분석)

  • Jeong, Ji-Song;Kim, Ho-Dong
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.2
    • /
    • pp.33-54
    • /
    • 2021
  • With the fourth industrial revolution and the arrival of the New Normal era due to Corona, the importance of Non-contact technologies such as artificial intelligence and big data research has been increasing. Convergent research is being conducted in earnest to keep up with these research trends, but not many studies have been conducted in the area of nuclear research using artificial intelligence and big data-related technologies such as natural language processing and text mining analysis. This study was conducted to confirm the applicability of data science analysis techniques to the field of nuclear research. Furthermore, the study of identifying trends in nuclear spent fuel recognition is critical in terms of being able to determine directions to nuclear industry policies and respond in advance to changes in industrial policies. For those reasons, this study conducted a media trend analysis of pyroprocessing, a spent nuclear fuel treatment technology. We objectively analyze changes in media perception of spent nuclear fuel dry treatment techniques by applying text mining analysis techniques. Text data specializing in Naver's web news articles, including the keywords "Pyroprocessing" and "Sodium Cooled Reactor," were collected through Python code to identify changes in perception over time. The analysis period was set from 2007 to 2020, when the first article was published, and detailed and multi-layered analysis of text data was carried out through analysis methods such as word cloud writing based on frequency analysis, TF-IDF and degree centrality calculation. Analysis of the frequency of the keyword showed that there was a change in media perception of spent nuclear fuel dry treatment technology in the mid-2010s, which was influenced by the Gyeongju earthquake in 2016 and the implementation of the new government's energy conversion policy in 2017. Therefore, trend analysis was conducted based on the corresponding time period, and word frequency analysis, TF-IDF, degree centrality values, and semantic network graphs were derived. Studies show that before the 2010s, media perception of spent nuclear fuel dry treatment technology was diplomatic and positive. However, over time, the frequency of keywords such as "safety", "reexamination", "disposal", and "disassembly" has increased, indicating that the sustainability of spent nuclear fuel dry treatment technology is being seriously considered. It was confirmed that social awareness also changed as spent nuclear fuel dry treatment technology, which was recognized as a political and diplomatic technology, became ambiguous due to changes in domestic policy. This means that domestic policy changes such as nuclear power policy have a greater impact on media perceptions than issues of "spent nuclear fuel processing technology" itself. This seems to be because nuclear policy is a socially more discussed and public-friendly topic than spent nuclear fuel. Therefore, in order to improve social awareness of spent nuclear fuel processing technology, it would be necessary to provide sufficient information about this, and linking it to nuclear policy issues would also be a good idea. In addition, the study highlighted the importance of social science research in nuclear power. It is necessary to apply the social sciences sector widely to the nuclear engineering sector, and considering national policy changes, we could confirm that the nuclear industry would be sustainable. However, this study has limitations that it has applied big data analysis methods only to detailed research areas such as "Pyroprocessing," a spent nuclear fuel dry processing technology. Furthermore, there was no clear basis for the cause of the change in social perception, and only news articles were analyzed to determine social perception. Considering future comments, it is expected that more reliable results will be produced and efficiently used in the field of nuclear policy research if a media trend analysis study on nuclear power is conducted. Recently, the development of uncontact-related technologies such as artificial intelligence and big data research is accelerating in the wake of the recent arrival of the New Normal era caused by corona. Convergence research is being conducted in earnest in various research fields to follow these research trends, but not many studies have been conducted in the nuclear field with artificial intelligence and big data-related technologies such as natural language processing and text mining analysis. The academic significance of this study is that it was possible to confirm the applicability of data science analysis technology in the field of nuclear research. Furthermore, due to the impact of current government energy policies such as nuclear power plant reductions, re-evaluation of spent fuel treatment technology research is undertaken, and key keyword analysis in the field can contribute to future research orientation. It is important to consider the views of others outside, not just the safety technology and engineering integrity of nuclear power, and further reconsider whether it is appropriate to discuss nuclear engineering technology internally. In addition, if multidisciplinary research on nuclear power is carried out, reasonable alternatives can be prepared to maintain the nuclear industry.

Construction of X-band automatic radar scatterometer measurement system and monitoring of rice growth (X-밴드 레이더 산란계 자동 측정시스템 구축과 벼 생육 모니터링)

  • Kim, Yi-Hyun;Hong, Suk-Young;Lee, Hoon-Yol
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.43 no.3
    • /
    • pp.374-383
    • /
    • 2010
  • Microwave radar can penetrate cloud cover regardless of weather conditions and can be used day and night. Especially a ground-based polarimetric scatterometer has advantages of monitoring crop conditions continuously with full polarization and different frequencies. Kim et al. (2009) have measured backscattering coefficients of paddy rice using L-, C-, X-band scatterometer system with full polarization and various angles during the rice growth period and have revealed the necessity of near-continuous automatic measurement to eliminate the difficulties, inaccuracy and sparseness of data acquisitions arising from manual operation of the system. In this study, we constructed an X-band automatic scatterometer system, analyzed scattering characteristics of paddy rice from X-band scatterometer data and estimated rice growth parameter using backscattering coefficients in X-band. The system was installed inside a shelter in an experimental paddy field at the National Academy of Agricultural Science (NAAS) before rice transplanting. The scatterometer system consists of X-band antennas, HP8720D vector network analyzer, RF cables and personal computer that controls frequency, polarization and data storage. This system using automatically measures fully-polarimetric backscattering coefficients of rice crop every 10 minutes. The backscattering coefficients were calculated from the measured data at a fixed incidence angle of $45^{\circ}$ and with full polarization (HH, VV, HV, VH) by applying the radar equation and compared with rice growth data such as plant height, stem number, fresh dry weight and Leaf Area Index (LAI) that were collected at the same time of each rice growth parameter. We examined the temporal behaviour of the backscattering coefficients of the rice crop at X-band during rice growth period. The HH-, VV-polarization backscattering coefficients steadily increased toward panicle initiation stage, thereafter decreased and again increased in early-September. We analyzed the relationships between backscattering coefficients in X-band and plant parameters and predicted the rice growth parameters using backscattering coefficients. It was confirmed that X-band is sensitive to grain maturity at near harvesting season.

Study on Influencing Factors of Traffic Accidents in Urban Tunnel Using Quantification Theory (In Busan Metropolitan City) (수량화 이론을 이용한 도시부 터널 내 교통사고 영향요인에 관한 연구 - 부산광역시를 중심으로 -)

  • Lim, Chang Sik;Choi, Yang Won
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.1
    • /
    • pp.173-185
    • /
    • 2015
  • This study aims to investigate the characteristics and types of car accidents and establish a prediction model by analyzing 456 car accidents having occurred in the 11 tunnels in Busan, through statistical analysis techniques. The results of this study can be summarized as below. As a result of analyzing the characteristics of car accidents, it was found that 64.9% of all the car accidents took place in the tunnels between 08:00 and 18:00, which was higher than 45.8 to 46.1% of the car accidents in common roads. As a result of analyzing the types of car accidents, the car-to-car accident type was the majority, and the sole-car accident type in the tunnels was relatively high, compared to that in common roads. Besides, people at the age between 21 and 40 were most involved in car accidents, and in the vehicle type of the first party to car accidents, trucks showed a high proportion, and in the cloud cover, rainy days or cloudy days showed a high proportion unlike clear days. As a result of analyzing the principal components of car accident influence factors, it was found that the first principal components were road, tunnel structure and traffic flow-related factors, the second principal components lighting facility and road structure-related factors, the third principal factors stand-by and lighting facility-related factors, the fourth principal components human and time series-related factors, the fifth principal components human-related factors, the sixth principal components vehicle and traffic flow-related factors, and the seventh principal components meteorological factors. As a result of classifying car accident spots, there were 5 optimized groups classified, and as a result of analyzing each group based on Quantification Theory Type I, it was found that the first group showed low explanation power for the prediction model, while the fourth group showed a middle explanation power and the second, third and fifth groups showed high explanation power for the prediction model. Out of all the items(principal components) over 0.2(a weak correlation) in the partial correlation coefficient absolute value of the prediction model, this study analyzed variables including road environment variables. As a result, main examination items were summarized as proper traffic flow processing, cross-section composition(the width of a road), tunnel structure(the length of a tunnel), the lineal of a road, ventilation facilities and lighting facilities.

An Analysis for Deriving New Convergent Service of Mobile Learning: The Case of Social Network Analysis and Association Rule (모바일 러닝에서의 신규 융합서비스 도출을 위한 분석: 사회연결망 분석과 연관성 분석 사례)

  • Baek, Heon;Kim, Jin Hwa;Kim, Yong Jin
    • Information Systems Review
    • /
    • v.15 no.3
    • /
    • pp.1-37
    • /
    • 2013
  • This study is conducted to explore the possibility of service convergence to promote mobile learning. This study has attempted to identify how mobile learning service is provided, which services among them are considered most popular, and which services are highly demanded by users. This study has also investigated the potential opportunities for service convergence of mobile service and e-learning. This research is then extended to examine the possibility of active convergence of common services in mobile services and e-learning. Important variables have been identified from related web pages of portal sites using social network analysis (SNA) and association rules. Due to the differences in number and type of variables on different web pages, SNA was used to deal with the difficulties of identifying the degree of complex connection. Association analysis has been used to identify association rules among variables. The study has revealed that most frequent services among common services of mobile services and e-learning were Games and SNS followed by Payment, Advertising, Mail, Event, Animation, Cloud, e-Book, Augmented Reality and Jobs. This study has also found that Search, News, GPS in mobile services were turned out to be very highly demanded while Simulation, Culture, Public Education were highly demanded in e-learning. In addition, It has been found that variables involving with high service convergence based on common variables of mobile and e-learning services were Games and SNS, Games and Sports, SNS and Advertising, Games and Event, SNS and e-Book, Games and Community in mobile services while Games, Animation, Counseling, e-Book, being preceding services Simulation, Speaking, Public Education, Attendance Management were turned out be highly convergent in e-learning services. Finally, this study has attempted to predict possibility of active service convergence focusing on Games, SNS, e-Book which were highly demanded common services in mobile and e-learning services. It is expected that this study can be used to suggest a strategic direction to promote mobile learning by converging mobile services and e-learning.

  • PDF

Fog Detection over the Korean Peninsula Derived from Satellite Observations of Polar-orbit (MODIS) and Geostationary (GOES-9) (극궤도(MODIS) 및 정지궤도(GOES-9) 위성 관측을 이용한 한반도에서의 안개 탐지)

  • Yoo, Jung-Moon;Yun, Mi-Young;Jeong, Myeong-Jae;Ahn, Myoung-Hwan
    • Journal of the Korean earth science society
    • /
    • v.27 no.4
    • /
    • pp.450-463
    • /
    • 2006
  • Seasonal threshold values for fog detection over the ten airport areas within the Korean Peninsula have been derived from the data of polar-orbit Aqua/Terra MODIS and geostationary GOES-9 during a two years. The values are obtained from reflectance at $0.65{\mu}m\;(R_{0.65})$ and the difference in brightness temperature between $3.7{\mu}m\;and\;11{\mu}m\;(T_{3.7-11})$. In order to examine the discrepancy between the threshold values of two kinds of satellites, the following four parameters have been analyzed under the condition of daytime/nighttime and fog/clear-sky, utilizing their simultaneous observations over the Seoul metropolitan area: brightness temperature at $3.7{\mu}m$, the temperature at $11{\mu}m,\;the\;T_{3.7-11}$ for day and night, and the $R_{0.65}$ for daytime. The parameters show significant correlations (r<0.5) in spatial distribution between the two kinds of satellites. The discrepancy between their infrared thresholds is mainly due to the disagreement in their spatial resolutions and spectral bands, particularly at $3.7{\mu}m$. Fog detection from GOES-9 over the nine airport areas except the Cheongju airport has revealed accuracy of 60% in the daytime and 70% in the nighttime, based on statistical verification. The accuracy decreases in foggy cases with twilight, precipitation, short persistence, or the higher cloud above fog. The sensitivity of radiance and reflectance with wavelength has been analyzed in numerical experiments with respect to various meteorological conditions to investigate optical characteristics of the three channels.

Intercomparing the Aerosol Optical Depth Using the Geostationary Satellite Sensors (AHI, GOCI and MI) from Yonsei AErosol Retrieval (YAER) Algorithm (연세에어로졸 알고리즘을 이용하여 정지궤도위성 센서(AHI, GOCI, MI)로부터 산출된 에어로졸 광학두께 비교 연구)

  • Lim, Hyunkwang;Choi, Myungje;Kim, Mijin;Kim, Jhoon;Go, Sujung;Lee, Seoyoung
    • Journal of the Korean earth science society
    • /
    • v.39 no.2
    • /
    • pp.119-130
    • /
    • 2018
  • Aerosol Optical Properties (AOPs) are retrieved using the geostationary satellite instruments such as Geostationary Ocean Color Imager (GOCI), Meteorological Imager (MI), and Advanced Himawari Imager (AHI) through Yonsei AErosol Retrieval algorithm (YAER). In this study, the retrieved aerosol optical depths (AOD)s from each instrument were intercompared and validated with the ground-based sunphotometer AErosol Robotic NETwork (AERONET) data. As a result, the four AOD products derived from different instruments showed consistent results over land and ocean. However, AODs from MI and GOCI tend to be overestimated due to cloud contamination. According to the comparison results with AERONET, the percentage within expected errors (EE) are 36.3, 48.4, 56.6, and 68.2% for MI, GOCI, AHI-minimum reflectivity method (MRM), and AHI-estimated surface reflectance from shortwave Infrared (ESR) product, respectively. Since MI AOD is retrieved from a single visible channel, and adopts only one aerosol type by season, EE is relatively lower than other products. On the other hand, the AHI ESR is more accurate than the minimum reflectance method as used by GOCI, MI, and AHI MRM method in May and June when the vegetation is relatively abundant. These results are explained by the RMSE and the EE for each AERONET site. The ESR method result show to be better than the other satellite product in terms of EE for 15 out of 22 sites used for validation, and they are better than the other product for 13 sites in terms of RMSE. In addition, the error in observation time in each product is found by using characteristics of geostationary satellites. The absolute median biases at 00 to 06 Universal Time Coordinated (UTC) are 0.05, 0.09, 0.18, 0.18, 0.14, 0.09, and 0.10. The absolute median bias by observation time has appeared in MI and the only 00 UTC appeared in GOCI.

Establishment of Release Limits for Airborne Effluent into the Environment Based on ALARA Concept (ALARA 개념(槪念)에 의한 기체상방사성물질(氣體狀放射性物質)의 환경방출한도(環境放出限度) 설정(設定))

  • Lee, Byung-Ki;Cha, Moon-Hoe;Nam, Soon-Kwon;Chang, Si-Young;Ha, Chung-Woo
    • Journal of Radiation Protection and Research
    • /
    • v.10 no.1
    • /
    • pp.50-63
    • /
    • 1985
  • A derivation of new release limit, named Derived Release Limit(DRL), into the atomsphere from a reference nuclear power plant has been performed on the basis of the new system of dose limitation recommended by the ICRP, instead of the (MPC)a limit which has been currently used until now as a general standard for radioactive effluents in Korea. In DRL Calculation, a Concentration Factor Method was applied, in which the concentrations of long-term routinely released radionuclides were in equilibrium with dose in environment under the steady state condition. The analytical model used in the exposure pathway analysis was the one which has been suggested by the USNRC and the exposure limits applied in this analysis were those recommended by the USEPA lately. In the exposure pathway analysis, all of the pathways are not considered and some may be excluded either because they are not applicable or their contribution to the exposure is insignificant compared with other pathways. In case, the environmental model developed in this study was applied to the Kori nuclear power plant as the reference power plant, the highest DRL value was calculated to be as $9.10{\times}10^6Ci/yr$ for Kr-85 in external whole body exposure from the semi-infinite radioactive cloud, while the lowest DRL value was observed 3.64Ci/yr for Co-60 in external whole body exposure from the contaminated ground, by the radioactive particulates. The most critical exposure pathway to an individual in the unrestricted area of interest (Kilchun-Ri, 1.3 km to the north of the release point) seems to be the exposure pathway from the contaminated ground and the most critical radionuclide in all pathways appears to be Co-60 in the same pathway. When comparing the actual release rate from KNU-l in 1982 with the DRL's obtained here the release of radionuclides from KNU-1 were much lower than the DRL's and it could be conclued that the exposure to an individual had been kept below the exposure limits recommended by the USEPA.

  • PDF

Characteristics of Vertical Ozone Distributions in the Pohang Area, Korea (포항지역 오존의 수직분포 특성)

  • Kim, Ji-Young;Youn, Yong-Hoon;Song, Ki-Bum;Kim, Ki-Hyun
    • Journal of the Korean earth science society
    • /
    • v.21 no.3
    • /
    • pp.287-301
    • /
    • 2000
  • In order to investigate the factors and processes affecting the vertical distributions of ozone, we analyzed the ozone profile data measured using ozonesonde from 1995 to 1997 at Pohang city, Korea. In the course of our study, we analyzed temporal and spatial distribution characteristics of ozone at four different heights: surface (100m), troposphere (10km), lower stratosphere (20km), and middle stratosphere (30km). Despite its proximity to a local, but major, industrial complex known as Pohang Iron and Steel Co. (POSCO), the concentrations of surface ozone in the study area were comparable to those typically observed from rural and/or unpolluted area. In addition, the findings of relative enhancement of ozone at this height, especially between spring and summer may be accounted for by the prevalence of photochemical reactions during that period of year. The temporal distribution patterns for both 10 and 20km heights were quite compatible despite large differences in their altitudes with such consistency as spring maxima and summer minima. Explanations for these phenomena may be sought by the mixed effects of various processes including: ozone transport across two heights, photochemical reaction, the formation of inversion layer, and so on. However, the temporal distribution pattern for the middle stratosphere (30km) was rather comparable to that of the surface. We also evaluated total ozone concentration of the study area using Brewer spectrophotometer. The total ozone concentration data were compared with those derived by combining the data representing stratospheric layers via Umkehr method. The results of correlation analysis showed that total ozone is negatively correlated with cloud cover but not with such parameter as UV-B. Based on our study, we conclude that areal characteristics of Pohang which represents a typical coastal area may be quite important in explaining the distribution patterns of ozone not only from surface but also from upper atmosphere.

  • PDF

Estimation of Rice and Soybean Growth Stage Using a Microwave Scatterometer (마이크로파 산란계를 이용한 벼, 콩 생육단계 추정)

  • Kim, Yi-Hyun;Hong, Suk-Young;Lee, Hoon-Yol;Lee, Jae-Eun;Lee, Kyung-Do
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.45 no.4
    • /
    • pp.503-510
    • /
    • 2012
  • Microwave radar can penetrate cloud cover regardless of weather conditions and can be used day and night. Especially a A ground-based polarimetric scatterometer operating at multiple frequencies can continuously monitor the crop conditions. We analyzed scattering characteristics of rice and soybean using pauli decomposition method. Surface scattering (${\alpha}$) is the dominant component over the entire stages for all bands and pauli decomposition value was the highest for L-band. Double bounce scattering (${\beta}$) and volume scattering (${\gamma}$) were approximately equal for C-band and volume scattering was higher than double bounce scattering for X-band in rice field. In soybean, double bounce scattering becomes higher than volume scattering during the R2 stage (DOY 224) and there was a significant difference between the two components after the R4 stage (DOY 242) for L-band. The maximum growth stage of soybean can also be detected using L-band double bounce scattering. The peak of double bounce effect coincides with the peak of growth biophysical variables on DOY 271. We found that pauli decomposition can provide insight on the relative magnitude of different scattering mechanisms during the rice and soybean growth cycle.