• Title/Summary/Keyword: Pseudo data

Automatic Generation of Training Data for Korean Speech Recognition Post-Processor (한국어 음성인식 후처리기를 위한 학습 데이터 자동 생성 방안)

  • Seonmin Koo;Chanjun Park;Hyeonseok Moon;Jaehyung Seo;Sugyeong Eo;Yuna Hur;Heuiseok Lim
    • Annual Conference on Human and Language Technology / 2022.10a / pp.465-469 / 2022
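  • As automatic speech recognition (ASR) technology has advanced, automatic post-processors have been studied as one way to improve the performance of ASR systems. Training a post-processor requires a parallel corpus that contains error types. One simple way to build such a corpus is to insert errors into reference sentences to generate erroneous sentences, which yields a pseudo parallel corpus. However, the inserted errors may not be realistic. To mitigate this, a methodology exists that uses Back TranScription (BTS) to generate the parallel corpus for training the post-processor model. There is a view, however, that a corpus generated this way may contain too little noise. This study therefore compares the performance of the BTS methodology with that of methodologies in which the noise intensity is artificially increased. The comparison confirms that BTS achieves the highest quantitative performance, and a qualitative analysis shows that the BTS methodology produces a parallel corpus containing more of the realistic errors that can occur in actual speech recognition.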

  • PDF
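
The noise-insertion baseline described in the abstract above can be sketched as follows. This is only an illustrative sketch, not the authors' method: the function names `inject_noise` and `build_pseudo_corpus`, the three edit operations, and the `rate` parameter are all assumptions introduced here, and a real Korean pipeline might well operate on sub-character units instead of whole characters.

```python
import random


def inject_noise(sentence, rate=0.1, seed=None):
    """Return a noisy copy of `sentence` made by random character edits.

    Hypothetical noise model: each non-space character is, with
    probability `rate`, deleted, duplicated, or swapped with its neighbor.
    """
    rng = random.Random(seed)
    chars = list(sentence)
    out = []
    i = 0
    while i < len(chars):
        ch = chars[i]
        if ch != " " and rng.random() < rate:
            op = rng.choice(["delete", "duplicate", "swap"])
            if op == "delete":
                i += 1
                continue
            if op == "duplicate":
                out.extend([ch, ch])
                i += 1
                continue
            if op == "swap" and i + 1 < len(chars):
                out.extend([chars[i + 1], ch])
                i += 2
                continue
            # A swap on the last character falls through unchanged.
        out.append(ch)
        i += 1
    return "".join(out)


def build_pseudo_corpus(references, rate=0.1, seed=0):
    """Pair each noisy sentence (source) with its clean reference (target)."""
    return [
        (inject_noise(ref, rate, seed + k), ref)
        for k, ref in enumerate(references)
    ]
```

Raising `rate` corresponds loosely to the "increased noise intensity" condition in the abstract. BTS instead obtains the erroneous side by synthesizing speech and transcribing it back, which this sketch does not attempt.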

Applicability of Pseudostatic Analysis for the Seismic Design of Temporary Retaining Structures in a Deep Excavation (흙막이 가시설 내진설계를 위한 등가정적해석의 유효성 분석)

  • Yu, Sang-Hwa;Kim, Dong-Chan;Kim, Jongkwan;Han, Jin-Tae
    • Journal of the Korean Geotechnical Society / v.39 no.9 / pp.35-50 / 2023
  • This preliminary study was conducted to develop seismic design guidelines for temporary retaining structures in deep excavations. It comprised a comprehensive literature review of the seismic design standards applied domestically and internationally, as well as of the various methods for calculating seismic earth pressure for pseudostatic analysis. FLAC 2D, a two-dimensional finite difference analysis program, was used to perform pseudostatic analyses with the semirigid pressure method, the Wood method, and the Mononobe-Okabe method. The resulting wall moments and strut axial forces were compared with dynamic analysis results to evaluate the applicability of pseudostatic analysis. The semirigid pressure method predicted the most reasonable moments for stiff walls experiencing horizontal displacements of up to 0.4%H. Predicting the strut axial force accurately was difficult because pseudostatic analysis cannot account for dynamic soil-structure interaction; however, the approach is considered suitable for a conservative preliminary review to ensure safety.
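
Of the three earth-pressure methods named above, the Mononobe-Okabe method has a closed-form coefficient that is easy to illustrate. The sketch below uses the standard textbook expression for the active case with a dry cohesionless backfill. The function names and default arguments are assumptions made here for illustration, and nothing in it reflects the FLAC 2D models or parameter values used in the study.

```python
import math


def mononobe_okabe_kae(phi_deg, kh, kv=0.0, delta_deg=0.0,
                       theta_deg=0.0, beta_deg=0.0):
    """Mononobe-Okabe dynamic active earth pressure coefficient K_AE.

    phi: soil friction angle, delta: wall-soil friction angle,
    theta: wall inclination from vertical, beta: backfill slope,
    kh, kv: horizontal and vertical seismic coefficients.
    """
    phi, delta, theta, beta = (
        math.radians(a) for a in (phi_deg, delta_deg, theta_deg, beta_deg)
    )
    psi = math.atan2(kh, 1.0 - kv)  # seismic inertia angle
    if phi - beta - psi < 0.0:
        raise ValueError("phi - beta - psi < 0: the expression has no real solution")
    root = math.sqrt(
        math.sin(delta + phi) * math.sin(phi - beta - psi)
        / (math.cos(delta + theta + psi) * math.cos(beta - theta))
    )
    numerator = math.cos(phi - theta - psi) ** 2
    denominator = (
        math.cos(psi) * math.cos(theta) ** 2
        * math.cos(delta + theta + psi) * (1.0 + root) ** 2
    )
    return numerator / denominator


def seismic_active_thrust(gamma, height, kae, kv=0.0):
    """Total active thrust per unit wall length: 0.5 * K_AE * gamma * H^2 * (1 - kv)."""
    return 0.5 * kae * gamma * height ** 2 * (1.0 - kv)
```

With `kh = 0` and a vertical, frictionless wall the coefficient reduces to the static Rankine value (1 - sin φ)/(1 + sin φ), which is a convenient sanity check.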

Computation of Apparent Resistivity from Marine Controlled-source Electromagnetic Data for Identifying the Geometric Distribution of Gas Hydrate (가스 하이드레이트 부존양상 도출을 위한 해양 전자탐사 자료의 겉보기 비저항 계산)

  • Noh, Kyu-Bo;Kang, Seo-Gi;Seol, Soon-Jee;Byun, Joong-Moo
    • Geophysics and Geophysical Exploration / v.15 no.2 / pp.75-84 / 2012
  • In a marine controlled-source electromagnetic (mCSEM) survey, the sea layer changes the conventional definition of apparent resistivity used in land CSEM surveys. Developing a new algorithm that computes apparent resistivity for mCSEM surveys can therefore be a first step in mCSEM data interpretation. First, we compared and analyzed the electromagnetic responses of a 1D stratified gas hydrate model and a half-space model below the sea layer. The amplitude and phase components proved better suited to computing apparent resistivity than the real and imaginary components. Moreover, the amplitude component was more sensitive to subsurface resistivity than the phase component in the far-offset range, and vice versa. We suggest the induction number as the criterion for selecting the amplitude or phase component when calculating apparent resistivity. Based on these findings, we developed a numerical algorithm that uses a grid search to compute the apparent resistivity corresponding to measured mCSEM data. We then verified the algorithm by applying it to stratified gas hydrate models with various model parameters. Finally, by constructing apparent resistivity pseudo-sections from the mCSEM responses of 2D numerical models simulating gas hydrate deposits in the Ulleung Basin, we confirmed that apparent resistivity can provide information on the geometric distribution of a gas hydrate deposit.

Phosphate sorption to quintinite in aqueous solutions: Kinetic, thermodynamic and equilibrium analyses

  • Kim, Jae-Hyun;Park, Jeong-Ann;Kang, Jin-Kyu;Kim, Song-Bae;Lee, Chang-Gu;Lee, Sang-Hyup;Choi, Jae-Woo
    • Environmental Engineering Research / v.20 no.1 / pp.73-78 / 2015
  • The aim of this study was to examine phosphate (P) removal from aqueous solutions by quintinite. Batch experiments were performed to examine the effects of reaction time, temperature, initial phosphate concentration, initial solution pH, and stream water on phosphate adsorption to quintinite. Kinetic, thermodynamic, and equilibrium isotherm models were used to analyze the experimental data. The maximum P adsorption capacity was 4.77 mgP/g under the given conditions (initial P concentration = 2-20 mgP/L; adsorbent dose = 1.2 g/L; reaction time = 4 hr). Kinetic model analysis showed that the pseudo second-order model was the most suitable for describing the kinetic data. Thermodynamic analysis indicated that phosphate sorption to quintinite increased as the temperature rose from 15 to $45\,^{\circ}\mathrm{C}$, indicating that the sorption process is spontaneous and endothermic ($\Delta H^0 = 487.08$ kJ/mol; $\Delta S^0 = 1{,}696.12$ J/(K$\cdot$mol); $\Delta G^0 = -1.67$ to $-52.56$ kJ/mol). Equilibrium isotherm analysis demonstrated that both the Freundlich and Redlich-Peterson models were suitable for describing the equilibrium data. In the pH experiments, phosphate adsorption to quintinite did not vary at pH 3.0-7.1 (1.50-1.55 mgP/g) but decreased considerably in a highly alkaline solution (0.70 mgP/g at pH 11.0). Under the given conditions (initial P concentration = 2 mgP/L; adsorbent dose = 0.8 g/L; reaction time = 4 hr), phosphate removal in the stream water (1.88 mgP/g) was lower than that in the synthetic solution (2.07 mgP/g), possibly because of anions such as (bi)carbonate and sulfate in the stream water.
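
The thermodynamic figures quoted above are linked by the Gibbs relation ΔG⁰ = ΔH⁰ - TΔS⁰, and a short check shows how they fit together. The sketch assumes that the two ends of the reported ΔG⁰ range correspond to 15 and 45 °C, which the abstract implies but does not state, and the function name is introduced here purely for illustration.

```python
def gibbs_free_energy(delta_h_kj, delta_s_j, temp_c):
    """Return Delta G in kJ/mol from Delta H (kJ/mol), Delta S (J/(K*mol)), T (deg C)."""
    temp_k = temp_c + 273.15
    # Convert Delta S from J to kJ so both terms share the same unit.
    return delta_h_kj - temp_k * delta_s_j / 1000.0


# Values as reported in the abstract.
DELTA_H = 487.08   # kJ/mol
DELTA_S = 1696.12  # J/(K*mol)

for temp_c in (15.0, 45.0):
    delta_g = gibbs_free_energy(DELTA_H, DELTA_S, temp_c)
    print(f"T = {temp_c:.0f} C: Delta G = {delta_g:.2f} kJ/mol")
```

Under that assumption the computed values land within a few hundredths of a kJ/mol of the reported -1.67 and -52.56 kJ/mol. The sign pattern (positive ΔH⁰, negative ΔG⁰) is what the abstract calls endothermic and spontaneous.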

A Design of Framework for Thin-Client by using X Protocol based Application (X 프로토콜 기반의 애플리케이션을 통한 씬-클라이언트 프레임워크 설계)

  • Song, Min-Gyu
    • Journal of Digital Contents Society / v.10 no.4 / pp.509-520 / 2009
  • Advances in network and application technology are causing major changes in how IT (information technology) equipment, including computers and mobile systems, is used. Having started with mainframes in the 1960s and 1970s, passed through the client-server paradigm in the 1980s, and moved toward network computers since the 1990s, computer systems are now evolving from isolated physical systems into complementary network-based virtual systems [1][2]. In a network-based computer system, the applications and data required for operation are stored not on the local client but on a server [1]. Users can use applications and data on the server as if they were on the local client, and the client is evolving into a thin, network-friendly system. In this paper, we discuss possible ways to implement a thin client efficiently. To use remote applications and data as if they were in the local environment, we use the X protocol. Unlike the conventional simple client-server paradigm, we design a proxy as a middle-tier server to improve QoS and session persistence. An X server and Xvfb (X virtual framebuffer) are implemented on the thin client and the server, respectively, and we apply XSMP (X Session Management Protocol) to our framework for session management. Finally, going beyond simple transfer of the server display, we propose a thin-client framework for transferring remote server applications over the Internet.

  • PDF

A Study on the Digital Filter Design for Radio Astronomy Using FPGA (FPGA를 이용한 전파천문용 디지털 필터 설계에 관한 기본연구)

  • Jung, Gu-Young;Roh, Duk-Gyoo;Oh, Se-Jin;Yeom, Jae-Hwan;Kang, Yong-Woo;Lee, Chang-Hoon;Chung, Hyun-Soo;Kim, Kwang-Dong
    • Journal of the Institute of Convergence Signal Processing / v.9 no.1 / pp.62-74 / 2008
  • In this paper, we propose the design of a symmetric digital filter core for use in radio astronomy. The FIR filter core required by the Data Acquisition System (DAS) of the Korean VLBI Network (KVN) is designed in VHDL for a Xilinx Virtex-4 SX55 FPGA. The filter has a symmetric structure, which improves system efficiency by sharing filter coefficients. The SFFU (Symmetric FIR Filter Unit) uses parallel processing to process data efficiently within the constrained system clock. For an effective SFFU design, the unified synthesis software ISE Foundation and Core Generator, which offer an excellent GUI environment, were used for overall IP core synthesis and experiments. The synthesis results for the digital filter core show that resource usage, such as slice LUTs, is below 40% and that the maximum operating frequency exceeds 260 MHz. SFFU simulations with ModelSim 6.1a from Mentor Graphics also confirmed that the SFFU operates without error. To verify the function of the SFFU, we carried out additional simulation experiments in Matlab using a pseudo signal. Comparing the simulation results with those of the designed digital FIR filter confirmed that the filter performs its basic filtering function well. We thus verified the effectiveness of the designed symmetric FIR digital filter using an FPGA and VHDL.

  • PDF
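
The coefficient sharing mentioned in the abstract above relies on a general property of symmetric FIR filters: when h[k] = h[N-1-k], the two input samples that share a coefficient can be added first, so roughly half as many multiplications are needed. The following is a behavioral Python sketch of that idea only. The paper's design is in VHDL, and the function names, tap values, and zero initial state used here are illustrative assumptions.

```python
def fir_direct(x, h):
    """Reference direct-form FIR: y[n] = sum_k h[k] * x[n - k], zero initial state."""
    y = []
    for n in range(len(x)):
        acc = 0.0
        for k in range(len(h)):
            if n - k >= 0:
                acc += h[k] * x[n - k]
        y.append(acc)
    return y


def fir_symmetric(x, h):
    """Symmetric FIR using pre-addition so each coefficient pair needs one multiply."""
    n_taps = len(h)
    if any(h[k] != h[n_taps - 1 - k] for k in range(n_taps // 2)):
        raise ValueError("coefficients are not symmetric")

    def sample(i):
        # Samples before the start of the input are treated as zero.
        return x[i] if 0 <= i < len(x) else 0.0

    half = n_taps // 2
    y = []
    for n in range(len(x)):
        acc = 0.0
        for k in range(half):
            # Pre-add the two samples that share coefficient h[k].
            acc += h[k] * (sample(n - k) + sample(n - (n_taps - 1 - k)))
        if n_taps % 2 == 1:
            acc += h[half] * sample(n - half)  # unpaired center tap
        y.append(acc)
    return y
```

For an N-tap filter the symmetric form uses about N/2 multipliers in place of N, which is the kind of resource saving a hardware implementation can exploit.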

Radiometric Cross Calibration of KOMPSAT-3 and Landsat-8 for Time-Series Harmonization (KOMPSAT-3와 Landsat-8의 시계열 융합활용을 위한 교차검보정)

  • Ahn, Ho-yong;Na, Sang-il;Park, Chan-won;Hong, Suk-young;So, Kyu-ho;Lee, Kyung-do
    • Korean Journal of Remote Sensing / v.36 no.6_2 / pp.1523-1535 / 2020
  • Producing crop information from remote sensing relies on classification and growth monitoring based on crop phenology, which requires time-series satellite images at short intervals. However, acquiring time-series data from a single satellite is limited, so fusion with other earth observation satellites is necessary. Before image data from different satellites are fused, the inherent differences in their radiometric characteristics must be overcome. As a first step toward fusion, this study performed a cross calibration of Korea Multi-Purpose Satellite-3 (KOMPSAT-3) with Landsat-8. Top of Atmosphere (TOA) reflectance was compared after applying a Spectral Band Adjustment Factor (SBAF) to each satellite, derived by hyperspectral sensor band aggregation. After cross calibration, KOMPSAT-3 and Landsat-8 differed in reflectance by less than 4% in the Blue, Green, and Red bands and by 6% in the NIR band. KOMPSAT-3, which has no on-board calibrator, showed lower radiometric stability than Landsat-8. In the future, efforts are needed to produce normalized reflectance data through BRDF (bidirectional reflectance distribution function) correction and SBAF application for the spectral characteristics of agricultural land.
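
As a rough illustration of the SBAF step mentioned above, the sketch below simulates band reflectance for two sensors by weighting a hyperspectral profile with each sensor's relative spectral response (RSR) and then takes the ratio. The uniform wavelength grid, the function names, and the choice of target over reference as the ratio direction are assumptions made here. The abstract does not state which convention or which hyperspectral source the study used.

```python
def band_average(reflectance, rsr):
    """RSR-weighted mean reflectance, assuming a shared, uniform wavelength grid."""
    if len(reflectance) != len(rsr):
        raise ValueError("profile and RSR must share the same wavelength grid")
    weight = sum(rsr)
    if weight == 0:
        raise ValueError("RSR is zero everywhere on this grid")
    return sum(r * w for r, w in zip(reflectance, rsr)) / weight


def sbaf(reflectance, rsr_target, rsr_reference):
    """Spectral band adjustment factor: simulated target band / simulated reference band."""
    return band_average(reflectance, rsr_target) / band_average(reflectance, rsr_reference)


def adjust_to_reference(toa_reflectance_target, factor):
    """Rescale a target-sensor TOA reflectance so it is comparable with the reference band."""
    return toa_reflectance_target / factor
```

For a spectrally flat surface the factor is exactly 1, so any departure from 1 reflects how the surface spectrum interacts with the differing band shapes.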

The Effect of Data Size on the k-NN Predictability: Application to Samsung Electronics Stock Market Prediction (데이터 크기에 따른 k-NN의 예측력 연구: 삼성전자주가를 사례로)

  • Chun, Se-Hak
    • Journal of Intelligence and Information Systems / v.25 no.3 / pp.239-251 / 2019
  • Statistical methods such as moving averages, Kalman filtering, exponential smoothing, regression analysis, and ARIMA (autoregressive integrated moving average) have been used for stock market prediction. However, these statistical methods have not produced superior performance. In recent years, machine learning techniques, including artificial neural networks, SVM, and genetic algorithms, have been widely used in stock market prediction. In particular, a case-based reasoning method known as k-nearest neighbor (k-NN) is also widely used for stock price prediction. Case-based reasoning retrieves several similar cases from previous cases when a new problem occurs and combines the class labels of the similar cases to create a classification for the new problem. However, case-based reasoning has some problems. First, it tends to search for a fixed number of neighbors in the observation space and always selects the same number of neighbors rather than the most similar neighbors for the target case. As a result, it may have to take more cases into account even when fewer cases are applicable to the subject. Second, it may select neighbors that are far away from the target case. Thus, case-based reasoning does not guarantee an optimal pseudo-neighborhood for various target cases, and predictability can be degraded by deviation from the desired similar neighbors. This paper examines how the size of the learning data affects stock price predictability with k-nearest neighbor and compares the predictability of k-nearest neighbor with that of the random walk model according to the size of the learning data and the number of neighbors. In this study, Samsung Electronics stock prices were predicted using two learning datasets. To predict the next day's closing price, we used four variables: opening value, daily high, daily low, and daily close.
In the first experiment, data from January 1, 2000 to December 31, 2017 were used for the learning process. In the second experiment, data from January 1, 2015 to December 31, 2017 were used for the learning process. The test data cover January 1, 2018 to August 31, 2018 for both experiments. We compared the performance of k-NN with the random walk model using the two learning datasets. The mean absolute percentage error (MAPE) was 1.3497 for the random walk model and 1.3570 for k-NN in the first experiment, when the learning data was small. In the second experiment, when the learning data was large, the MAPE was 1.3497 for the random walk model and 1.2928 for k-NN. These results show that prediction power is higher when more learning data are used than when less learning data are used. This paper also shows that k-NN generally produces better predictive power than the random walk model for larger learning datasets but not when the learning dataset is relatively small. Future studies need to consider macroeconomic variables related to stock price forecasting in addition to the opening, low, high, and closing prices. To produce better results, it is also recommended that k-nearest neighbor find its nearest neighbors with a second-step filtering method that considers fundamental economic variables, together with a sufficient amount of learning data.
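
The comparison described above can be sketched with three small functions: MAPE, a random walk forecast, and a k-NN forecast over (open, high, low, close) feature vectors. This is a minimal illustration under assumptions made here (Euclidean distance, an unweighted mean of the neighbors' next-day closes, hypothetical function names), not a reconstruction of the paper's procedure or data.

```python
import math


def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


def random_walk_forecast(closes):
    """Predict each next close as today's close; aligns with closes[1:]."""
    return list(closes[:-1])


def knn_forecast(train_x, train_y, query, k=3):
    """Average the targets of the k training vectors closest to `query`.

    train_x: feature vectors such as (open, high, low, close)
    train_y: the next-day close that followed each vector
    """
    ranked = sorted(
        (math.dist(x, query), y) for x, y in zip(train_x, train_y)
    )
    nearest = ranked[:k]
    return sum(y for _, y in nearest) / len(nearest)
```

Because k is fixed, the k-th neighbor is used even when it lies far from the query, which is the weakness the abstract describes as a non-optimal pseudo-neighborhood.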

Adsorption Characteristics and Parameters of Acid Black and Quinoline Yellow by Activated Carbon (활성탄에 의한 Acid Black과 Quinoline Yellow의 흡착특성 및 파라미터)

  • Yi, Kyung Ho;Hwang, Eun Jin;Baek, Woo Seung;Lee, Jong-Jib;Dong, Jong-In
    • Clean Technology / v.26 no.3 / pp.186-195 / 2020
  • The isothermal adsorption, dynamic, and thermodynamic parameters of Acid black (AB) and Quinoline yellow (QY) adsorption by activated carbon were investigated using the initial concentration, contact time, temperature, and pH of the dyes as adsorption parameters. The adsorption equilibrium data fit the Freundlich isothermal adsorption model, and the calculated Freundlich separation factor values indicated that activated carbon can effectively remove AB and QY. The kinetic data agreed with the pseudo second-order model to within 10% error. The intraparticle diffusion plot was divided into two straight lines. Since the slope of the intraparticle diffusion line was smaller than the slope of the boundary layer diffusion line, it was confirmed that intraparticle diffusion was the rate-controlling step. The thermodynamic experiments indicated that the activation energies of AB and QY were 19.87 kJ/mol and 14.17 kJ/mol, which correspond to a physical adsorption process (5 to 40 kJ/mol). The adsorption reaction was spontaneous because the free energy change in the adsorption of AB and QY by activated carbon was negative from 298 to 318 K. As the temperature increased, the free energy value decreased, resulting in higher spontaneity. Adsorption of AB and QY by activated carbon showed the highest removal rate at pH 3 due to the effect of anions generated by dissociation. The adsorption mechanism was electrostatic attraction.
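
The pseudo second-order kinetic model referred to above has the integrated form q_t = k2·qe²·t / (1 + k2·qe·t), which linearizes to t/q_t = 1/(k2·qe²) + t/qe. The sketch below fits that straight line by ordinary least squares. The function names are assumptions introduced here, and the data in the usage note are synthetic, not values from the paper.

```python
def pso_uptake(t, qe, k2):
    """Pseudo second-order uptake q_t at time t."""
    return k2 * qe ** 2 * t / (1.0 + k2 * qe * t)


def fit_pso_linear(times, uptakes):
    """Fit t/q_t = 1/(k2*qe^2) + t/qe by least squares; return (qe, k2).

    All times and uptakes must be positive.
    """
    xs = list(times)
    ys = [t / q for t, q in zip(times, uptakes)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1 / qe
    intercept = mean_y - slope * mean_x  # equals 1 / (k2 * qe^2)
    qe = 1.0 / slope
    k2 = slope ** 2 / intercept
    return qe, k2
```

For example, generating uptakes with `pso_uptake(t, 2.0, 0.5)` at several times and passing them to `fit_pso_linear` recovers qe ≈ 2.0 and k2 ≈ 0.5. With real measurements, the percentage error between fitted and observed q_t is one way to express the kind of agreement the abstract reports.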

Adsorption Characteristics Analysis of 2,4-Dichlorophenol in Aqueous Solution with Activated Carbon Prepared from Waste Citrus Peel using Response Surface Modeling Approach (반응표면분석법을 이용한 폐감귤박 활성탄에 의한 수중의 2,4-Dichlorophenol 흡착특성 해석)

  • Lee, Chang-Han;Kam, Sang-Kyu;Lee, Min-Gyu
    • Korean Chemical Engineering Research / v.55 no.5 / pp.723-730 / 2017
  • Batch experiments designed by response surface methodology (RSM) were used to investigate the influence of operating parameters such as temperature, initial concentration, contact time, and adsorbent dosage on 2,4-dichlorophenol (2,4-DCP) adsorption with an activated carbon prepared from waste citrus peel (WCAC). The regression equation formulated for 2,4-DCP adsorption was represented as a function of response variables. The adequacy of the model was tested by the correlation between experimental and predicted values of the response. A fairly high value of $R^2$ (0.9921) indicated that most of the data variation was explained by the regression model. The significance of the independent variables and their interactions was tested by analysis of variance (ANOVA) and t-test statistics. These results showed that the model used to fit the response variables was significant and adequate to represent the relationship between the response and the independent variables. The kinetic and isotherm experimental data were well described by the pseudo-second order model and the Langmuir isotherm model, respectively. The maximum adsorption capacity of 2,4-DCP on WCAC calculated from the Langmuir isotherm model was 345.49 mg/g. The study of the rate-controlling mechanism revealed that film diffusion and intraparticle diffusion occurred simultaneously during the adsorption process. The thermodynamic parameters indicated that the adsorption of 2,4-DCP on WCAC was an endothermic and spontaneous process.
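
A maximum adsorption capacity such as the one quoted above is typically read off a Langmuir fit, qe = qmax·KL·Ce / (1 + KL·Ce), often through the linearized form Ce/qe = 1/(qmax·KL) + Ce/qmax. The sketch below performs that linear fit and reports R² for the linearized data. The function names are assumptions introduced here, the abstract does not say which fitting procedure the authors used, and the check in the usage note uses synthetic data only.

```python
def langmuir_uptake(ce, qmax, kl):
    """Equilibrium uptake qe at equilibrium concentration ce."""
    return qmax * kl * ce / (1.0 + kl * ce)


def fit_langmuir_linear(ce_values, qe_values):
    """Fit Ce/qe = 1/(qmax*KL) + Ce/qmax; return (qmax, KL, r_squared)."""
    xs = list(ce_values)
    ys = [ce / qe for ce, qe in zip(ce_values, qe_values)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1 / qmax
    intercept = mean_y - slope * mean_x  # equals 1 / (qmax * KL)
    qmax = 1.0 / slope
    kl = slope / intercept
    # Coefficient of determination of the linearized fit.
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    r_squared = 1.0 - ss_res / ss_tot
    return qmax, kl, r_squared
```

Feeding it points generated by `langmuir_uptake(ce, 100.0, 0.05)` returns qmax ≈ 100 and KL ≈ 0.05 with R² close to 1. Measured data would give a lower R², and a nonlinear fit can yield somewhat different parameters than the linearized one.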