• Title/Summary/Keyword: Batch method

Search Result 782, Processing Time 0.035 seconds

A Study on Fine-Tuning and Transfer Learning to Construct Binary Sentiment Classification Model in Korean Text (한글 텍스트 감정 이진 분류 모델 생성을 위한 미세 조정과 전이학습에 관한 연구)

  • JongSoo Kim
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.5
    • /
    • pp.15-30
    • /
    • 2023
  • Recently, generative models based on the Transformer architecture, such as ChatGPT, have been gaining significant attention. The Transformer architecture has been applied to various neural network models, including Google's BERT(Bidirectional Encoder Representations from Transformers) sentence generation model. In this paper, a method is proposed to create a text binary classification model for determining whether a comment on Korean movie review is positive or negative. To accomplish this, a pre-trained multilingual BERT sentence generation model is fine-tuned and transfer learned using a new Korean training dataset. To achieve this, a pre-trained BERT-Base model for multilingual sentence generation with 104 languages, 12 layers, 768 hidden, 12 attention heads, and 110M parameters is used. To change the pre-trained BERT-Base model into a text classification model, the input and output layers were fine-tuned, resulting in the creation of a new model with 178 million parameters. Using the fine-tuned model, with a maximum word count of 128, a batch size of 16, and 5 epochs, transfer learning is conducted with 10,000 training data and 5,000 testing data. A text sentiment binary classification model for Korean movie review with an accuracy of 0.9582, a loss of 0.1177, and an F1 score of 0.81 has been created. As a result of performing transfer learning with a dataset five times larger, a model with an accuracy of 0.9562, a loss of 0.1202, and an F1 score of 0.86 has been generated.

A Study on Characteristics of Pulverized Ion Exchange Resins (이온교환수지 분체 특성에 대한 연구)

  • Jaeyong Huh;Gyeongmi Goo;Yongwon Jang;Sanghyeon Kang
    • Membrane Journal
    • /
    • v.34 no.2
    • /
    • pp.132-139
    • /
    • 2024
  • The ion exchange resin used to remove total dissolved solids (TDS) is used by being packed in a column, and sufficient contact time between the ionic material and the ion exchange resin is required during the ion exchange process. In this study, the ion exchange resin that exhibits high TDS reduction even with a short contact time through pulverization of the ion exchange resin was characterized. The optimal size of resin considering flowability was over 100 ㎛. The highest pulverizing yield were obtained that 250~500 ㎛ size and 100~250 ㎛ size were 67.3% and 36.9%, respectively. Also, the highest yield and the pulverizing time of 100~500 ㎛ size was 87.1% and 2 minutes, respectively. Under batch test conditions, the time to reach a removal rate of 95% and 99% for 250~500 ㎛ resins was 1.82 and 1.96 times faster than non-pulverized ion exchange resin, respectively. The 100~250 ㎛ resins showed 15.9 times and 6.18 times faster, respectively. Under the column test, a total of 1.74 g of NaCl was removed by non-pulverized ion exchange resins, 1.83 g of NaCl was removed by 250~500 ㎛ resins and 1.63 g of NaCl was removed by 100 and 250 ㎛ resins. As the size of the resin decreased, the capacity slightly decreased. As a result, it was observed that the pulverized ion exchange resins could be a method of achieving high TDS removal performance under short contact time.

Object Tracking Based on Exactly Reweighted Online Total-Error-Rate Minimization (정확히 재가중되는 온라인 전체 에러율 최소화 기반의 객체 추적)

  • JANG, Se-In;PARK, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.53-65
    • /
    • 2019
  • Object tracking is one of important steps to achieve video-based surveillance systems. Object tracking is considered as an essential task similar to object detection and recognition. In order to perform object tracking, various machine learning methods (e.g., least-squares, perceptron and support vector machine) can be applied for different designs of tracking systems. In general, generative methods (e.g., principal component analysis) were utilized due to its simplicity and effectiveness. However, the generative methods were only focused on modeling the target object. Due to this limitation, discriminative methods (e.g., binary classification) were adopted to distinguish the target object and the background. Among the machine learning methods for binary classification, total error rate minimization can be used as one of successful machine learning methods for binary classification. The total error rate minimization can achieve a global minimum due to a quadratic approximation to a step function while other methods (e.g., support vector machine) seek local minima using nonlinear functions (e.g., hinge loss function). Due to this quadratic approximation, the total error rate minimization could obtain appropriate properties in solving optimization problems for binary classification. However, this total error rate minimization was based on a batch mode setting. The batch mode setting can be limited to several applications under offline learning. Due to limited computing resources, offline learning could not handle large scale data sets. Compared to offline learning, online learning can update its solution without storing all training samples in learning process. Due to increment of large scale data sets, online learning becomes one of essential properties for various applications. Since object tracking needs to handle data samples in real time, online learning based total error rate minimization methods are necessary to efficiently address object tracking problems. Due to the need of the online learning, an online learning based total error rate minimization method was developed. However, an approximately reweighted technique was developed. Although the approximation technique is utilized, this online version of the total error rate minimization could achieve good performances in biometric applications. However, this method is assumed that the total error rate minimization can be asymptotically achieved when only the number of training samples is infinite. Although there is the assumption to achieve the total error rate minimization, the approximation issue can continuously accumulate learning errors according to increment of training samples. Due to this reason, the approximated online learning solution can then lead a wrong solution. The wrong solution can make significant errors when it is applied to surveillance systems. In this paper, we propose an exactly reweighted technique to recursively update the solution of the total error rate minimization in online learning manner. Compared to the approximately reweighted online total error rate minimization, an exactly reweighted online total error rate minimization is achieved. The proposed exact online learning method based on the total error rate minimization is then applied to object tracking problems. In our object tracking system, particle filtering is adopted. In particle filtering, our observation model is consisted of both generative and discriminative methods to leverage the advantages between generative and discriminative properties. In our experiments, our proposed object tracking system achieves promising performances on 8 public video sequences over competing object tracking systems. The paired t-test is also reported to evaluate its quality of the results. Our proposed online learning method can be extended under the deep learning architecture which can cover the shallow and deep networks. Moreover, online learning methods, that need the exact reweighting process, can use our proposed reweighting technique. In addition to object tracking, the proposed online learning method can be easily applied to object detection and recognition. Therefore, our proposed methods can contribute to online learning community and object tracking, detection and recognition communities.

Incremental Generation of A Decision Tree Using Global Discretization For Large Data (대용량 데이터를 위한 전역적 범주화를 이용한 결정 트리의 순차적 생성)

  • Han, Kyong-Sik;Lee, Soo-Won
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.487-498
    • /
    • 2005
  • Recently, It has focused on decision tree algorithm that can handle large dataset. However, because most of these algorithms for large datasets process data in a batch mode, if new data is added, they have to rebuild the tree from scratch. h more efficient approach to reducing the cost problem of rebuilding is an approach that builds a tree incrementally. Representative algorithms for incremental tree construction methods are BOAT and ITI and most of these algorithms use a local discretization method to handle the numeric data type. However, because a discretization requires sorted numeric data in situation of processing large data sets, a global discretization method that sorts all data only once is more suitable than a local discretization method that sorts in every node. This paper proposes an incremental tree construction method that efficiently rebuilds a tree using a global discretization method to handle the numeric data type. When new data is added, new categories influenced by the data should be recreated, and then the tree structure should be changed in accordance with category changes. This paper proposes a method that extracts sample points and performs discretiration from these sample points to recreate categories efficiently and uses confidence intervals and a tree restructuring method to adjust tree structure to category changes. In this study, an experiment using people database was made to compare the proposed method with the existing one that uses a local discretization.

Effects of HCl and EDTA on Soil Washing to Remediate Lead-contaminated Soil in a Firing Range (사격장 납 오염토양 복원을 위한 토양세척시 HCl과 EDTA의 영향 연구)

  • Kim, Hyo-Sik;Choi, Sang-Il
    • Journal of Soil and Groundwater Environment
    • /
    • v.13 no.1
    • /
    • pp.60-66
    • /
    • 2008
  • Laboratory soil washing experiments with HCl or EDTA were conducted to remediate lead-contaminated soil in a firing range. After lead bullets were removed by standard sieve #18 (1.0 mM), Pb concentrations were measured by EPA Method 3050B (9,443 mg/kg) and Korea Standard Test (4,803.5 mg/kg). The results of the batch test showed that the removal efficiency curve was logarithmic and approximately 90% of lead in soil was removed, when HCl was used. In case of EDTA, the removal efficiency increased proportionally to the concentration of EDTA, up to 98% lead removal with 0.1M EDTA. High mixing strength resulted in increase of removal efficiency and kinetics showed that the most lead was extracted in 10 min.

Liquid Membrane Permeation of Nitrogen Heterocyclic Compounds Contained in Model Coal Tar Fraction

  • Kim, Su-Jin;Kang, Ho-Cheol;Kim, Yong-Shik;Jeong, Hwa-Jin
    • Bulletin of the Korean Chemical Society
    • /
    • v.31 no.5
    • /
    • pp.1143-1148
    • /
    • 2010
  • We investigated the separation of nitrogen heterocyclic compound (NHC) contained in a model coal tar fraction comprising four kinds of NHC [indole (In), quinoline (Q), iso-quinoline (iQ), quinaldine (Qu)], three kinds of bicyclic aromatic compound (BAC) [1-methylnaphthalene (1MN), 2-methylnaphthalene (2MN), dimethylnaphthalene (DMN) mixture with ten structural isomers (DMNs; regarded as one component)], biphenyl (Bp) and phenyl ether (Pe) by liquid membrane permeation (LMP). A batch-stirred tank was used as the permeation unit. An aqueous solution of saponin and n-hexane were used as the liquid membrane and the outer oil phase, respectively. Yield and selectivity of individual NHC was much larger than that of BAC, Bp and Pe. Increasing the initial mass fraction of the saponin to the membrane solution ($C_{sap,0}$) and the initial volume fraction of O/W emulsion to total liquid in a stirred tank (${\phi}_{OW,0}$) resulted in deteriorating the yield of individual NHC, but increasing the stirring speed (N) resulted in improving the yield of each NHC. With increasing $C_{sap,0}$, the selectivity of each NHC based on DMNs increased. Increasing ${\phi}_{OW,0}$ and N resulted in decreasing the selectivity of individual NHC based on DMNs. At an experimental condition fixed, the sequence of the yield and selectivity in reference to DMNs for each NHC was Q > Qu = iQ > In. Furthermore, we compared LPM method with methanol extraction method in view of the separation efficiency (yield, selectivity) of NHC.

Practical Use of Vacuum Press for Curvature Formation in Wooden Furniture Design (목 가구 디자인에서 곡면 성형을 위한 베큠프레스의 활용)

  • Wee, Han-Lim
    • Archives of design research
    • /
    • v.18 no.4 s.62
    • /
    • pp.155-164
    • /
    • 2005
  • In contrast with product design field, some designers who work in furniture field tend to do their own studio works as well as typical designing part. Especially in the small furniture studios for the limited quantity batch production, custom made or handmade craft furniture which is finished with high quality, the propensity for their own production is more obvious than in the big furniture companies in this case. In this kind of small-scale furniture studios, they have more chance to create the various formative works and 'curved shape' is one of the most important elements to form creative pieces. Except by caning, it is very difficult to make curved wooden shape because of own characteristic of wood. Therefore, the special techniques of bending wood are essential to formative furniture production and vacuum press system is introduced as a main subject for the bending wood method in this study. Especially for the designers who work as makers as well at the small furniture studios, the value of vacuum press system on efficiency and productivity of work was sought by testing and improving the method of wood bending techniques. According to this practical searching, ideally sufficient condition on vacuum pressing work was founded as a result on this study.

  • PDF

Erection Process Planning & Scheduling using Genetic Algorithm (유전 알고리즘을 이용한 탑재 공정과 일정 계획)

  • J.W. Lee;H.J. Kim
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.32 no.1
    • /
    • pp.9-16
    • /
    • 1995
  • The erection process planning is to decide erection strategy and sequence that satisfies dock duration. The erection scheduling is to decide erection date of each block. The load profile varies according to scheduling and it is related to building cost. It must be possible to simulate the various combinations of process plan and schedule for optimal planning. To develop the process planning system for optimal planning, the system that generate the sequence of erection automatically and the load leveling system are required. This paper suggests the method that generates the erection sequence. The load leveling should be done to all the ships in the same dock batch to get reliable results. In this case since the search space is very large, efficient optimization method is needed Our research achieved the load leveling system using Genetic Algorithm. This system made it possible to simulate various process plans to which schedule is considered.

  • PDF

Synthesis of Concentrated Silver Nano Sol for Ink-Jet Method (잉크젯용 고농도 은 나노 졸 합성)

  • Park, Han-Sung;Seo, Dong-Soo;Choi, Youngmin;Chang, Hyunjoo;Kong, Ki-Jeong;Lee, Jung-O;Ryu, Beyong-Hwan
    • Journal of the Korean Ceramic Society
    • /
    • v.41 no.9
    • /
    • pp.670-676
    • /
    • 2004
  • The synthesis of highly concentrated silver nano sol assisted by polymeric dispersant (polyelectrolytes) for inkjet method was studied. The silver nano sol was prepared with AgNO$_3$, polyelectrolytes (HS5468cf ; polyacrylic ammonium salt), and reducing agent. The polyelectrolytes play an important role in formation of complex composed of Ag$\^$+/ion and carboxyl group (COO$\^$-/), result in preparation of highly dispersed silver nano particles. The optimization of added amount of polyelectrolytes, and concentration of silver nano sol was studied. The silver nanoparticles were evaluated by XRD, particle size/zeta potential analyzer and FE-TEM. The silver nanoparticles with the range of 10 nm in diameter were produced. The concentration of batch-synthesized silver nano sol was possible up to 40 wt%.

Adsorption Characteristics of Sr Ions by Coal Fly Ash-Based-Zeolite X using Response Surface Modeling Approach (반응표면분석법을 이용한 석탄회로 합성한 제올라이트 X에서의 Sr 이온 제거특성)

  • Lee, Chang-Han;Kam, Sang-Kyu;Lee, Min-Gyu
    • Journal of Environmental Science International
    • /
    • v.26 no.6
    • /
    • pp.719-728
    • /
    • 2017
  • In order to investigate the adsorption characteristics for Sr ion using the Na-X zeolite synthesized from coal fly ash, batch tests and response surface analyses were carried out. The adsorption kinetic data for Sr ions, using Na-X zeolite, fitted well with the pseudo-second-order model. The uptake of Sr ions followed the Langmuir isotherm model, with a maximum adsorption capacity of 196.46 mg/g. Thermodynamic studies were conducted at different reaction temperatures, with the results indicating that Sr ion adsorption by Na-X zeolite was an endothermic (${\Delta}H^o$>0) and spontaneous (${\Delta}G^o$<0) process. Using the response surface methodology of the Box-Behnken method, initial Sr ion concentration ($X_1$), initial temperature ($X_2$), and initial pH ($X_3$) were selected as the independent variables, while the adsorption of Sr ions by Na-X zeolite was selected as the dependent variable. The experimental data fitted well with a second-order polynomial equation by multiple regression analysis. The value of the determination coefficient ($R^2=0.9937$) and the adjusted determination coefficient (adjusted $R^2=0.9823$) was close to 1, indicating high significance of the model. Statistical results showed the order of Sr removal based on experimental factors to be initial pH > initial concentration > temperature.