Search | Korea Science

Wanda Pruning for Lightweighting Korean Language Model (Wanda Pruning에 기반한 한국어 언어 모델 경량화)

Jun-Ho Yoon;Daeryong Seo;Donghyeon Jeon;Inho Kang;Seung-Hoon Na
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.437-442
- /
- 2023
최근에 등장한 대규모 언어 모델은 다양한 언어 처리 작업에서 놀라운 성능을 발휘하고 있다. 그러나 이러한 모델의 크기와 복잡성 때문에 모델 경량화의 필요성이 대두되고 있다. Pruning은 이러한 경량화 전략 중 하나로, 모델의 가중치나 연결의 일부를 제거하여 크기를 줄이면서도 동시에 성능을 최적화하는 방법을 제시한다. 본 논문에서는 한국어 언어 모델인 Polyglot-Ko에 Wanda[1] 기법을 적용하여 Pruning 작업을 수행하였다. 그리고 이를 통해 가중치가 제거된 모델의 Perplexity, Zero-shot 성능, 그리고 Fine-tuning 후의 성능을 분석하였다. 실험 결과, Wanda-50%, 4:8 Sparsity 패턴, 2:4 Sparsity 패턴의 순서로 높은 성능을 나타냈으며, 특히 일부 조건에서는 기존의 Dense 모델보다 더 뛰어난 성능을 보였다. 이러한 결과는 오늘날 대규모 언어 모델 중심의 연구에서 Pruning 기법의 효과와 그 중요성을 재확인하는 계기가 되었다.
PDF

Generation Methodology Using Super In-Context Learning (Super In-Context Learning을 활용한 생성 방법론)

Seongtae Hong;Seungjun Lee;Gyeongmin Kim;Heuiseok Lim
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.382-387
- /
- 2023
현재 GPT-4와 같은 거대한 언어 모델이 기계 번역, 요약 및 대화와 같은 다양한 작업에서 압도적인 성능을 보이고 있다. 그러나 이러한 거대 언어 모델은 학습 및 적용에 상당한 계산 리소스와 도메인 특화 미세 조정이 어려운 등 몇 가지 문제를 가지고 있다. In-Context learning은 데이터셋에서 추출한 컨택스트의 정보만으로 효과적으로 작동할 수 있는 효율성을 제공하여 앞선 문제를 일부 해결했지만, 컨텍스트의 샷 개수와 순서에 민감한 문제가 존재한다. 이러한 도전 과제를 해결하기 위해, 우리는 Super In-Context Learning (SuperICL)을 활용한 새로운 방법론을 제안한다. 기존의 SuperICL은 적용한 플러그인 모델의 출력 정보를 이용하여 문맥을 새로 구성하고 이를 활용하여 거대 언어 모델이 더욱 잘 분류할 수 있도록 한다. Super In-Context Learning for Generation은 다양한 자연어 생성 작업에 효과적으로 최적화하는 방법을 제공한다. 실험을 통해 플러그인 모델을 교체하여 다양한 작업에 적응하는 가능성을 확인하고, 자연어 생성 작업에서 우수한 성능을 보여준다. BLEU 및 ROUGE 메트릭을 포함한 평가 결과에서도 성능 향상을 보여주며, 선호도 평가를 통해 모델의 효과성을 확인했다.
PDF

Verification for Multithreaded Java Code using Java Memory Model (자바 메모리 모델을 이용한 멀티 스레드 자바 코드 검증)

Lee, Min;Kwon, Gi-Hwon
- The KIPS Transactions:PartD
- /
- v.15D no.1
- /
- pp.99-106
- /
- 2008
Recently developed compilers perform some optimizations in order to speed up the execution time of source program. These optimizations require the reordering of the sequence of program statements. This reordering does not give any problems in a single-threaded program. However, the reordering gives some significant errors in a multi-threaded program. State-of-the-art model checkers such as JavaPathfinder do not consider the reordering resulted in the optimization step in a compiler since they just consider a single memory model. In this paper, we develop a new verification tool to verify Java source program based on Java Memory Model. And our tool is capable of handling the reordering in verifying Java programs. As a result, our tool finds an error in the test program which is not revealed with the traditional model checker JavaPathFinder.
https://doi.org/10.3745/KIPSTD.2008.15-D.1.99 인용 PDF KSCI

Optimal method of digital photogrammetry (수치항공사진측량의 최적화 방안 연구)

이정화
- Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
- /
- 2002.04a
- /
- pp.67-75
- /
- 2002
Digital photogrammetry is one of the powerful tools for surveying in more perceptual ways and exploiting the continuously developing computer technology. Nowadays, digital photogrammetry is being used for a number of industrial measurements and inspections but the automation aspect of this technique is not fully developed yet. Photogrammetric work, which is obtained through usual workflow, delays for a big amount of CGP surveying, interpretation and cadastral information. Therefore through studying ways of reducing the volume of photogrammetric works, financial opportunities for digital photogrammetry can be found. This research is focused on the development of the new workflow and study algorithm in digital photogrammetry. Using this result we can reduce financial expenses and improve technologies of topographic and cadastral plans creation.
PDF

Optimization of T/C Lifting Plan using Dependency Structure Matrix (DSM) (DSM을 활용한 타워크레인 양중계획 최적화에 관한 연구)

Kim, Seungho;Kim, Sangyong;Jean, Jihoon;An, Sung-Hoon
- Journal of the Korea Institute of Building Construction
- /
- v.16 no.2
- /
- pp.151-159
- /
- 2016
Tower crane (T/C) is one of the major equipment that is highly demanded in construction projects. Especially, most high-rise building projects require T/C to perform lifting and hoisting activities of materials. Therefore, lifting plan of T/C needs to reduce construction duration and cost. However, most lifting plan of the T/C in construction sites has still performed depending on experience and intuition of the site manager without systematic process of rational work. Dependency structure matrix (DSM) is useful tool in planning the activity sequences and managing information exchanges unlike other existing tools. To improve lifting plan of T/C efficiently, this study presents a framework for the scheduling T/C using DSM through the case study in real world construction site. The results of case study showed that the scheduling T/C using DSM is useful to optimize the T/C lifting plan in terms of easiness, specially in the typical floor cycle lifting planning.
https://doi.org/10.5345/JKIBC.2016.16.2.151 인용 PDF KSCI

A Study on Improvement of Run-Time in KS-SIGNAL, Traffic Signal Optimization Model for Coordinated Arterials (간선도로 연동화 신호최적화 모형 KS-SIGNAL의 수행속도 향상을 위한 연구)

박찬호;김영찬
- Journal of Korean Society of Transportation
- /
- v.18 no.4
- /
- pp.7-18
- /
- 2000
KS-SIGNAL, a traffic signal optimization model for coordinated arterials, is an optimization model using the mixed integer linear Programming that minimizes total delay on arterials by optimizing left-turn Phase sequences. However, the Previous version of KS-SIGNAL had a difficulty in reducing computation speed because the related variables and constraints multiply rapidly in accordance with the increase of intersections. This study is designed to propose a new model, improving optimizing computation speed in KS-SIGMAl, and evaluate it. This Paper Puts forth three kinds of methodological approaches as to achieve the above goals. At the first step to reduce run-time in the proposed model objective function and a few constraints are Partially modified, which replaces variable in related to queue clearance time with constant, by using thru-movements at upstream intersection and the length of red time at downstream intersection. The result shows that the run-time can be reduced up to 70% at this step. The second step to load the library in LINDO for Windows, in order to solve mixed integer linear programming. The result suggests that run-time can be reduced obviously up to 99% of the first step result. The third step is to add constraints in related to left-turn Phase sequences. The proposed methodological approach, not optimizing all kinds of left-turn sequences, is more reasonable than that of previous model , only in the view of reducing run-tim. In conclusion, run-time could be reduced up to 30% compared with the second results. This Proposed model was tested by several optimization scenarios. The results in this study reveals that signal timing plan in KS-SIGNAL is closer to PASSER-II (bandwidth maximizing model) rather than to TRANSYT-7F(delay minimizing model).
PDF

A distance metric of nominal attribute based on conditional probability (조건부 확률에 기반한 범주형 자료의 거리 측정)

이재호;우종하;오경환
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2003.09b
- /
- pp.53-56
- /
- 2003
유사도 혹은 자료간의 거리 개념은 많은 기계학습 알고리즘에서 사용되고 있는 중요한 측정개념이다 하지만 입력되는 자료의 속성들중 순서가 정의되지 않은 범주형 속성이 포함되어 있는 경우, 자료간의 유사도나 거리 측정에 어려움이 따른다. 비거리 기반의 알고리즘들의 경우-C4.5, CART-거리의 측정없이 작동할 수 있지만, 거리기반의 알고리즘들의 경우 범주형 속성의 거리 정보 결여로 효과적으로 적용될 수 없는 문제점을 갖고 있다. 본 논문에서는 이러한 범주형 자료들간 거리 측정을 자료 집합의 특성을 충분히 고려한 방법을 제안한다. 이를 위해 자료 집합의 선험적인 정보를 필요로 한다. 이런 선험적 정보인 조건부 확률을 기반으로한 거리 측정방법을 제시하고 오류 피드백을 통해서 속성 간 거리 측정을 최적화 하려고 노력한다. 주어진 자료 집합에 대해 서로 다른 두 범주형 값이 목적 속성에 대해서 유사한 분포를 보인다면 이들 값들은 비교적 가까운 거리로 결정한다 이렇게 결정된 거리를 기반으로 학습 단계를 진행하며 이때 발생한 오류들에 대해 피드백 작업을 진행한다. UCI Machine Learning Repository의 자료들을 이용한 실험 결과를 통해 제안한 거리 측정 방법의 우수한 성능을 확인하였다.
PDF

Autonomous Reconstruction of 3D Indoor Environment for Virtual Reality using a Mobile Robot based on the Observation Planning (경로 예측에 기반한 이동로봇을 이용한 가상 현실을 위한 삼차원 실내 환경 모델의 자율 복원에 관한 연구)

Moon, Jung-Hyun;You, Bum-Jae;Kim, Hag-Bae;Oh, Sang-Rok
- Proceedings of the KIEE Conference
- /
- 2005.07d
- /
- pp.2774-2776
- /
- 2005
본 논문에서는 이동로봇에 삼차원 레이저 스캐너를 장착하여 삼차원 데이터의 수집, 수집된 데이터의 정합, 데이터 수집을 위한 이동로봇의 경로계획 및 장애물 회피주행 등 모든 작업들을 유기적으로 결합시켜 실내 환경에 다한 삼차원 모델을 자율제작하는 시스템을 제안한다. 이를 위해 스캔순서최적화를 통한 빠른 동적 물체 정보의 제거, 계층적 육면체 맵과 기하학적맵을 이용한 최적 경로 예측에 의한 다음 스캐닝 위치의 결정, 오도미터 정보와 명암 정보를 이용해 수정된 ICP 알고리즘을 통한 데이터의 정합을 통하여 이동물체와 관계없는 실내환경에 대한 삼차원 모델의 자율복원 한다.
PDF

JOB Scheduling Analysis in FMC using TPN (TPN을 이용한 FMC의 JOB 스케쥴링 분석)

안광수
- Journal of the Korea Society of Computer and Information
- /
- v.4 no.3
- /
- pp.13-19
- /
- 1999
In this paper, we suggests a WIP (Work In Process) of FMC (Flexible Manufacturing Cell) analysis methods based on the TPN (Time Petri Nets) unfolding. Unfolding of PN is a partial order-based method for the verification of concurrent system without the state space explosion. The aim of this work is to formulate the general cyclic state scheduling problem to minimize the WIP to satisfy economical constraints. The method is based on unfolding of the original net into the equivalent acyclic description.

A Study on the Development of a Mid-term Scheduling System for Shipbuilding Using ILOG (ILOG를 이용한 조선 중일정계획 시스템 개발에 관한 연구)

우상복;김상훈;한형상;김기동;김훈주
- Proceedings of the Korea Society for Simulation Conference
- /
- 2001.05a
- /
- pp.143-143
- /
- 2001
조선 산업에서의 이익 극대화를 위해서는, 공기 단축을 통한 매출 증대, 주어진 생산 자원(인력, 설비 등)의 효율적인 이용을 통한 생산비 절감이 필수적인 사안이다. 다시 말하면, 효율적인 자원의 이용으로 선박의 건조 공기를 줄임으로써 조선 산업에서의 이익 극대화를 도모할 수 있고 이는 효과적인 생산 계획과 관리를 통해 얻을 수 있다. 조선 산업에서의 생산 계획과 관리는 내업, 선행, 선내 등 선박 건조의 전 과정에서 이루어지고 있는데, 조선 산업은 타 산업에 비해 환경변화에 따른 단기간의 불확실성은 비교적 적은 반면에, 가공, 절단, 조립, 의장, 도장, 선행탑재, 탑재, 안벽 작업 등 다단계 제조 프로세스가 장기간에 걸쳐 상당히 복잡하게 구성되어 있다. 또한, 유연하게 적용할 수 있는 인적자원과 공정순서, 그리고 각종 시간적 공간적 자원 제약 등으로 인하여 효과적이고 일관된 생산 계획을 신속하게 수립하기가 어려울 뿐만 아니라, 주문 생산 방식이기 때문에 정확한 일정계획 데이터의 생성도 어려운 실정이다. 본 연구는, 조선의 혁신적인 생산관리 능력 향상을 지원하기 위한 "조선 통합 생산계획 시스템 개발" 에 관련된 연구이다. 본 연구에서는 조선 생산계획에 적합한 일정계획 방법론 및 엔진 소프트웨어를 활용하고 총체적인 생산계획 및 일정계획 업무의 개념 재정립을 통하여, 통합 최적화를 실시간에 지원하는 조선 최적 일정계획 시스템 구축에 궁극적인 목적을 두고 있다. 이를 위해 본 연구에서는 조선 프로세스와 일정계획 업무규칙을 분석하고 이를 바탕으로 "조선 최적 일정계획 시스템"을 개발하였다. 조선 최적 일정계획 시스템은 블록, PE, 탑재, 선내 단계의 일정계획 최적화와 What-if Simulation을 지원하기 위하여 ILOG Solver/scheduler로 구현하였으며, 구현된 시스템에 대해 실제 계획에서 사용하는 데이터를 대상으로 다양한 최적화 기능에 대한 실험을 실시하였다. 실제 데이터를 이용한 실험결과, 풀이시간과 최적해 측면 모두에서 비교적 만족할 만한 결과를 보여주었다.교적 만족할 만한 결과를 보여주었다.
PDF

Search Result 36, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)