Search | Korea Science

Data Dependency Elimination for Parallelism in nested Loops (중첩루프에서 병렬화를 위한 자료 종속성제거)

Song, Wol-Bong;Park, Du-Sun
- The Transactions of the Korea Information Processing Society
- /
- v.5 no.6
- /
- pp.1494-1506
- /
- 1998
본 논문에서는 루프구조의 효율적인 병렬수행을 위한 병렬성 추출에 대하여 불변과 가변 종속거리에 모두적용할 수 있는 통합된 새로운 기법을 제시한다. 이것은 컴파일시간에 순차 루프를 중첩된 DOALL 루프로의 자동 변환에 대한 절차로서, 중첩 루프의 전체적인 병렬화를 하기 위하여 문장들을 반복적으로 수행시키는 것에 의해서 자료 종속을 효과적으로 제거하는 알고리즘이다. 본 논문에 제시된 방법은 성능평가에서도 매우 뛰어난 방법임을 보였다.
PDF

Unfolding Nested Loops of Functional Languages for Multithreaded Architectures (다중스레드 구조를 위한 함수형 언어의 중첩루프 펼침)

하상호
- Journal of KIISE:Software and Applications
- /
- v.29 no.11
- /
- pp.826-836
- /
- 2002
We need an enormous amount of memories for name spaces as well as additional processors if we are to effectively exploit a massively parallelism in nested loops of functional languages such as Id. If there is no sufficient amount of memories enough to exploit that parallelism, the execution of programs can be aborted during the unfolding of loops. Additionally, if loops are overunfolded, compared with the number of processors available, the system performance can be degraded severely due to the overhead of loop unfolding. This paper suggests and analyzes an algorithm which can be used to effectively unfold nested loops of functional languages on multithreaded architectures. This algorithm has a feature to unfold a given nested loop safely and near optimally, considering the system resources of processors and memories available when the loop is to be unfolded.
PDF KSCI

A Program Restructuring framework for Parallel Processing (병렬처리를 위한 프로그램 재구조화)

송월봉
- Journal of the Korea Computer Industry Society
- /
- v.4 no.4
- /
- pp.501-508
- /
- 2003
In this paper A new theory of linear loop transformation called Elimination of Data Dependency(BDD) is presented. The current framework of linear loop transformation cannot identify a significant fraction of parallelism. For this reason, a method to extract the maximum loop parallelism in perfect nested loops is presented. This technique is applicable to general loop nests where the dependence include both distance and directions.
PDF

Parallel Programming for Exploiting Hybrid Parallel Model of CLUMP system and its Performance Evaluation (다중 메모리 모델의 CLUMP 시스템을 이용하기 위한 병렬 프로그래밍 기법과 성능 평가)

이용욱;라마크리쉬나
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.10c
- /
- pp.621-623
- /
- 2000
클러스터를 구성하는 단위 노드로 SMP가 새로운 대안으로 시장에 등장하였다. 이러한 멀티프로세서 클러스터(CLUMP)는 하나의 시스템에 다중 메모리 구조를 가지는데, CLUMP가 가지는 다중 메모리 구조를 효과적으로 사용하기 위해서 본 논문에서는 중첩된 병렬화 프로그램 모델을 제안하였다. 중첩된 병렬화 모델은 중첩된 루프 레벨의 병렬화, 중첩된 태스크 레벨의 병렬화, 그리고 다중 중첩된 병렬화로 나뉜다. 본 논문에서는 중첩된 루프 레벨의 병렬화를 실험대상으로 하여 그 성능을 평가하고 단일 메모리 구조의 병렬화 프로그램과 성능을 비교하였다. 실험 결과 시험한 중첩된 병렬화 모델이 단일 메모리 구조의 병렬화 프로그램에 비하여 좋은 성능을 나타내었지만, 실험대상이 된 루프 레벨 병렬화의 잠재적인 특징으로 인해 실행에 참여하는 노드 수가 많아질수록 성능 향상 폭이 감소하는 결과를 보였다. 프로그램의 성능 향상 폭과 확장성은 문제 크기가 클수록 좋은 특성을 보였다.
PDF

A Communication and Computation Overlapping Model through Loop Sub-partitioning and Dynamic Scheduling in Data Parallel Programs (데이타 병렬 프로그램에서 루프 세부 분할 및 동적 스케쥴링을 통한 통신과 계산의 중첩 모델)

Kim, Jung-Hwan;Han, Sang-Yong;Cho, Seung-Ho;Kim, Heung-Hwan
- Journal of KIISE:Computer Systems and Theory
- /
- v.27 no.1
- /
- pp.23-33
- /
- 2000
We propose a model which overlaps communication with computation for efficient communication in the data-parallel programming paradigm. The overlapping model divides a given loop partition into several sub-partitions to obtain computation which can be overlapped with communication. A loop partition sometimes refers to other data partitions, but not all iterations in the loop partition require non-local data. So, a loop partition may be divided into a set of loop iterations which require non-local data, and a set of loop iterations which do not. Each loop sub-partition is dynamically scheduled depending on associated message arrival, The experimental results for a few benchmarks in IBM SP2 show enhanced performance in our overlapping model.
PDF

불변 및 가변 종속거리를 위한 최적 병렬알고리즘

송월봉
- Proceedings of the Korea Society for Industrial Systems Conference
- /
- 1997.11a
- /
- pp.353-363
- /
- 1997
중첩 루프의 전체적인 병렬화를 하기 위해서 자료 종속을 효과적으로 제거하는 알고리즘이다. 즉 순차 루프를 중첩된 DOALL루프로의 자동 변환에 대한 절차이다.
PDF

(A Design and Implementation of Parallelizing Compiler in Loop Structure) (루프구조의 병렬화 컴파일러 설계 및 구현)

송월봉
- Journal of the Korea Computer Industry Society
- /
- v.3 no.8
- /
- pp.981-988
- /
- 2002
In this paper, a simple parallel compiler of a sequential loop is presented. This is a procedure for the automatic conversion of a sequential loop into a nested parallel DOALL loops at compile time. For this. the source program of Parafrase II parallel compiler is analyzed and a new general method the extracting parallelism in order to parallel processing effectively in nested loop is implemented.
PDF

Extracting Maximum Parallelism for Parallel Computing (병렬 계산을 위한 최대 병렬성 추출 방법)

Park, Doo-Soon
- The Journal of Korean Association of Computer Education
- /
- v.8 no.1
- /
- pp.93-103
- /
- 2005
Since the most program execution time is consumed in a loop structure, extracting parallelism from sequential loop programs is critical for the faster program execution. Conventional studies for extracting the parallelism are focused mostly on a uniform data dependence distance. In this paper, we proposed data dependency elimination method for a nested loop and extended data dependency elimination method to extract parallelism from the loop with procedure calls. The data dependency elimination method and the extended data dependency elimination method can be applied to uniform and non-uniform data dependency distance. We compared our method with conventional methods using CRAY-T3E for the performance evaluation. The results show that the proposed algorithms are very effective.
PDF

A Implementation of Loop Interchange Parallel Compiler (루프인터체인지 병렬컴파일러 구현)

Song, Worl-Bong
- Journal of the Korea Computer Industry Society
- /
- v.8 no.3
- /
- pp.167-172
- /
- 2007
Generally, In a application program the core part for parallel processing is a loop. therefore in this paper, loop interchange parallel compiler is proposed. this is a procedure for the automatic conversion of a loop interchange. According to execution to the outside CDOALL statements of cedar fortran, loop interchange is more effectively method the extracting parallelism in order to parallel processing in iterations. This method will be expected to effectively execution result with mixed into linear conversion and go far toward solving the effectively implementation of the non-unimodular nested loop.
PDF

A Synchronization Method for Parallelizing Nested Do Loop (중첩 루프의 병렬화를 위한 동기화 기법)

Park, Hyun-Ho;Kim, Yong-Man;Bae, Eun-Ho;Youn, Sung-Dae
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.04a
- /
- pp.239-242
- /
- 2001
일반적인 응용 프로그램에서 병렬성이 많은 구조는 루프 구조이며, 루프를 병렬로 처리하기 위해 동기화가 필요하다. 본 논문에서는 다중첨자를 갖는 1차원 배열의 루프의 병렬화를 위해 다수 개의 동일한 종속값을 이용하여 종속함수를 생성하고 이를 이용하여 종속관계가 성립하지 않는 비종속 구간(Non-dependence part)을 구한다. 그리고 동일한 값을 가지는 복수개의 종속값 간의 동기화는 외부루프 분할 기법을 이용하여 간소화 한 후 단일 첨자를 갖는 루프에 동기화를 수행하는 기법을 제시한다.
PDF

Search Result 34, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)