Search | Korea Science

Comparison of Parallel Computation Performances for 3D Wave Propagation Modeling using a Xeon Phi x200 Processor (제온 파이 x200 프로세서를 이용한 3차원 음향 파동 전파 모델링 병렬 연산 성능 비교)

Lee, Jongwoo;Ha, Wansoo
- Geophysics and Geophysical Exploration
- /
- v.21 no.4
- /
- pp.213-219
- /
- 2018
In this study, we simulated 3D wave propagation modeling using a Xeon Phi x200 processor and compared the parallel computation performance with that using a Xeon CPU. Unlike the 1st generation Xeon Phi coprocessor codenamed Knights Corner, the 2nd generation x200 Xeon Phi processor requires no additional communication between the internal memory and the main memory since it can run an operating system directly. The Xeon Phi x200 processor can run large-scale computation independently, with the large main memory and the high-bandwidth memory. For comparison of parallel computation, we performed the modeling using the MPI (Message Passing Interface) and OpenMP (Open Multi-Processing) libraries. Numerical examples using the SEG/EAGE salt model demonstrated that we can achieve 2.69 to 3.24 times faster modeling performance using the Xeon Phi with a large number of computational cores and high-bandwidth memory compared to that using the 12-core CPU.
https://doi.org/10.7582/GGE.2018.21.4.213 인용 PDF KSCI HTML

Parallelization of Poisson equation solver on Intel Xeon Phi environment (인텔 제온 파이를 활용한 푸아송 방정식 풀이의 병렬화)

Cho, Kyu Nam;Seo, Jae Min;Kim, Do-Hyeong;Ryu, Hoon;Jeong, Chang-sung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2015.10a
- /
- pp.178-180
- /
- 2015
코프로세서(Co-processor)를 사용한 병렬 처리 시스템은 멀티코어 프로그래밍과 함께 과학기술계산 분야 프로그램 개발에 많이 사용되고 있다. 본 연구에서는 CPU 기반의 코프로세서인 인텔 제온 파이 환경에서의 푸아송 방정식 해법을 병렬화 하였다. 본 연구를 통해서 인텔 제온 파이 활용 가능성을 확인 하고, 일반적인 병렬화 기법이 인텔 제온 파이 환경에서도 적합한지를 확인하였다.
https://doi.org/10.3745/PKIPS.y2015m10a.178 인용 PDF

A Study of Distribute Computing Performance Using a Convergence of Xeon-Phi Processor and Quantum ESPRESSO (퀀텀 에스프레소와 제온 파이 프로세서의 융합을 이용한 분산컴퓨팅 성능에 대한 연구)

Park, Young-Soo;Park, Koo-Rack;Kim, Dong-Hyun
- Journal of the Korea Convergence Society
- /
- v.7 no.5
- /
- pp.15-21
- /
- 2016
Recently the degree of integration of processor and developed rapidly. However, clock speed is not increased, a situation that increases the number of cores in the processor. In this paper, we analyze the performance of a typical Intel Xeon Phi of many core process used for the current operation accelerate. Utilizing the Quantum ESPRESSO, which was calculated using the FFTW library. By varying the number of ranks in MPI when running the benchmarks the performance Xeon Phi. The result shows a good performance in the handling of four job on one physical core. However, four or more to expand the number of MPI Rank is degraded. Through this convergence it was found to improve the performance of Quantum ESPRESSO. It is possible to check the hardware characteristics of the Xeon Phi.
https://doi.org/10.15207/JKCS.2016.7.5.015 인용 PDF KSCI

Parallelizing 3D Frequency-domain Acoustic Wave Propagation Modeling using a Xeon Phi Coprocessor (제온 파이 보조 프로세서를 이용한 3차원 주파수 영역 음향파 파동 전파 모델링 병렬화)

Ryu, Donghyun;Jo, Sang Hoon;Ha, Wansoo
- Geophysics and Geophysical Exploration
- /
- v.20 no.3
- /
- pp.129-136
- /
- 2017
3D seismic data processing methods such as full waveform inversion or reverse-time migration require 3D wave propagation modeling and heavy calculations. We compared efficiency and accuracy of a Xeon Phi coprocessor to those of a high-end server CPU using 3D frequency-domain wave propagation modeling. We adopted the OpenMP parallel programming to the time-domain finite difference algorithm by considering the characteristics of the Xeon Phi coprocessors. We applied the Fourier transform using a running-integration to obtain the frequency-domain wavefield. A numerical test on frequency-domain wavefield modeling was performed using the 3D SEG/EAGE salt velocity model. Consequently, we could obtain an accurate frequency-domain wavefield and attain a 1.44x speedup using the Xeon Phi coprocessor compared to the CPU.
https://doi.org/10.7582/GGE.2017.20.3.129 인용 PDF KSCI

폐타이어 유효자원에의 길

Korea Tire Manufacturers Association
- The tire
- /
- s.52
- /
- pp.6-8
- /
- 1974
한때는 폐기물 즉 쓰레기로 취급되었든 타이어를 유효자원으로 재평가하여 자원화를 도모코저 하는 기술개발이 진척되고 있다. 일본 신소제강소에선 이미 실용 프란트화에 성공하고 또 일본 제온에서도 모델 프란트를 착수하고 있다. 이렇게 되면 소각한다든가, 15㎝ 이하로 파쇄해서 매립한다는 것은 자원을 낭비한다고 하지 않을 수 없다. 그래서 금번은 이 쓰레기로부터 자원으로 전환하는 폐타이어 대책의 동향을 추구해보았다.
PDF

Benchmarking the Intel Xeon Phi Coprocessor with Intel MKL library (인텔 MKL 라이브러리를 이용한 Xeon Phi Coprocessor 벤치마크)

Park, Young-Soo;Park, Koo-Rack;Kim, Jin-Mook
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2014.07a
- /
- pp.1-4
- /
- 2014
인텔 Many Integrated Core (MIC) 아키텍쳐는 61개의 코어가 하나의 칩에 결합되어 있다. Xeon Phi 로 명명된 인텔 MIC는 인텔 E5 Xeon CPU 보다 2배의 single precision GFLOPs 성능을 제공한다. 인텔 MIC 는 수치연산에 최적화 되어 있는 아키텍쳐이다. 우리는 Xeon Phi 7120P를 가지고 벤치마킹을 하였고 클락스피드 1.238GHz, 61Core 이고 한 개의 코어당 4쓰레드를 사용하며 이론상 최고 성능은 Peak Double Precision(GFLOP)는 약 2-TFlops 이다. 이에 우리는 인텔 X86 아키텍쳐에서 openMP 와 인텔 MKL(Math kernel library) 라이브러리를 이용한 병렬프로그램을 작성하여 쓰레드 수를 증가 시키면서 인텔 Xeon Phi 와 E5 Xeon CPU에서 single precision 성능을 벤치마킹 하여, Xeon Phi 와 Xeon E5 의 이론적인 성능을 비교해 보고자 한다. 또한 openMP와 인텔 MKL라이브러리를 사용한 병렬환경에서 CPU의 성능 지표인 클락스피드와 코어수 외에 Vector unit size 의 크기가 성능에 어떤 영향을 미치는지 살펴보았다.
PDF

Distributed Stream Processing System with apache Hadoop for PTAM on Xeon Phi Cluster (PTAM을 위한 제온파이 기반 하둡 분산 스트림 프로세싱 시스템)

Seo, Jae Min;Cho, Kyu Nam;Kim, Do Hyung;Jeong, Chang-Sung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2015.10a
- /
- pp.184-186
- /
- 2015
본 논문에서는 PTAM을 위한 새로운 분산 스트림 프로세싱 시스템을 제안한다. PTAM은 하나의 시스템에서 동작하도록 설계되었다. 이는 PTAM이 가지고 있는 한계점을 말해주는 부분인데, PTAM은 Bundle Adjustment의 계산 부하가 커지는 경우에 map을 구축하는데 있어 많은 시간과 리소스가 필요하다. 이에 하둡을 통해 계산 부하를 분산하고, PE(Processing Element)를 Xeon phi 시스템을 통해 동작되는 시스템을 제안한다.
https://doi.org/10.3745/PKIPS.y2015m10a.184 인용 PDF

Cytogenetic Analysis of the Triploid Pacific Abalone, Haliotis discus hannai (북방전복, Haliotis discus hannai 3배체의 세포유전학적 연구)

Jee, Young-Ju;Chang, Young-Jin
- The Korean Journal of Malacology
- /
- v.28 no.1
- /
- pp.37-43
- /
- 2012
In this study, we invesgated a cytogenetic analysis of the Pacific triploid abalone, Haliotis discus hannai induced by low temperature treatment. We got a lot of mitotic metaphase chromosome spreads from the triploid and diploid Pacific abalones' hatched larvae (trochophores). The chromosome number of diploid abalone was 2n = 36 and that of triploid abalone was 3n = 54, so the chromosome number of triploid abalone was 1.5 times higher than that of diploid abalone. We developed a modified flow cytometric method for Pacific abalone from the existing methods. We uesd 51 months aged triploid and diploid Pacific abalones' hemolymph to get the DNA contents by flow cytometry. The DNA content of diploid abalone was 1.743 pg/cell and the DNA content of triploid abalone was 1.49 times higher than that of diploid one. It proved that triploid abalone was consisted with two sets of maternal diploid and one set of paternal genome.
https://doi.org/10.9710/kjm.2012.28.1.037 인용 PDF KSCI

Automatic gasometer reading system using selective optical character recognition (관심 문자열 인식 기술을 이용한 가스계량기 자동 검침 시스템)

Lee, Kyohyuk;Kim, Taeyeon;Kim, Wooju
- Journal of Intelligence and Information Systems
- /
- v.26 no.2
- /
- pp.1-25
- /
- 2020
In this paper, we suggest an application system architecture which provides accurate, fast and efficient automatic gasometer reading function. The system captures gasometer image using mobile device camera, transmits the image to a cloud server on top of private LTE network, and analyzes the image to extract character information of device ID and gas usage amount by selective optical character recognition based on deep learning technology. In general, there are many types of character in an image and optical character recognition technology extracts all character information in an image. But some applications need to ignore non-of-interest types of character and only have to focus on some specific types of characters. For an example of the application, automatic gasometer reading system only need to extract device ID and gas usage amount character information from gasometer images to send bill to users. Non-of-interest character strings, such as device type, manufacturer, manufacturing date, specification and etc., are not valuable information to the application. Thus, the application have to analyze point of interest region and specific types of characters to extract valuable information only. We adopted CNN (Convolutional Neural Network) based object detection and CRNN (Convolutional Recurrent Neural Network) technology for selective optical character recognition which only analyze point of interest region for selective character information extraction. We build up 3 neural networks for the application system. The first is a convolutional neural network which detects point of interest region of gas usage amount and device ID information character strings, the second is another convolutional neural network which transforms spatial information of point of interest region to spatial sequential feature vectors, and the third is bi-directional long short term memory network which converts spatial sequential information to character strings using time-series analysis mapping from feature vectors to character strings. In this research, point of interest character strings are device ID and gas usage amount. Device ID consists of 12 arabic character strings and gas usage amount consists of 4 ~ 5 arabic character strings. All system components are implemented in Amazon Web Service Cloud with Intel Zeon E5-2686 v4 CPU and NVidia TESLA V100 GPU. The system architecture adopts master-lave processing structure for efficient and fast parallel processing coping with about 700,000 requests per day. Mobile device captures gasometer image and transmits to master process in AWS cloud. Master process runs on Intel Zeon CPU and pushes reading request from mobile device to an input queue with FIFO (First In First Out) structure. Slave process consists of 3 types of deep neural networks which conduct character recognition process and runs on NVidia GPU module. Slave process is always polling the input queue to get recognition request. If there are some requests from master process in the input queue, slave process converts the image in the input queue to device ID character string, gas usage amount character string and position information of the strings, returns the information to output queue, and switch to idle mode to poll the input queue. Master process gets final information form the output queue and delivers the information to the mobile device. We used total 27,120 gasometer images for training, validation and testing of 3 types of deep neural network. 22,985 images were used for training and validation, 4,135 images were used for testing. We randomly splitted 22,985 images with 8:2 ratio for training and validation respectively for each training epoch. 4,135 test image were categorized into 5 types (Normal, noise, reflex, scale and slant). Normal data is clean image data, noise means image with noise signal, relfex means image with light reflection in gasometer region, scale means images with small object size due to long-distance capturing and slant means images which is not horizontally flat. Final character string recognition accuracies for device ID and gas usage amount of normal data are 0.960 and 0.864 respectively.
https://doi.org/10.13088/jiis.2020.26.2.001 인용 PDF KSCI

Search Result 9, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)