• Title/Summary/Keyword: unified memory

Parallel Approximate String Matching with k-Mismatches for Multiple Fixed-Length Patterns in DNA Sequences on Graphics Processing Units (GPU을 이용한 다중 고정 길이 패턴을 갖는 DNA 시퀀스에 대한 k-Mismatches에 의한 근사적 병열 스트링 매칭)

  • Ho, ThienLuan;Kim, HyunJin;Oh, SeungRohk
    • The Transactions of The Korean Institute of Electrical Engineers / v.66 no.6 / pp.955-961 / 2017
  • In this paper, we propose a parallel approximate string matching algorithm with k-mismatches for multiple fixed-length patterns (PMASM) in DNA sequences. PMASM is developed from parallel single-pattern approximate string matching algorithms to efficiently calculate the Hamming distances for multiple patterns of a fixed length. In the preprocessing phase of PMASM, all target patterns are binary encoded and stored in a look-up memory. With each input character from the input string, the Hamming distances between a substring and all patterns are updated simultaneously based on the binary encoding information in the look-up memory. Moreover, PMASM uses graphics processing units (GPUs) to process the computations in parallel. This paper presents three PMASM implementation methods on GPUs: thread PMASM, block-thread PMASM, and shared-mem PMASM. The shared-mem PMASM method illustrates how to make effective use of the GPU's parallel capacity, and it also exploits special features of the CUDA (Compute Unified Device Architecture) memory structure to optimize performance. In experiments with DNA sequences, the proposed PMASM on a GPU is 385, 77, and 64 times faster than the traditional naive algorithm, the shift-add algorithm, and the single-thread PMASM implementation on a CPU, respectively. On the same NVIDIA GPU model, the proposed approach improves performance by up to 44% and 21% compared with the naive and shift-add algorithms, respectively.
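
The look-up idea in this abstract can be illustrated with a small sequential sketch. It is not the authors' PMASM implementation: the function names `build_lookup` and `k_mismatch_search`, the bitmask layout, and the restriction to the alphabet `ACGT` are assumptions made for illustration, and the GPU parallelization is left out entirely.

```python
def build_lookup(patterns, alphabet="ACGT"):
    """Preprocess patterns into a look-up table (hypothetical layout).

    lookup[c][j] is a bitmask in which bit p is set when
    patterns[p][j] differs from character c.
    """
    m = len(patterns[0])
    assert all(len(p) == m for p in patterns), "patterns must share one length"
    lookup = {c: [0] * m for c in alphabet}
    for c in alphabet:
        for j in range(m):
            for p, pat in enumerate(patterns):
                if pat[j] != c:
                    lookup[c][j] |= 1 << p
    return lookup


def k_mismatch_search(text, patterns, k):
    """Return (start, pattern_index, distance) for every window of text
    whose Hamming distance to a pattern is at most k."""
    m = len(patterns[0])
    lookup = build_lookup(patterns)
    hits = []
    for start in range(len(text) - m + 1):
        dist = [0] * len(patterns)
        for j in range(m):
            # One table read updates the mismatch count of every pattern.
            mask = lookup[text[start + j]][j]
            for p in range(len(patterns)):
                dist[p] += (mask >> p) & 1
        for p, d in enumerate(dist):
            if d <= k:
                hits.append((start, p, d))
    return hits
```

For example, `k_mismatch_search("ACGTACGA", ["ACGT", "ACGA"], 1)` reports both patterns at offsets 0 and 4. Input characters outside the assumed alphabet raise a `KeyError` in this sketch.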

FlexDesigner: Object-Oriented Non-manifold Modeling Kernel with Hierarchically Modularized Structure (FlexDesigner: 계층적으로 모듈화된 구조의 객체 지향 방식 비다양체 모델링 커널)

  • 이강수;이건우
    • Korean Journal of Computational Design and Engineering / v.2 no.4 / pp.222-236 / 1997
  • Conventional solid or surface modeling systems cannot represent both the complete solid model and the abstract model in a unified framework. Recently, non-manifold modeling systems have been proposed to solve this problem. This paper describes FlexDesigner, an open kernel system for modeling non-manifold models. It summarizes the data structure for non-manifold models, the system design methodology, the system modularization, and the typical characteristics of each module in the system. A data structure based on partial-topological elements is adopted to represent the relationships among topological elements. It uses memory efficiently and is topologically complete compared with other published data structures. It can handle many non-manifold situations such as isolated vertices, dangling edges, dangling faces, mixed-dimensional models, and cellular models. FlexDesigner is modularized hierarchically and designed with an object-oriented methodology for reusability. FlexDesigner is developed using C++ and OpenGL on both SGI workstations and IBM PCs.

An Analytic Transform Kernel Derivation Method for Versatile Video Coding (VVC) (VVC 비디오 코덱을 위한 변환 커널 유도 방법)

  • Shrestha, Sandeep;lee, Bumshik
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2019.11a / pp.246-248 / 2019
  • In the ongoing standardization of Versatile Video Coding (VVC), DCT-2, DST-7, and DCT-8 are regarded as the key transform kernels. Storing all of these transform kernels requires a large amount of ROM, which is considered the major problem. To deal with this, a common sparse unified matrix concept is introduced in this paper. From the proposed matrix, transform kernels of any point size (DCT-2, DST-7, DCT-8, DST-4, and DCT-4) can be derived after some mathematical computation. DCT-2, DST-7, and DCT-8 are the major transform kernels used in this paper.
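
As background for the kernels named in this abstract, the sketch below builds floating-point DCT-2, DST-7, and DCT-8 matrices from their standard orthonormal basis functions. It does not reproduce the paper's sparse unified matrix, whose construction the abstract does not give; the function names are hypothetical, and the integer scaling used in actual codecs is omitted.

```python
import math


def dct2(n):
    """n-point orthonormal DCT-2 matrix; row i is the i-th basis function."""
    return [[(math.sqrt(0.5) if i == 0 else 1.0) * math.sqrt(2.0 / n)
             * math.cos(math.pi * i * (2 * j + 1) / (2 * n))
             for j in range(n)] for i in range(n)]


def dst7(n):
    """n-point orthonormal DST-7 matrix."""
    return [[math.sqrt(4.0 / (2 * n + 1))
             * math.sin(math.pi * (2 * i + 1) * (j + 1) / (2 * n + 1))
             for j in range(n)] for i in range(n)]


def dct8(n):
    """n-point orthonormal DCT-8 matrix."""
    return [[math.sqrt(4.0 / (2 * n + 1))
             * math.cos(math.pi * (2 * i + 1) * (2 * j + 1) / (4 * n + 2))
             for j in range(n)] for i in range(n)]


def is_orthonormal(t, tol=1e-9):
    """Check that the rows of t form an orthonormal set."""
    n = len(t)
    for a in range(n):
        for b in range(n):
            dot = sum(t[a][j] * t[b][j] for j in range(n))
            if abs(dot - (1.0 if a == b else 0.0)) > tol:
                return False
    return True


def dct8_from_dst7(n):
    """Derive DCT-8 from DST-7 by reversing each row and negating odd rows."""
    s = dst7(n)
    return [[(-1) ** i * s[i][n - 1 - j] for j in range(n)] for i in range(n)]
```

The last function shows one well-known relationship that lets a decoder avoid storing DCT-8 separately: each DCT-8 row is the corresponding DST-7 row read in reverse, with a sign flip on odd rows.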

A Study on ZigBee Application Model Development using UML (UML을 이용한 지그비 어플리케이션모델개발에 관한 연구)

  • Jung, Seung-Mo;Yoo, Joo-Hyoung;Lim, Dong-Jin
    • Proceedings of the KIEE Conference / 2009.07a / pp.1814-1816 / 2009
  • ZigBee is a rapidly developing technology because its power consumption is low and its communication is highly stable. However, documentation in the form of code written in conventional programming languages such as C or assembly cannot satisfy the various requirements of ZigBee application development. The Unified Modeling Language (UML) is one alternative for solving this problem. UML provides a variety of diagrams with which the results of software development can be presented visually and with which developers can communicate more easily. This paper presents the results of an ongoing study into the application of model-driven methods to ZigBee applications. It also shows that this approach is feasible by comparing the memory usage, latency, and power consumption of UML model-based code with those of handwritten code.

RECENT ADVANCES IN DOMAIN DECOMPOSITION METHODS FOR TOTAL VARIATION MINIMIZATION

  • LEE, CHANG-OCK;PARK, JONGHO
    • Journal of the Korean Society for Industrial and Applied Mathematics / v.24 no.2 / pp.161-197 / 2020
  • Total variation minimization is standard in mathematical imaging, and it has been studied extensively over the last decades. To process large-scale images in real time, it is essential to design parallel algorithms that use distributed-memory computers efficiently. The aim of this paper is to illustrate recent advances in domain decomposition methods for total variation minimization as parallel algorithms. Domain decomposition methods are suitable for parallel computation because they solve a large-scale problem by dividing it into smaller problems and treating them in parallel, and they have already been widely used in structural mechanics. Unlike problems arising in structural mechanics, the energy functionals of total variation minimization problems are in general nonlinear, nonsmooth, and nonseparable. Hence, designing efficient domain decomposition methods for total variation minimization is quite challenging. We describe various existing approaches to domain decomposition methods for total variation minimization in a unified view. We address how the direction of research on the subject has changed over the past few years and suggest several interesting topics for further research.

A Constitutive Model for the Rate-dependent Deformation Behavior of a Solid Polymer (속도 의존적인 폴리머 거동에 대한 구성적 모델)

  • Ho, K.
    • Transactions of Materials Processing / v.22 no.4 / pp.216-222 / 2013
  • Solid polymers exhibit rate-dependent deformation behavior such as nonlinear strain rate sensitivity and stress relaxation like metallic materials. Despite the different microstructures of polymeric and metallic materials, they have common properties with respect to inelastic deformation. Unlike most metallic materials, solid polymers and shape memory alloys (SMAs) exhibit highly nonlinear stress-strain behavior upon unloading. The present work employs the viscoplasticity theory [K. Ho, 2011, Trans. Mater. Process. 20, 350-356] developed for the pseudoelastic behavior of SMAs, which is based on unified state variable theory for the rate-dependent inelastic deformation behavior of typical metallic materials, to depict the curved unloading behavior of polyphenylene oxide (PPO). The constitutive equations are characterized by the evolution laws of two state variables that are related to the elastic modulus and the back stress. The simulation results are compared with the experimental data obtained by Krempl and Khan [2003, Int. J. Plasticity 19, 1069-1095].

Computationally Efficient Implementation of a Hamming Code Decoder Using Graphics Processing Unit

  • Islam, Md Shohidul;Kim, Cheol-Hong;Kim, Jong-Myon
    • Journal of Communications and Networks / v.17 no.2 / pp.198-202 / 2015
  • This paper presents a computationally efficient implementation of a Hamming code decoder on a graphics processing unit (GPU) to support real-time software-defined radio, which is a software alternative for realizing wireless communication. The Hamming code algorithm is challenging to parallelize effectively on a GPU because it works on sparsely located data items with several conditional statements, leading to non-coalesced, long latency, global memory access, and huge thread divergence. To address these issues, we propose an optimized implementation of the Hamming code on the GPU to exploit the higher parallelism inherent in the algorithm. Experimental results using a compute unified device architecture (CUDA)-enabled NVIDIA GeForce GTX 560, including 335 cores, revealed that the proposed approach achieved a 99x speedup versus the equivalent CPU-based implementation.
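
For readers unfamiliar with the decoding step, the following is a minimal sequential sketch of syndrome decoding. The abstract does not state which Hamming code the authors use, so the (7,4) code, the bit layout, and the function names here are assumptions, and the sketch contains none of the paper's GPU optimizations.

```python
def hamming74_encode(data):
    """Encode 4 data bits into a 7-bit codeword.

    Assumed layout (positions 1..7): p1 p2 d1 p3 d2 d3 d4.
    """
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4
    p2 = d1 ^ d3 ^ d4
    p3 = d2 ^ d3 ^ d4
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(codeword):
    """Correct at most one flipped bit and return the 4 data bits."""
    c = list(codeword)
    # Each syndrome bit re-checks one parity group.
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    error_pos = s1 + 2 * s2 + 4 * s3  # 1-based position, 0 means no error
    if error_pos:
        c[error_pos - 1] ^= 1
    return [c[2], c[4], c[5], c[6]]
```

The branch on `error_pos` is an instance of the data-dependent conditional that the abstract identifies as a source of thread divergence when many codewords are decoded in parallel.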

A Study on a Declines in Performance by Memory Copy in CUDA (CUDA의 메모리 복사로 인한 성능 저하 연구)

  • Kang, Jihun;Lee, DaeWon;Kang, InSung;Yu, HeonChang
    • Proceedings of the Korea Information Processing Society Conference / 2013.11a / pp.135-138 / 2013
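  • CUDA (Compute Unified Device Architecture), a GPGPU (General-Purpose Graphics Processing Unit) parallel processing system, has been widely used for high-speed computation. Performing computation in CUDA requires an understanding of its characteristics. CUDA has a host region, handled by the CPU (Central Processing Unit), and a device region, handled by the GPU (Graphics Processing Unit), and computation proceeds by copying data between the two regions. Because of this structure, input data must be transferred from main memory to GPU memory before the GPU can process it. As a result, in addition to the computation time there is a data copy time between main memory and GPU memory, and this additional memory copy time causes overhead. In this paper, we discuss through experiments how the memory copy time, the number of iterations of the computation, and the complexity of the computation affect overall performance.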
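
The trade-off described in this abstract can be made concrete with a simple cost model. This is an illustrative assumption, not the paper's measurement method: it treats the total time as one host-device copy plus a fixed kernel time per iteration, and the function name and the numbers in the usage note are hypothetical.

```python
def copy_overhead_fraction(copy_s, kernel_s, iterations):
    """Fraction of total run time spent copying data.

    Assumes a single copy of copy_s seconds and `iterations` kernel
    runs of kernel_s seconds each, with no overlap between them.
    """
    total = copy_s + iterations * kernel_s
    return copy_s / total
```

Under this model, a 10 ms copy paired with a 1 ms kernel accounts for half of the run time at 10 iterations but less than 1% at 1000 iterations, which matches the intuition that more iterations or heavier kernels amortize the copy.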

Poly-Si MFM (Multi-Functional-Memory) with Channel Recessed Structure

  • Park, Jin-Gwon;Jo, Won-Ju
    • Proceedings of the Korean Vacuum Society Conference / 2012.02a / pp.156-157 / 2012
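  • Multi-functional memory, which can operate as both nonvolatile memory and high-speed volatile memory in a single cell, is becoming increasingly important with the explosive growth of mobile and embedded devices. Accordingly, research on multi-functional memories that apply such fusion technology, such as unified RAM (URAM), has recently attracted attention. Such multi-purpose memory typically combines a silicon-on-insulator (SOI)-based 1T-DRAM with a SONOS-based nonvolatile memory. However, because these memories are mostly implemented on single-crystal SOI wafers, they are expensive and their range of use is limited. If such memory is fabricated using polycrystalline silicon, it can be applied freely to various substrates and can later enable three-dimensional stacked devices, so implementing polycrystalline-silicon-based memory can be considered essential. In this study, we fabricated a multi-functional memory with a channel-recessed structure using polycrystalline silicon and examined its memory characteristics in 1T-DRAM and NVM operation. The substrate had an SOI structure with a 100 nm top amorphous silicon layer and a 200 nm buried oxide layer, and the silicon was crystallized by solid-phase crystallization with a heat treatment at $600^{\circ}C$ for 24 hours. The source/drain was formed using N+ poly-Si, and the recessed channel was formed using an RIE system. The top ONO gate dielectric was deposited by RF sputtering with thicknesses of 5/10/5 nm. Rapid thermal annealing at $950^{\circ}C$ for 30 seconds in an N2/O2 ambient was used to activate the source/drain. To improve the interface states, the device was annealed at $450^{\circ}C$ for 30 minutes in a 2% H2/N2 ambient. The fabricated poly-Si MFM showed a threshold voltage of 2.3 V and a subthreshold swing of 350 mV/dec. Write/erase operations used FN tunneling in the nonvolatile memory mode and impact ionization in the high-speed 1T-DRAM mode. In NVM mode, a memory window of about 2 V was obtained, and retention measurements at $85^{\circ}C$ indicated a memory window of about 0.9 V after 10 years. In 1T-DRAM mode, a retention time of about $30{\mu}s$ and a sensing margin of $5{\mu}A$ were obtained. Applying engineered tunnel barrier technology or excimer-laser crystallization in the future is expected to improve the device characteristics. In this paper, we fabricated a multi-functional memory using polycrystalline silicon and evaluated its memory characteristics. We confirmed that both NVM and 1T-DRAM operation are possible within a single cell of the fabricated device. Owing to the nature of polycrystalline silicon, the device showed poorer characteristics than single-crystal SOI-based multi-functional memory, but we believe this can be addressed through the crystallization method, high-k dielectrics, and an engineered tunnel barrier. In addition, by confirming the program/erase (P/E) characteristics of the O/N/O layer deposited at low temperature by sputtering, we also confirmed the feasibility of implementing MFM on glass, and we expect that system-on-panel (SOP) applications will be possible in the future.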

A Case Report of Non-Motor Symptoms Evaluated Using the Non-Motor Symptom Scale in a Patient with Secondary Parkinsonism Presumed to be Probable Lewy Body Dementia and Improved with Combined Treatment with Herbal Medicine and Acupuncture (루이소체 치매로 추정되는 이차성 파킨슨증 환자의 Non-Motor Symptom Scale(NMSS)로 평가한 비운동성 증상을 한약과 침의 복합치료로 호전시킨 증례보고 1례)

  • Roh, Min-yeong;Lee, Ji-hyun;Han, Yang-hee;Leem, Jung-tae
    • The Journal of Internal Korean Medicine / v.42 no.5 / pp.833-845 / 2021
  • Parkinson's syndrome is a degenerative brain disease that presents characteristic motor symptoms of tremor, rigidity, and gait disturbance. In addition to these motor symptoms, Parkinson's syndrome also presents non-motor symptoms (NMSs) such as sleep disturbance and cognitive decline. NMSs reduce the patient's quality of life and psychosocial functioning and impose an economic burden on the patient, so appropriate evaluation and treatment are required. Lewy body dementia is one of several diseases belonging to Parkinson's syndrome. Symptoms such as cognitive and memory impairment and hallucinations occur together with Parkinsonism. Although drug therapy is used to treat non-motor symptoms, it has limitations such as side effects, which has stimulated interest in complementary treatment methods such as oriental medicine treatment, dance, and yoga. The patient in this case complained of tremor in the right upper extremity, muscle hypertension and pain, and persistent vision, memory, and cognitive decline. The patient was diagnosed with probable Lewy body dementia. The patient was hospitalized for 4 months and received acupuncture and herbal medicines. After treatment, the patient's NMS scale score decreased from 90 to 63, and the Unified Parkinson's Disease Rating Scale score (sum of parts I, II, and III) decreased from 17 points to 8 points. The Beck Depression Inventory score decreased from 22 points to 13 points. In addition, the patient's subjective evaluation indicated improvement. In this case, a patient diagnosed with probable Lewy body dementia who did not respond to the standard treatment and did not want to take medications showed improvement in not only motor symptoms but also NMSs after integrative Korean medicine treatment.