• Title/Summary/Keyword: Memory reduction

Search Result 469, Processing Time 0.026 seconds

Parallel Implementations of Digital Focus Indices Based on Minimax Search Using Multi-Core Processors

  • HyungTae, Kim;Duk-Yeon, Lee;Dongwoon, Choi;Jaehyeon, Kang;Dong-Wook, Lee
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.2
    • /
    • pp.542-558
    • /
    • 2023
  • A digital focus index (DFI) is a value used to determine image focus in scientific apparatus and smart devices. Automatic focus (AF) is an iterative and time-consuming procedure; however, its processing time can be reduced using a general processing unit (GPU) and a multi-core processor (MCP). In this study, parallel architectures of a minimax search algorithm (MSA) are applied to two DFIs: range algorithm (RA) and image contrast (CT). The DFIs are based on a histogram; however, the parallel computation of the histogram is conventionally inefficient because of the bank conflict in shared memory. The parallel architectures of RA and CT are constructed using parallel reduction for MSA, which is performed through parallel relative rating of the image pixel pairs and halved the rating in every step. The array size is then decreased to one, and the minimax is determined at the final reduction. Kernels for the architectures are constructed using open source software to make it relatively platform independent. The kernels are tested in a hexa-core PC and an embedded device using Lenna images of various sizes based on the resolutions of industrial cameras. The performance of the kernels for the DFIs was investigated in terms of processing speed and computational acceleration; the maximum acceleration was 32.6× in the best case and the MCP exhibited a higher performance.

Reproductive Toxicity of DA-125, A New Anthracycline Anticancer Agent: Peri- and Postnatal Study in Rats (새로운 안트라사이클린계 항암제 DA-125의 생식독성연구: 랫트 주산기 및 수유기시험)

  • 정문구;이순복;한상섭;노정구
    • Biomolecules & Therapeutics
    • /
    • v.3 no.1
    • /
    • pp.38-46
    • /
    • 1995
  • DA-125, a new anthracycline antitumor antibiotic, was administered at dose levels of 0, 0.04, 0.2 and 1.0 mg/kg/day intravenously to pregnant and subsequently delivered Sprague-Dawley rats from day 17 of gestation to day 21 of lactation. Effects of test agent on general toxicity of dams and growth, behaviour and mating performance of F1 offspring were examined. At 1 mg/kg, one out of the twentytwo dams showed difficult delivery, characterized by a stillbirth. Reduction in body weight, loss in food intake, and decrease in spleen weight were also observed in dams. In addition, the lower rates of successful performances in memory test (28.6%) and necrosis of tail end (9.5%) were seen in F1 offspring. At 0.04 and 0.2 mg/kg, no toxic effect on dams and F1 offspring was observed. There were no malformed Fl and F2 fetuses in all groups. The results indicate that the no effect dose levels(NOELs) of DA-125 are 0.2 mg/kg/day for dams and Fl offspring, and over 1 mg/kg/day for F2 fetuses.

  • PDF

Turbine Alignment (II): Computer Program Development (발전설비의 터빈 축정렬 (II) : 자동화를 위한 전산 프로그램 개발)

  • Hwang, Cheol-Ho;Kim, Jeong-Tae;Jun, Oh-Sung;Lee, Hyun;Lee, Byung-Jun
    • Journal of KSNVE
    • /
    • v.4 no.1
    • /
    • pp.33-42
    • /
    • 1994
  • When a vibration is generated due to the misalignment, the reduction of the vibration level is not attainable unless a correct shaft alignment is conducted. In a turbine system, an alignment procedure requires quite a lot amount of expense and time. To reduce this effort, an algorithm of the turbine alignment is developed to be used in the computer program. The program consists of five parts : input, calculation, display of the results, file management, and printer output. In the input part, users must provide the data on the turbine number, the reference value of the alignment, and the number of the feet of the generator. In calculation, the moving distance of the bearing and the necessary amount of the shims are calculated. In the display and the output parts, the calculated results are displayed and calculated. In the display and the output parts, the calculated results are displayed and printed. Then, by using the file management, results and procedures conducted are saved in the floppy diskette or in the hard disk. The developed program can be run in IBM PC compatible with more than 640 KB of main memory with the operating system of MS-DOS v 3.3 or higher. It is developed for novice users with no experience or specialty in this field. The program is not only useful in the power plant application, but also helpful for recording of the alignment procedures.

  • PDF

An Efficient Hybrid Diagnosis Algorithm for Sequential Circuits (순차 회로를 위한 효율적인 혼합 고장 진단 알고리듬)

  • 김지혜;이주환;강성호
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.41 no.5
    • /
    • pp.51-60
    • /
    • 2004
  • Due to the improvements in circuit design and manufacturing technique, the complexity of a circuit is growing. Since the complexity of a circuit causes high frequency of faults, it is very important to locate faults for improvement of yield and reduction of production cost. But unfortunately it takes a long time to find sites of defects by e-beam proving if the physical level. A fault diagnosis algorithm in the Sate level has meaning to reduce diagnosis time by limiting fault sites. In this paper, we propose an efficient fault diagnosis algorithm in the logical level. Our method is hybrid fault diagnosis algorithm using a new fault dictionary and additional fault simulation which minimizes memory consumption and simulation time.

Low-Power Systolic Array Viterbi Decoder Implementation With A Clock-gating Method (Clock-gating 방법을 사용한 저전력 시스톨릭 어레이 비터비 복호기 구현)

  • Ryu Je-Hyuk;Cho Jun-Dong
    • The KIPS Transactions:PartA
    • /
    • v.12A no.1 s.91
    • /
    • pp.1-6
    • /
    • 2005
  • This paper presents a new algorithm on low power survivor path memory implementation of the trace-back systolic array Viterbi algorithm. A novel idea is to reuse the already-generated trace-back routes to reduce the number of trace-back operations. And the spurious switching activity of the trace-back unit is reduced by making use of a clock gating method. Using the SYNOPSYS power estimation tool, DesignPower, our experimental result shows the average $40{\%}$ power reduction and $23{\%}$ area increase against the trace-back unit introduced in [1].

Data Input/Output Time Reduction Scheme with the Simultaneous Transmission Method for Multi-participants Video Conference System (다자간 화상회의 시스템에서의 동시 전송방법에 의한 데이터 입출력 시간 단축 방안)

  • 김현기
    • Journal of Korea Multimedia Society
    • /
    • v.3 no.3
    • /
    • pp.234-240
    • /
    • 2000
  • In this paper, we propose the method in which a stream of multimedia data simultaneously transfers to the main memory and the multimedia processor from the network interface card using a conventional system bus. The proposed method can reduce the input/output time of multimedia data and improve the data stream in the system bus. Also, we compared the number of system bus accesses, bus cycles and data transmission time to the number of participants between the proposed method and the conventional methods in the multi-party video conference systems. The comparison results of performance anticipate that the number of bus accesses of the proposed method was reduced by 50%, and the total transmission time was reduced by 75% as much as the conventional method regardless of the relation of the participant numbers.

  • PDF

Study on Noise Reduction of Plasma Display Panel (플라즈마 디스플레이의 소음 저감 연구)

  • Park, Dae-Kyong;Kweon, Hae-Sub;Jang, Dong-Seob
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2002.11b
    • /
    • pp.693-698
    • /
    • 2002
  • For the evaluation of the plasma display panel (PDP)'s noise, vibration and sound characteristics of fanless PDP are measured and investigated. PDP is a type of two-electrode vacuum tube which operates on the same principle as a household fluorescent light. An inert gas such as argon or neon is injected between two glass plates on which transparent electrodes have been formed, and the glass is illuminated by generating discharge. For this discharge, both high voltage and currents are needed and cause an acoustic noise. We investigated the noise characteristics connected with both a electromagnetic elements from SMPS to panel through X, Y and logic board, and a mechanical elements form panel to case through transfer path which related with vibration and heat. To reduce the noise of PDP, a discharge pulse memory design related with both higher brightness and lower power consumption is important and mechanical characteristics connected with dissipation process of both heat and vibration generated by panel discharge must be investigated.

  • PDF

Parallel processing in structural reliability

  • Pellissetti, M.F.
    • Structural Engineering and Mechanics
    • /
    • v.32 no.1
    • /
    • pp.95-126
    • /
    • 2009
  • The present contribution addresses the parallelization of advanced simulation methods for structural reliability analysis, which have recently been developed for large-scale structures with a high number of uncertain parameters. In particular, the Line Sampling method and the Subset Simulation method are considered. The proposed parallel algorithms exploit the parallelism associated with the possibility to simultaneously perform independent FE analyses. For the Line Sampling method a parallelization scheme is proposed both for the actual sampling process, and for the statistical gradient estimation method used to identify the so-called important direction of the Line Sampling scheme. Two parallelization strategies are investigated for the Subset Simulation method: the first one consists in the embarrassingly parallel advancement of distinct Markov chains; in this case the speedup is bounded by the number of chains advanced simultaneously. The second parallel Subset Simulation algorithm utilizes the concept of speculative computing. Speedup measurements in context with the FE model of a multistory building (24,000 DOFs) show the reduction of the wall-clock time to a very viable amount (<10 minutes for Line Sampling and ${\approx}$ 1 hour for Subset Simulation). The measurements, conducted on clusters of multi-core nodes, also indicate a strong sensitivity of the parallel performance to the load level of the nodes, in terms of the number of simultaneously used cores. This performance degradation is related to memory bottlenecks during the modal analysis required during each FE analysis.

The Advanced Rasterizer and Cache Memory Architecture for Latency Reduction Of 3D GPU (3차원 그래픽 가속기의 지연 감소를 위한 개선된 래스터라이져 및 캐쉬 메모리 구조 제안 및 실험)

  • Park Jin-Hong;Kim Il-San;Park Woo-Chan;Han Tack-Don
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07a
    • /
    • pp.727-729
    • /
    • 2005
  • 현재 3차원 그래픽 가속기에서 성능 향상에 대한 문제점으로 대두되고 있는 것은 실제 화면에 그려지는 정보가 저장되는 프레임버퍼에 대한 접근 지연이다. 따라서 본 논문은 기존 픽셀 캐쉬가 포함된 래스터라이져 구조에서 캐쉬 읽기 접근 실패 시 발생하는 패널티와 이에 따른 프레임버퍼에 대한 지연이 발생하는 문제점을 개선하고자, 기존 래스터라이져를 래스터라이져와 합성기로 구분하고 그 사이에 캐쉬 읽기 접근 실패 시 프레임 버퍼에서 정보를 읽어오지 않는 깊이 캐쉬와 색상 캐쉬가 쌍을 이룬 픽셀 캐쉬 메모리 시스템으로 구성된 개선된 3차원 그래픽 가속기 구조을 제안하고 실험을 수행하였다. 실험 결과 제안하는 3차원 그래픽 가속기 구조가 기존 구조에 비해 캐쉬 접근 실패율이 약 $23\%$ 감소하였으며, 평균 메모리 접근 사이클이 $10\%-13\%$ 감소하였으며 이는 상당수의 프레임버퍼에 대한 접근 지연을 감소시킨 것이다. 합성기와 메모리 간의 대역폭은 약 $10\%$ 증가하지만 파이프라인의 작업에는 영향을 미치지는 않는다.

  • PDF

Nonlinear dynamic analysis of RC frames using cyclic moment-curvature relation

  • Kwak, Hyo-Gyoung;Kim, Sun-Pil;Kim, Ji-Eun
    • Structural Engineering and Mechanics
    • /
    • v.17 no.3_4
    • /
    • pp.357-378
    • /
    • 2004
  • Nonlinear dynamic analysis of a reinforced concrete (RC) frame under earthquake loading is performed in this paper on the basis of a hysteretic moment-curvature relation. Unlike previous analytical moment-curvature relations which take into account the flexural deformation only with the perfect-bond assumption, by introducing an equivalent flexural stiffness, the proposed relation considers the rigid-body-motion due to anchorage slip at the fixed end, which accounts for more than 50% of the total deformation. The advantage of the proposed relation, compared with both the layered section approach and the multi-component model, may be the ease of its application to a complex structure composed of many elements and on the reduction in calculation time and memory space. Describing the structural response more exactly becomes possible through the use of curved unloading and reloading branches inferred from the stress-strain relation of steel and consideration of the pinching effect caused by axial force. Finally, the applicability of the proposed model to the nonlinear dynamic analysis of RC structures is established through correlation studies between analytical and experimental results.