• Title/Abstract/Keywords: batch processing

Search results: 290 items (processing time: 0.023 s)

"Multi-use Data Platform": Hadoop 2.0 and Related Data Processing Framework Technologies

  • 김직수
    • 방송과미디어
    • /
    • Vol. 22, No. 4
    • /
    • pp.11-17
    • /
    • 2017
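  • This article describes the main features of Hadoop 2.0, which is evolving into a multi-application data platform, and the various data processing frameworks related to it. Unlike Hadoop 1.0, which was optimized for MapReduce-based batch processing, the Hadoop 2.0 platform that began with the introduction of YARN supports a variety of data processing workflows (batch, interactive, streaming, etc.). In addition, technologies that were mainly used in high-performance computing have recently come to be supported on the Hadoop 2.0 platform. Finally, as a case study in YARN application development, we introduce a new data processing framework for Many-Task Computing (MTC) applications that our research team is developing.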

Downstream Processing of Recombinant Hirudin Produced in Saccharomyces cerevisiae

  • Chung, Bong-Hyun;Kim, Won-Kyung;Rao, K.Jagannadha;Kim, Chul-Ho;Rhee, Sang-Ki
    • Journal of Microbiology and Biotechnology
    • /
    • Vol. 9, No. 2
    • /
    • pp.179-183
    • /
    • 1999
  • A recombinant form of hirudin, a potent thrombin-specific inhibitor derived from the bloodsucking leech, was expressed as a secretory product in Saccharomyces cerevisiae under the control of the GAL10 promoter and the mating factor $\alpha$ pre-pro leader sequence. In an attempt to produce recombinant hirudin (r-Hir) of therapeutic purity in large quantities, fed-batch fermentation was carried out using this recombinant yeast, and downstream processing was subsequently developed with preparative-scale column chromatography systems. About 234 mg/l of biologically active r-Hir was produced as a secretory product by the fed-batch fermentation strategy developed for efficient downstream processing. Using a two-step chromatography process (anion exchange chromatography followed by reverse-phase HPLC), the r-Hir was purified to >98% purity with an overall recovery yield of 84%. N-terminal amino acid sequencing showed that the purified r-Hir had the predicted N-terminal amino acid sequence. The biological activity of the purified r-Hir in inhibiting thrombin was also identical to that of commercial hirudin.

A Java Distributed Batch-processing System using Network of Workstation

  • 전진수;김정선
    • 한국정보과학회논문지:컴퓨팅의 실제 및 레터
    • /
    • Vol. 5, No. 5
    • /
    • pp.583-594
    • /
    • 1999
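  • Rapid advances in VLSI and network technology have made it common to build distributed computing environments for sharing resources. However, usage patterns of the many computers connected to a network show that, depending on the type of user and the time of day, these computers are often left idle. If idle computing resources can be fully exploited, powerful aggregate computing power can be obtained without large new budget investments. This paper describes the Java Distributed Batch-processing System (JDBS), which was developed to make maximum use of idle computers by building a workstation pool on top of an existing distributed computing environment. JDBS is a distributed batch processing system that processes independent CPU-intensive jobs in batch form. Because it is implemented in Java, a wide variety of machine types can join the pool, and the system is much easier to set up than existing systems similar to JDBS. In addition, it improves scalability and stability by using a multi-cluster structure and intelligent strategies, and it provides a graphical interface that makes it easy to join and leave the pool, submit jobs, and monitor submitted jobs.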
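
The core idea in this abstract, handing independent CPU-intensive jobs to a pool of workers and collecting the results, can be illustrated with a minimal Python sketch. This is not JDBS itself: the names `count_primes` and `run_batch` are hypothetical, the prime-counting job is only a placeholder workload, and local threads stand in for the pooled workstations.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def count_primes(limit: int) -> int:
    """Placeholder CPU-bound job (an assumption): count the primes below `limit`."""
    sieve = [True] * max(limit, 2)
    sieve[0] = sieve[1] = False
    for n in range(2, int(limit ** 0.5) + 1):
        if sieve[n]:
            # Cross out every multiple of n, starting at n squared.
            for multiple in range(n * n, limit, n):
                sieve[multiple] = False
    return sum(sieve[:limit])


def run_batch(jobs: dict, pool_size: int = 4) -> dict:
    """Submit every independent job to the pool and gather results as they finish."""
    results = {}
    with ThreadPoolExecutor(max_workers=pool_size) as pool:
        # Map each future back to the id of the job it belongs to.
        futures = {
            pool.submit(count_primes, limit): job_id
            for job_id, limit in jobs.items()
        }
        for future in as_completed(futures):
            results[futures[future]] = future.result()
    return results


if __name__ == "__main__":
    print(run_batch({"job-a": 10, "job-b": 100, "job-c": 1000}))
```

Threads keep the sketch short and portable. In standard CPython they do not execute CPU-bound code in parallel, so a process pool or remote workers would be the closer analogue of a workstation pool.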

A Study on Evaluation of Structural Integrity and Fatigue Analysis for the Bogie Frame of Monorail

  • 고희영;신광복;이광섭;이은규
    • 한국철도학회:학술대회논문집
    • /
    • 한국철도학회 2010 Spring Conference Proceedings
    • /
    • pp.609-614
    • /
    • 2010
  • In this paper, the structural integrity and fatigue strength of the bogie frame of a monorail being developed domestically were evaluated. At present, no evaluation standard has been established for monorail bogie frames. Therefore, the structural integrity and fatigue strength of the bogie frame were evaluated on the basis of the UIC 615-4 standard. The structural integrity of the designed bogie frame was evaluated from the displacement and von Mises stress under each load condition. The fatigue strength was evaluated under the combined main in-service load conditions specified in UIC 615-4 and compared with the result of a fatigue analysis using the batch-processing function of winLIFE v3.1. The results show that the designed bogie frame satisfied the structural integrity and fatigue strength requirements, and that the fatigue analysis using batch processing was more effective than the conventional fatigue analysis using combined load conditions.

A Study on Batch-processing in Performing Design-changes of Robot-links using Parametric Method

  • 박태원;문하경;정일호;서종휘;김혁;최용원;최재락
    • 한국시뮬레이션학회논문지
    • /
    • Vol. 14, No. 1
    • /
    • pp.43-53
    • /
    • 2005
  • We developed a software module that lets robot designers carry out their work faster and more easily. The parametric modeler is built on a virtual robot design program, a powerful piece of software that can be used to solve various problems in robot kinematics and dynamics. With the parametric modeler we developed, the positions of all joints and links are updated automatically when the designer changes one joint or one link in the robot system. Without the parametric method, robot designers must change the positions of all connected joints and links by hand, which can be time-consuming. Using the parametric modeler, designers can apply design changes to robot links efficiently through batch processing.

Sentiment Orientation Using Deep Learning Sequential and Bidirectional Models

  • Alyamani, Hasan J.
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 21, No. 11
    • /
    • pp.23-30
    • /
    • 2021
  • Sentiment analysis has become a very important field of research because posting reviews is becoming a trend. Much work has been done to mine this data with supervised, unsupervised, and semi-supervised machine learning methods. Feature engineering is a complex and technical part of machine learning. Deep learning is a newer trend in which this laborious work can be done automatically. Many researchers have worked on deep learning with Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) neural networks, which require high processing speed and memory. Here the author proposes two deep learning models, simple and bidirectional, that can work on text data at normal processing speed. Finally, the two models are compared, and the bidirectional model is found to be better: the simple model achieves 50% accuracy, whereas the bidirectional deep learning model achieves 99% accuracy on the training data and 78% accuracy on the test data. These results are based on 10 epochs and a batch size of 40. The accuracy might be increased further by trying different numbers of epochs and batch sizes.

Lambda Architecture Design using Apache Kudu and Impala

  • 황윤영;이필원;신용태
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2020 Spring Conference
    • /
    • pp.60-62
    • /
    • 2020
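  • The volume of data has grown greatly with advances in technology, and a variety of big data processing platforms have emerged. The most widely used of these is Hadoop, developed by the Apache Software Foundation, and Hadoop is also used in the IoT field. However, existing Hadoop-based environments for collecting and analyzing IoT sensor data suffer from NameNode overload caused by small files in HDFS, a core Hadoop project, and from the inability to update or delete imported data. This paper designs a Lambda Architecture using Apache Kudu and Impala. The proposed architecture classifies IoT sensor data into cold data and hot data and stores each in storage suited to its characteristics. By using the batch view generated through batch processing together with the real-time view generated through Apache Kudu and Impala, it resolves the problems of existing Hadoop-based IoT sensor data collection and analysis environments and shortens the time users need to access analyzed data.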
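
The query-time combination of a batch view and a real-time view that this abstract describes can be illustrated with a small Python sketch. It does not use Apache Kudu, Impala, or HDFS: plain in-memory dictionaries stand in for both stores, and the function names `build_view` and `query_average`, as well as the sample readings, are hypothetical.

```python
from collections import defaultdict


def build_view(readings):
    """Aggregate (sensor_id, value) readings into a per-sensor count and total."""
    view = defaultdict(lambda: {"count": 0, "total": 0.0})
    for sensor_id, value in readings:
        view[sensor_id]["count"] += 1
        view[sensor_id]["total"] += value
    return dict(view)


def query_average(sensor_id, batch_view, realtime_view):
    """Merge both views at query time and return the sensor's mean reading.

    Returns None when neither view has data for the sensor.
    """
    empty = {"count": 0, "total": 0.0}
    batch = batch_view.get(sensor_id, empty)
    recent = realtime_view.get(sensor_id, empty)
    count = batch["count"] + recent["count"]
    if count == 0:
        return None
    return (batch["total"] + recent["total"]) / count


# Cold data: older readings, aggregated by a periodic batch job (assumed sample values).
batch_view = build_view([("s1", 20.0), ("s1", 22.0), ("s2", 5.0)])
# Hot data: readings that arrived after the last batch run (assumed sample values).
realtime_view = build_view([("s1", 24.0)])

print(query_average("s1", batch_view, realtime_view))
```

Storing counts and totals rather than precomputed averages is what makes the two views mergeable with simple addition. An average alone could not be combined correctly across views.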

Fabrication Scheduling of Products with Common and Unique Components at a Single Facility

  • 윤상흠
    • 한국경영과학회지
    • /
    • Vol. 28, No. 4
    • /
    • pp.105-114
    • /
    • 2003
  • This paper considers a single-facility scheduling problem for the fabrication of components that support subsequent assembly into products. Each product consists of two types of components: one common to all products and the other unique to the product. The unique components are processed individually and the common components are processed in batches, with a distinct setup incurred for the production of each batch of common components. Under the so-called “batch availability” environment, we propose an efficient dynamic programming (DP) procedure that finds the optimal sequencing and batching schedule minimizing the total completion time of the products, where a product is considered complete when both its common and unique components are finished. Computational results show that the proposed DP procedure is more efficient than previous work on the same problem.
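
To make the objective in this abstract concrete, the following Python sketch evaluates the total completion time of one given schedule under batch availability. It is only an evaluator, not the DP procedure proposed in the paper. The schedule encoding, the function name `total_completion_time`, the sample processing times, and the assumption of an identical setup time for every batch are all hypothetical.

```python
def total_completion_time(schedule, unique_time, common_time, setup_time):
    """Sum of product completion times for a schedule on a single facility.

    `schedule` is a list of operations, each either ("unique", product) or
    ("batch", [products]). Under batch availability, every common component
    in a batch becomes available only when the whole batch has finished.
    """
    clock = 0
    unique_done = {}
    common_done = {}
    for kind, payload in schedule:
        if kind == "unique":
            clock += unique_time[payload]
            unique_done[payload] = clock
        elif kind == "batch":
            # One setup per batch, then all common components in the batch.
            clock += setup_time + sum(common_time[p] for p in payload)
            for product in payload:
                common_done[product] = clock
        else:
            raise ValueError(f"unknown operation kind: {kind!r}")
    # A product is complete once both of its components are finished.
    return sum(max(unique_done[p], common_done[p]) for p in unique_time)


unique_time = {"A": 2, "B": 3}   # assumed processing times of unique components
common_time = {"A": 1, "B": 1}   # assumed processing times of common components

one_batch = [("batch", ["A", "B"]), ("unique", "A"), ("unique", "B")]
two_batches = [("batch", ["A"]), ("unique", "A"), ("batch", ["B"]), ("unique", "B")]

print(total_completion_time(one_batch, unique_time, common_time, setup_time=2))
print(total_completion_time(two_batches, unique_time, common_time, setup_time=2))
```

With these sample numbers the single-batch schedule saves one setup and finishes with a smaller total than the two-batch schedule. Evaluating schedules one at a time in this way is the brute-force baseline that a DP over sequencing and batching decisions avoids.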

An Approximation Algorithm for 2-batch Consolidation with Small Items

  • 명영수
    • 한국경영과학회지
    • /
    • Vol. 38, No. 1
    • /
    • pp.61-67
    • /
    • 2013
  • We consider the problem of grouping orders for batch processing, which arises in production systems where customer orders are processed in batches. This problem can be viewed as a variant of the bin packing problem in which items can be split and a pair of items can be placed in a bin when the items are compatible with each other. In this paper, we consider the special case in which at most two different items can be placed in a single bin and the size of every item is at most the size of a bin.

Design and Implementation of PHDItemReader to Speed up Data Query in Batch Application for Processing Personal Health Record

  • 전동철;황희정
    • 한국멀티미디어학회논문지
    • /
    • Vol. 23, No. 12
    • /
    • pp.1496-1506
    • /
    • 2020
  • With the miniaturization and improving performance of various sensors, large amounts of data are generated in many fields and collected in real time, but the use of such large-volume data often fails to keep pace with collection technology. In the medical field, health data is collected and managed separately by each platform, which inconveniences users when they search for their own health data and receive medical services. To solve these problems, this paper designs and implements PHDItemReader, which improves data query speed in a batch application environment that can integrate and process health data in various representation formats. The experiment compared and analyzed three types of query speed on 1,000,000 hypothetical health records, and the results verified that the PHDItemReader implemented in this paper improved query speed by up to about 21% compared with the existing one.