• Title/Summary/Keyword: Large Scale Data

Search Result 2,796, Processing Time 0.035 seconds

Measurement Error Variance Estimation Based on Complex Survey Data with Subsample Re-Measurements

  • Heo, Sunyeong;Eltinge, John L.
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.553-566
    • /
    • 2003
  • In many cases, the measurement error variances may be functions of the unknown true values or related covariates. This paper considers design-based estimators of the parameters of these variance functions based on the within-unit sample variances. This paper devotes to: (1) define an error scale factor $\delta$; (2) develop estimators of the parameters of the linear measurement error variance function of the true values under large-sample and small-error conditions; (3) use propensity methods to adjust survey weights to account for possible selection effects at the replicate level. The proposed methods are applied to medical examination data from the U.S. Third National Health and Nutrition Examination Survey (NHANES III).

Provenance and Validation from the Humanities to Automatic Acquisition of Semantic Knowledge and Machine Reading for News and Historical Sources Indexing/Summary

  • NANETTI, Andrea;LIN, Chin-Yew;CHEONG, Siew Ann
    • Asian review of World Histories
    • /
    • v.4 no.1
    • /
    • pp.125-132
    • /
    • 2016
  • This paper, as a conlcusion to this special issue, presents the future work that is being carried out at NTU Singapore in collaboration with Microsoft Research and Microsoft Azure for Research. For our research team the real frontier research in world histories starts when we want to use computers to structure historical information, model historical narratives, simulate theoretical large scale hypotheses, and incent world historians to use virtual assistants and/or engage them in teamwork using social media and/or seduce them with immersive spaces to provide new learning and sharing environments, in which new things can emerge and happen: "You do not know which will be the next idea. Just repeating the same things is not enough" (Carlo Rubbia, 1984 Nobel Price in Physics, at Nanyang Technological University on January 19, 2016).

The Measurement of Real Deformation Behavior in Pilot LNG Storage Tank Membrane by using Strain Gage (스트레인 게이지를 이용한 Pilot LNG 저장탱크 멤브레인 실 변형 거동 측정)

  • Kim, Young-Kyun;Yoon, Ihn-Soo;Oh, Byoung-Taek;Hong, Seong-Ho;Yang, Young-Myung
    • Proceedings of the KSME Conference
    • /
    • 2004.11a
    • /
    • pp.108-113
    • /
    • 2004
  • Korea Gas Corp. has developed the design technology of the LNG storage tank. The membrane to be applied inside of the LNG storage tank is provided with corrugations to absorb thermal contraction and expansion caused by LNG temperature changes. It is very important to measure their thermal strains under LNG temperatures by analytical and experimental stress analysis of the membrane. We have developed a stress measurement system using strain gages and measured the strain during cooldown and storing the LNG. We also analyzed the measured data by comparison with the FEM data. On the basis of these results, we could design and assure the application of the Kogas Membrane to large scale LNG storage.

  • PDF

Enhancement of Displacement Resolution of Vibration Data Measured by using Camera Images (카메라 영상을 이용한 진동변위 측정 시 측정해상도 향상 기법)

  • Son, Ki-Sung;Jeon, Hyeong-Seop;Han, Soon Woo;Park, Jong Won
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.24 no.9
    • /
    • pp.716-723
    • /
    • 2014
  • Vibration measurement using image processing is a fully non-contact measurement method and has many application fields. The resolution of vibration data measured by image processing depends on the camera performance and is lower than that measured by accelerometers. This work discusses the method to increase resolution of vibration signal measured by image processing based on the image mosaic technique with a high-power lens. The working principle of resolution enhancement was explained theoretically and verified by several experiments. It was shown that the proposed method can measure vibrations of relatively large scale structures with increased resolutions.

Development of LPAKO : Software of Simplex Method for Liner Programming (단체법 프로그램 LPAKO 개발에 관한 연구)

  • 박순달;김우제;박찬규;임성묵
    • Korean Management Science Review
    • /
    • v.15 no.1
    • /
    • pp.49-62
    • /
    • 1998
  • The purpose of this paper is to develope a large-scale simplex method program LPAKO. Various up-to-date techniques are argued and implemented. In LPAKO, basis matrices are stored in a LU factorized form, and Reid's method is used to update LU maintaining high sparsity and numerical stability, and further Markowitz's ordering is used in factorizing a basis matrix into a sparse LU form. As the data structures of basis matrix, Gustavson's data structure and row-column linked list structure are considered. The various criteria for reinversion are also discussed. The dynamic steepest-edge simplex algorithm is used for selection of an entering variable, and a new variation of the MINOS' perturbation technique is suggested for the resolution of degeneracy. Many preprocessing and scaling techniques are implemented. In addition, a new, effective initial basis construction method are suggested, and the criteria for optimality and infeasibility are suggested respectively. Finally, LPAKO is compared with MINOS by test results.

  • PDF

Attitude Estimation of an Aircraft using Image Data (영상데이타를 이용한 항공기 자세각 추정)

  • Park, Sung-Su
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.19 no.4
    • /
    • pp.44-50
    • /
    • 2011
  • This paper presents the algorithm for attitude determination of an aircraft using binary image. An image feature vector, which is invariant to translation, scale and rotation, is constructed to capture the functional relations between the feature vector and the corresponding aircraft attitude. An iterated least squares method is suggested for estimating the attitude of given aircraft using the constructed feature vector library. Simulation results show that the proposed algorithm yields good estimates of aircraft attitude in most viewing range, although a relatively large error occurs in some limited viewing direction.

Scaling Reuse Detection in the Web through Two-way Boosting with Signatures and LSH

  • Kim, Jong Wook
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.6
    • /
    • pp.735-745
    • /
    • 2013
  • The emergence of Web 2.0 technologies, such as blogs and wiki, enable even naive users to easily create and share content on the Web using freely available content sharing tools. Wide availability of almost free data and promiscuous sharing of content through social networking platforms created a content borrowing phenomenon, where the same content appears (in many cases in the form of extensive quotations) in different outlets. An immediate side effect of this phenomenon is that identifying which content is re-used by whom is becoming a critical tool in social network analysis, including expert identification and analysis of information flow. Internet-scale reuse detection, however, poses extremely challenging scalability issues: considering the large size of user created data on the web, it is essential that the techniques developed for content-reuse detection should be fast and scalable. Thus, in this paper, we propose a $qSign_{lsh}$ algorithm, a mechanism for identifying multi-sentence content reuse among documents by efficiently combining sentence-level evidences. The experiment results show that $qSign_{lsh}$ significantly improves the reuse detection speed and provides high recall.

Development for Systems Engineering Framework Model of Next Generation High Speed Railway Train (차세대 고속전철 시스템 엔지니어링 체계 모델 개발)

  • 유일상;박영원
    • Journal of the Korean Society for Railway
    • /
    • v.4 no.4
    • /
    • pp.147-154
    • /
    • 2001
  • A high-speed rail system represents a typical example of large-scale multi-disciplinary systems, consisting of subsystems such as train, electrical hardware, electronics, control, information, communication, civil technology etc. The Systems Engineering, as a methodology for engineering and management of today's ever-growing complex system, must be applied to development of such systems. This paper presents systems engineering framework model to have to be applied to the systems engineering technology development task for the korean next-generation high-speed railway systems in progress and demonstrates data models and schema for computer-aided systems engineering software, RDD-100, for use in its development and management.

  • PDF

A critrion for identification of the mixture normal distribution (정규 분포의 혼합성 판단기준)

  • 홍종선;최병수;엄종석
    • The Korean Journal of Applied Statistics
    • /
    • v.7 no.1
    • /
    • pp.131-140
    • /
    • 1994
  • In order to find the identification function from data and to apply the identification function for the pattern classification, we consider the existing problem of the number of patterns in such data. In this paper, a new criteria for the identification of Gaussaian mixture distribution could be established as a charateristic of the sample variance, which is a bootstrap estimate of the sample variance. We examine the properties and fittness of the criteria through a large scale of computer simulations.

  • PDF

Deep Learning Model Parallelism (딥러닝 모델 병렬 처리)

  • Park, Y.M.;Ahn, S.Y.;Lim, E.J.;Choi, Y.S.;Woo, Y.C.;Choi, W.
    • Electronics and Telecommunications Trends
    • /
    • v.33 no.4
    • /
    • pp.1-13
    • /
    • 2018
  • Deep learning (DL) models have been widely applied to AI applications such image recognition and language translation with big data. Recently, DL models have becomes larger and more complicated, and have merged together. For the accelerated training of a large-scale deep learning model, model parallelism that partitions the model parameters for non-shared parallel access and updates across multiple machines was provided by a few distributed deep learning frameworks. Model parallelism as a training acceleration method, however, is not as commonly used as data parallelism owing to the difficulty of efficient model parallelism. This paper provides a comprehensive survey of the state of the art in model parallelism by comparing the implementation technologies in several deep learning frameworks that support model parallelism, and suggests a future research directions for improving model parallelism technology.