• Title/Summary/Keyword: web statistics (웹 통계)


Mode effects in concurrent mixed-mode surveys (병행적 혼합조사의 모드효과 분석)

  • Baek, Jeeseon;Min, Kyung A
    • The Korean Journal of Applied Statistics / v.29 no.5 / pp.787-806 / 2016
  • Mixed-mode (MM) designs, in which data are collected through different modes within a single design, have become increasingly popular. MM data collection has several advantages, such as reduced coverage error, non-response, and cost. However, MM designs may introduce mode effects, in which selection effects and measurement effects are confounded, and these can degrade MM data quality. To investigate mode effects, SRI implemented a concurrent mixed-mode experiment in 2014 in which respondents could choose between a self-administered Web survey and a self-administered paper survey. This paper estimates selection effects and measurement effects separately. We found that measurement effects on some items are large.

Statistical Metadata for Users: A Case Study on the Level of Metadata Provision on Statistical Agency Websites (웹 이용자를 위한 통계 메타데이터: 통계정보 제공사이트의 메타데이터 제공 수준 평가 사례 연구)

  • Oh, Jung-Sun
    • Journal of the Korean Society for Information Management / v.24 no.2 / pp.161-179 / 2007
  • As increasingly diverse kinds of information materials become available on the Internet, it is a challenge to define an adequate level of metadata provision for each type of material in the context of digital libraries. This study explores issues of metadata provision for one particular type of material, statistical tables. Statistical data always involve numbers and numeric values that should be interpreted with an understanding of the underlying concepts and constructs. Because of these unique data characteristics, metadata in the statistical domain is essential not only for finding and discovering relevant data, but also for understanding and using the data found. However, statistical metadata research has put more emphasis on what metadata is necessary for processing the data and less on what metadata should be presented to users. In this study, a case study was conducted to gauge the status of metadata provision for statistical tables on the Internet. The websites of two federal statistical agencies in the United States were selected, and a content analysis method was used for that purpose. The results, which show insufficient and inconsistent provision of metadata, demonstrate the need for more discussion of statistical metadata from the perspective of ordinary web users.

Traffic Analysis of Statistics based on Internet Application Services (인터넷 응용 서비스의 통계에 근거한 트래픽 분석)

  • 정태수;최진섭;정중수;김정태;김대영
    • Journal of the Korea Institute of Information and Communication Engineering / v.8 no.5 / pp.995-1003 / 2004
  • With the development of the Internet backbone, a large number of Internet application services are now in use. Well-known services such as WWW, ]n, and email were provided first. Since then, a tremendous number of services on non-well-known ports have appeared in response to demand for varied content. Analyzing the PDU information of packets that travel over the Internet on non-well-known ports, and identifying the Internet service type and its statistics, gives Internet traffic analysts very useful information. This paper presents a mechanism for extracting the Internet application services that run on well-known or non-well-known UDP or TCP ports that are used occasionally, using NetFlow and the tcpdump method introduced by Ethereal, together with the operation scheme of each service. To obtain detailed statistics for the analyzed application services, it also introduces an agent and server environment in which the agent gathers raw traffic data and the server applies a BNF (Backus-Naur Form) method to the traffic received from the agent. The presented mechanism is applied to the LAN of Andong National University, and the resulting Internet traffic service types and detailed statistics of the analyzed application services are presented as useful information for Internet traffic analysts.
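The port-based part of such an analysis can be illustrated with a minimal Python sketch. It is not the authors' agent and server system and it does not inspect PDU contents. It only tallies bytes per service from flow records, assuming a hypothetical record layout of (protocol, source port, destination port, bytes) and a small hand-written table of well-known ports. Anything not in the table is reported by protocol and port so it can be examined further.

```python
from collections import defaultdict

# Small illustrative table of well-known ports (not exhaustive).
WELL_KNOWN = {
    ("tcp", 80): "www",
    ("tcp", 21): "ftp",
    ("tcp", 25): "email (smtp)",
    ("udp", 53): "dns",
}

def traffic_by_service(flows):
    """Sum bytes per service; flows are (protocol, src_port, dst_port, bytes)."""
    totals = defaultdict(int)
    for proto, src_port, dst_port, nbytes in flows:
        # Either end of the flow may be the server side, so check both ports.
        service = (
            WELL_KNOWN.get((proto, dst_port))
            or WELL_KNOWN.get((proto, src_port))
            or f"unknown {proto}/{min(src_port, dst_port)}"
        )
        totals[service] += nbytes
    return dict(totals)

# Made-up flow records, for illustration only.
sample_flows = [
    ("tcp", 51000, 80, 1200),
    ("tcp", 80, 51000, 8000),
    ("udp", 40000, 53, 90),
    ("tcp", 50001, 6881, 500),
]
print(traffic_by_service(sample_flows))
```

Labeling an unknown flow by its lower port number is a simplifying assumption of this sketch. The abstract describes going further and analyzing packet contents for such ports.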

Development of Web based Diagnosis Evaluation System for Slow-learning Students in Elementary School Mathematics (수학과 학습 부진아를 위한 웹기반 진단평가 시스템의 개발 및 적용)

  • Lee, Jong-Bae;Han, Kyu-Jung
    • Journal of The Korean Association of Information Education / v.12 no.3 / pp.275-282 / 2008
  • A learner who fails to complete previous courses before moving on to the next level will have difficulty keeping up. Individual support for students who struggle to cope with their classes is an issue that needs urgent attention and cannot be left to teachers alone, so this study was conducted to seek a solution. The study developed and applied a web-based diagnostic system that identifies students' areas of deficit and then helps eliminate accumulated learning deficits to support their study. The subjects were ten students from the school where the researcher works, and the area assessed with the system was numbers and calculations. The results showed that the system was effective in raising the students' ability and interest in learning, and the effect was statistically significant according to ANOVA.


A study on unstructured text mining algorithm through R programming based on data dictionary (Data Dictionary 기반의 R Programming을 통한 비정형 Text Mining Algorithm 연구)

  • Lee, Jong Hwa;Lee, Hyun-Kyu
    • Journal of Korea Society of Industrial Information Systems / v.20 no.2 / pp.113-124 / 2015
  • Unlike structured data, which are gathered and saved in a predefined structure, unstructured text data, mostly written in natural language, have recently found wider application owing to the emergence of Web 2.0. Text mining, which extracts meaningful information from text, is one of the most important big data analysis techniques, because the amount of text data has grown and because text expresses human emotion directly. In this study, we used R, open source software for statistical analysis, and studied the implementation of algorithms for analyses such as frequency analysis, cluster analysis, word clouds, and social network analysis. In particular, to keep the analysis within our research scope, we used a keyword extraction method based on a data dictionary. By applying the approach to real cases, we found that R is very useful as statistical analysis software that runs on a variety of operating systems and interfaces with other languages.
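Dictionary-based keyword extraction followed by frequency analysis can be sketched briefly. The study itself used R. The version below is written in Python purely for illustration and is not the authors' algorithm. The function name, the sample documents, and the dictionary are all hypothetical, and the tokenizer handles only lowercase ASCII words, so Korean text would need a proper morphological analyzer instead.

```python
import re
from collections import Counter

def dictionary_keyword_counts(documents, dictionary):
    """Count only the tokens that appear in a predefined keyword dictionary."""
    keywords = {word.lower() for word in dictionary}
    counts = Counter()
    for text in documents:
        # Naive tokenizer: runs of ASCII letters and digits, lowercased.
        tokens = re.findall(r"[a-z0-9]+", text.lower())
        counts.update(token for token in tokens if token in keywords)
    return counts

# Made-up documents and dictionary, for illustration only.
docs = [
    "R is useful for text mining.",
    "Text mining with R and a word cloud.",
]
counts = dictionary_keyword_counts(docs, {"text", "mining", "cloud"})
print(counts.most_common())
```

Restricting the count to dictionary terms keeps the frequency table focused on the research scope, which is the role the abstract gives to the data dictionary. The resulting counts could then feed a word cloud or a cluster analysis.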

Rank-Size Distribution with Web Document Frequency of City Name : Case study with U.S incorporated places of 100,000 or more population (인터넷 문서빈도를 통해 본 도시순위규모에 관한 연구 -미국 10만 이상의 인구를 갖는 도시들을 사례로-)

  • Hong, Il-Young
    • Journal of the Korean Association of Regional Geographers / v.13 no.3 / pp.290-300 / 2007
  • In this study, the web document frequency of city place names is analyzed and used as the dataset for rank-size analysis. The search keywords are compared in terms of their spatial meaning, and different domain corpora are applied. The search results are then used for further analysis. First, rank-size analysis is applied to compare the results for population and document frequency. Second, correlation analysis reveals significant changes when the spatial criteria in the search keywords are increased. Among the corpora, COM, NET, and ORG show the higher coefficient values. Lastly, cluster analysis is applied to classify the cities by their similarities and differences. These analyses play a significant role in representing the rank-size distribution of city names as reflected in web documents in the information society.
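A rank-size analysis is commonly carried out by fitting a straight line to log(size) against log(rank). The following Python sketch shows that fit with ordinary least squares. It is a generic illustration, not the procedure used in the paper. The function name and sample values are hypothetical, and the input is assumed to contain at least two strictly positive sizes.

```python
import math

def fit_rank_size(sizes):
    """Least-squares fit of log(size) = a - q * log(rank); returns (q, a)."""
    ranked = sorted(sizes, reverse=True)          # rank 1 is the largest
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(size) for size in ranked]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return -slope, mean_y - slope * mean_x

# Synthetic sizes that follow size = 1000 / rank exactly (made-up data).
synthetic = [1000 / rank for rank in range(1, 11)]
q, a = fit_rank_size(synthetic)
print(q, math.exp(a))
```

Running the same fit once on city populations and once on web document frequencies would give two exponents q that can be compared, which is the kind of comparison the abstract describes.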


Terminology Recognition System based on Machine Learning for Scientific Document Analysis (과학 기술 문헌 분석을 위한 기계학습 기반 범용 전문용어 인식 시스템)

  • Choi, Yun-Soo;Song, Sa-Kwang;Chun, Hong-Woo;Jeong, Chang-Hoo;Choi, Sung-Pil
    • The KIPS Transactions: Part D / v.18D no.5 / pp.329-338 / 2011
  • Terminology recognition, a preliminary step for text mining, information extraction, information retrieval, the semantic web, and question answering, has been studied intensively in a limited range of domains, especially the biomedical domain. Because previous work relied on domain-specific resources, its approaches are difficult to apply to general domains; we therefore propose a domain-independent terminology recognition system based on machine learning that uses a dictionary, syntactic features, and Web search results. The proposed approach achieved an F-score of 80.8, a 6.5% improvement over the related approach, C-value, which has been widely used and is based on local domain frequencies. In a second experiment with various combinations of unithood features, the method combined with NGD (Normalized Google Distance) showed the best performance, an F-score of 81.8. We applied three machine learning methods, logistic regression, C4.5, and SVMs, and obtained the best score from the decision tree method, C4.5.
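For reference, the Normalized Google Distance between two terms x and y is usually defined from search hit counts as (max(log f(x), log f(y)) - log f(x, y)) / (log N - min(log f(x), log f(y))), where f is the number of pages containing the term or terms and N is the total number of indexed pages. The Python sketch below computes that quantity. How the paper turns NGD into a unithood feature is not stated in the abstract, so the function name, the handling of zero counts, and the sample numbers are assumptions made for illustration.

```python
import math

def normalized_google_distance(fx, fy, fxy, n):
    """NGD from hit counts: fx and fy per term, fxy for both, n = index size."""
    if fx <= 0 or fy <= 0 or fxy <= 0:
        # Assumption: terms that never (co-)occur are treated as maximally distant.
        return math.inf
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))

# Made-up hit counts, for illustration only.
same = normalized_google_distance(1000, 1000, 1000, 10**6)   # always co-occur
apart = normalized_google_distance(1000, 1000, 10, 10**6)    # rarely co-occur
print(same, apart)
```

A small distance between the component words of a candidate term suggests that they tend to appear together, which is why a measure of this kind can serve as evidence that the words form a single unit.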

Web-based microarray analysis using the virtual chip viewer and bioconductor. (MicroArray의 직관적 시각적 분석을 위한 웹 기반 분석 도구)

  • Lee, Seung-Won;Park, Jun-Hyung;Kim, Hyun-Jin;Kang, Byeong-Chul;Park, Hee-Kyung;Kim, In-Ju;Kim, Cheol-Min
    • Proceedings of the Korea Intelligent Information System Society Conference / 2005.05a / pp.198-201 / 2005
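  • DNA microarray chips are widely used in areas such as new drug development, the diagnosis of genetic diseases, research on biomolecular interactions, and studies of gene function. This paper describes the development of a web-based system for analyzing cDNA microarray data. A single cDNA microarray carries from hundreds to tens of thousands of genes, and because of the large volume of data and the many kinds of error involved, analysis requires tools and statistical techniques that correct for differences between data sets. In this paper, a virtual chip viewer removes the background intensity from the foreground intensity of real microarray data to produce a generalized chip image. The viewer applies several filter effects and a global normalization technique that adjusts for the difference between the two fluorescent dyes, so that expressed genes can be analyzed visually, and it uses replicated microarray chip data to check chip validity before time-consuming analysis. As the statistical method for normalizing the chip data, the R statistical tool and a linear model are used to analyze the gene expression patterns of the microarray chips. Data extracted without statistical processing, pattern graphs of those data, and classified expression levels were used to improve the accuracy of the validity check for each microarray spot. The system improved the ease and accuracy of analysis for chip validation, spot validation, and gene selection.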
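The two preprocessing steps named in the abstract, background removal and global normalization between the two dyes, can be sketched in a few lines. The paper used R and Bioconductor. The Python version below is only an illustration under stated assumptions: the two channels are called red and green, global normalization is taken to mean centering the log2 ratios on their median (one common form of it), and spots whose background-corrected signal is not positive are skipped. None of these choices is confirmed by the abstract.

```python
import math
from statistics import median

def normalized_log_ratios(red_fg, red_bg, green_fg, green_bg):
    """Background-correct two channels, then median-center the log2 ratios."""
    ratios = []
    for rf, rb, gf, gb in zip(red_fg, red_bg, green_fg, green_bg):
        r, g = rf - rb, gf - gb           # foreground minus background
        if r > 0 and g > 0:
            ratios.append(math.log2(r / g))
        else:
            ratios.append(None)           # no usable signal for this spot
    valid = [m for m in ratios if m is not None]
    center = median(valid) if valid else 0.0
    return [None if m is None else m - center for m in ratios]

# Made-up spot intensities, for illustration only.
result = normalized_log_ratios(
    red_fg=[300, 500, 1700, 50],
    red_bg=[100, 100, 100, 100],
    green_fg=[200, 200, 200, 400],
    green_bg=[100, 100, 100, 100],
)
print(result)
```

Centering on the median assumes that most genes are not differentially expressed, so that a systematic offset between the dyes shows up as a shift of the whole distribution of log ratios.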


Implementation of a Real-time Data Display System for a Catchment Scale Automated Weather Observation Network (집수역 규모 무인기상관측망을 위한 실황자료 표출시스템 구축)

  • Jung, Myung Ryong;Kim, Jin-Hee;Moon, Young Eel;Yun, Jin I.
    • Korean Journal of Agricultural and Forest Meteorology / v.15 no.4 / pp.304-311 / 2013
  • Farmers are increasingly installing automated weather stations (AWS) at their farms and orchards to take countermeasures against the more frequent weather disasters caused by climate variability and weather extremes. Although the raw data are the same, their added value as agrometeorological information may vary depending on the data processing method. User demand for appropriate information can also differ among crop species, cropping systems, and even cultivars. We designed an Internet-based AWS data processing and display system to help diverse users (e.g., farmers and extension workers) access their weather data according to their specific demands. The system was implemented in a rural catchment with a land area of 52 $km^2$ where 14 automated weather stations are in operation. This note introduces the system and describes its major modules in detail. The feasibility of using the system as an early warning system by linking regional AWS networks is also discussed.

Effect of PMIS Quality on Intention to Use and User Satisfaction (건설 PMIS 품질이 사용의도 및 사용자 만족도에 미치는 영향)

  • Sung, Min-Woo;Kim, Ka-Ram;Lee, Seul-Ki;Yu, Jung-Ho
    • Journal of the Korea Institute of Building Construction / v.12 no.1 / pp.122-132 / 2012
  • Establishing a success model of a specific information system (IS) is critical to understanding the mechanism of IS success, the various dimensions of IS performance, and the factors in IS success and their causal relations. As one of the key IT applications, the project management information system (PMIS), particularly the web-based PMIS (Web-PMIS), has played a significant role in construction management processes in Korea. However, few research attempts have been made to construct a Web-PMIS success model. This study primarily aims to propose a Web-PMIS success model based on DeLone and McLean's (D&M) IS success model, and to discuss whether the D&M IS success model can be applied to the construction Web-PMIS. A questionnaire was sent to Web-PMIS users (construction managers and constructors), and 253 completed questionnaires were received. Multiple regression analysis confirmed that it is statistically acceptable to apply the D&M IS success model to the Web-PMIS. However, the explanatory power of the model is not sufficient, and some of the model factors are not statistically significant. Based on the statistical analysis results, this study also discusses the direction for developing the Web-PMIS success model.