• Title/Summary/Keyword: Data Cleaning

Clustering of Smart Meter Big Data Based on KNIME Analytic Platform (KNIME 분석 플랫폼 기반 스마트 미터 빅 데이터 클러스터링)

  • Kim, Yong-Gil;Moon, Kyung-Il
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.20 no.2 / pp.13-20 / 2020
  • One of the major issues surrounding big data is the availability of massive time-based or telemetry data. The emergence of low-cost capture and storage devices has made it possible to obtain very detailed time data for further analysis. These time data can be used to gain more knowledge about the underlying system or to predict future events with higher accuracy. In particular, it is very important to define custom-tailored contract offers for the many households and businesses that have smart meter records, and to predict future electricity usage to protect electricity companies from power shortage or surplus. To make customized contract offers worthwhile, a few groups with common electricity behavior must be identified. This study proposes a clustering technique, with big data transformation as a side effect, for understanding electricity usage patterns using open smart meter data and KNIME, an open source platform for data analytics that provides a user-friendly graphical workbench for the entire analysis process. While the big data components are not open source, they are available for trial if required. After importing, cleaning, and transforming the smart meter big data, each meter's data can be interpreted in terms of electricity usage behavior through a dynamic time warping method.
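
As a rough illustration of the dynamic time warping step mentioned in the abstract, the following minimal sketch computes the textbook DTW distance between two usage series. It is not the paper's KNIME workflow; the function name `dtw_distance` and the absolute-difference local cost are assumptions made for this example.

```python
from math import inf


def dtw_distance(a, b):
    """Textbook dynamic-programming DTW distance between two numeric sequences.

    Assumption: the local cost is the absolute difference between samples.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] repeats a match against b[j-1]
                cost[i][j - 1],      # b[j-1] repeats a match against a[i-1]
                cost[i - 1][j - 1],  # both series advance together
            )
    return cost[n][m]
```

Two series that differ only in timing, such as `[1, 2, 3]` and `[1, 2, 2, 3]`, get a distance of zero, which is why DTW suits load curves whose peaks are shifted in time. The resulting pairwise distances could then feed any distance-based clustering method.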

Implementation of a Data Processing Method to Enhance the Quality and Support the What-If Analysis for Traffic History Data (교통이력 데이터의 품질 개선과 What-If 분석을 위한 자료처리 기법의 구현)

  • Lee, Min-Soo;Cheong, Su-Jeong;Choi, Ok-Ju;Meang, Bo-Yeon
    • The KIPS Transactions:PartD / v.17D no.2 / pp.87-102 / 2010
  • A vast amount of traffic data is produced every day by detection devices, but this data includes a considerable number of errors and missing values. Moreover, this information is periodically deleted before it can be used for analysis. Therefore, this paper discusses the implementation of an integrated traffic history database system that continuously stores the traffic data as a multidimensional model, increases the validity and completeness of the data via a flow of processing steps, and provides a what-if analysis function. The implemented system provides various techniques to correct errors and missing data patterns, and a what-if analysis function that enables the analysis of results under various conditions by allowing the flexible definition of various process-related environment variables and combinations of the processing flows. Such what-if analysis functions dramatically increase the usability of traffic data but are not provided by other traffic data systems. Experimental results for cleaning the traffic history data showed that the system provides superior performance in terms of validity and completeness.

The Dyeability Properties of Some Yellow Natural Dyes (Part II) - Extracted from Turmeric - (황색 천연염료의 염색성 (제2보) -울금을 중심으로-)

  • Jo, Seung-Sik;Song, Hwa-Sun;Kim, Byeong-Hui
    • Journal of the Korean Society of Clothing and Textiles / v.21 no.6 / pp.1051-1059 / 1997
  • The objectives of this study were to investigate the effects of mordants and dyeing on the dyeability and color fastness of fabrics dyed with the extract from Turmeric. The following results were drawn from the data obtained. 1. The wavelength of the strongest absorption band of the Turmeric extract was 400 nm, and it was 440 nm after the mordants were added to the color extract. The bands of the Turmeric extract shifted to the long-wavelength side as pH increased. In all cases, the absorbances increased as pH increased. 2. The main color substance in the extract from Turmeric was expected to be curcumin, based on spectrophotometric and HPLC studies. 3. As to the concentration of color extract for dyeing, about 20 g/L was the optimum concentration for dyeing silk and cotton fabrics with the extract. 4. The K/S values of dyed fabrics increased gradually as the concentration of mordants increased, and the highest K/S values were obtained at 5∼10%. Among the mordanting methods, premordanting for silk fabric and premordanting and synmordanting for cotton fabric influenced the K/S values. 5. The color fastness of fabrics dyed with Turmeric extract against dry cleaning, washing, rubbing, and perspiration improved by about 1 level, but light fastness remained unchanged.
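
The abstract does not say how its K/S values were obtained. K/S is conventionally derived from measured reflectance with the Kubelka-Munk relation, K/S = (1 - R)^2 / (2R), and the sketch below assumes that convention. The function name `k_over_s` and the fractional reflectance input are choices made for this example only.

```python
def k_over_s(reflectance):
    """Kubelka-Munk color strength K/S from reflectance R given as a fraction.

    Assumption: R is measured at the wavelength of maximum absorption and
    lies in the interval (0, 1].
    """
    if not 0 < reflectance <= 1:
        raise ValueError("reflectance must be in the interval (0, 1]")
    return (1 - reflectance) ** 2 / (2 * reflectance)
```

Lower reflectance gives a higher K/S, so a deeper shade on the dyed fabric corresponds to a larger value.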

A Comparison of Time Use between Korean and the USA Families (한.미 양국간 가족의 시간사용 비교 연구)

  • 이연숙;이기영;김외숙;조희금;주인숙
    • Journal of Families and Better Life / v.20 no.3 / pp.139-156 / 2002
  • The purpose of this study is to compare the patterns of time use between Korean and USA families. Data for 353 Korean families with two children living in Seoul and 130 USA families with two children living in the State of Utah were collected using a structured questionnaire and time diary. The major findings were as follows: 1. The Korean couples spent more time on personal care, paid work, and travel than the USA couples did, while the USA couples spent more time on housework and social-cultural activities than the Korean couples did. 2. The Korean wives spent more time on food- and clothing-related housework than the USA wives did. Compared with the Korean wives, however, the USA wives spent more time on house cleaning and management, family care, and shopping and home management. The U.S. husbands spent much more time on housework than the Korean husbands did. 3. Regardless of sex and school level, the Korean children spent less time on sleeping/rest, housework, and socio-cultural activities and more time on eating and learning than the U.S. children did. These time use patterns of the families in both countries may reflect differences in cultural contexts, social norms, lifestyles, and degrees of urbanization. To fully explain the findings, further study on the differences in social and cultural factors between the two countries is needed.

The Development Measuring System of Temperature Effect to Produce Electric Power of Solar Cell

  • Sadmai, Ong-art
    • International journal of advanced smart convergence / v.4 no.1 / pp.104-113 / 2015
  • This paper focuses on the effects of temperature on a PV panel installed in Thailand. The main objective is to clean the PV panels and reduce their temperature by injecting water from a waterway, and to compare the resulting PV power experimentally. The system is built around a PLC control system that injects water and controls the PV temperature. It consists of hardware and software, including a water pump, water injection, and PLC control, and it can operate automatically or be controlled manually. The automatic control system starts working when the PV temperature rises above 45 degrees Celsius, at which point the pump injects water onto the surface of the PV panels, and it must stop when the PV panel temperature falls below 45 degrees Celsius. The experimental results show that the control system operated correctly under the specified conditions. The electrical data recorded before and after water injection show that the electrical power increases slightly, but the energy obtained from the PV panel is less than the energy consumed by the control system equipment that operates the water injection system.
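
The threshold rule described in the abstract can be sketched as a small state update. This is an illustration in Python, not the authors' PLC program. The names `next_pump_state` and `simulate` are hypothetical, and holding the previous state at exactly 45 degrees Celsius is an assumption, since the abstract specifies only the behavior above and below the threshold.

```python
THRESHOLD_C = 45.0  # threshold stated in the abstract, in degrees Celsius


def next_pump_state(running, panel_temp_c, threshold_c=THRESHOLD_C):
    """Return whether the water pump should run after a new temperature reading."""
    if panel_temp_c > threshold_c:
        return True   # panel too hot: inject water
    if panel_temp_c < threshold_c:
        return False  # panel cooled down: stop injecting
    return running    # exactly at the threshold: keep the previous state (assumption)


def simulate(temps_c):
    """Apply the rule to a sequence of readings, starting with the pump off."""
    running = False
    states = []
    for temp in temps_c:
        running = next_pump_state(running, temp)
        states.append(running)
    return states
```

For the readings `[40, 46, 45, 44, 45]`, `simulate` returns `[False, True, True, False, False]`: the pump starts at 46, keeps running at 45, stops at 44, and stays off when the temperature returns to 45.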

The Performance Analysis of a Counter-rotating Tubular Type Turbine with the Number of Runner Vane (러너베인 깃수의 변화에 따른 튜블러형 상반전 수차의 성능해석)

  • Park, Jihoon;Lee, Nakjoong;Hwang, Youngho;Kim, Youtaek;Lee, Youngho
    • 한국신재생에너지학회:학술대회논문집 / 2010.06a / pp.192.1-192.1 / 2010
  • Micro hydraulic turbines are attracting growing interest because of their small and simple structure and their high potential for application to micro and small hydropower resources. The differential pressure existing within city water pipelines can be used efficiently to generate electricity, like the energy generated from gravitational potential energy in dams. Pressure reducing valves are widely used to reduce water pressure at the inlet of water cleaning centers, so the pressure energy is wasted. Instead of a pressure reducing valve, a micro counter-rotating hydraulic turbine can be installed to recover energy from the large differential pressure found in city water pipelines. In this paper, detailed studies have been carried out to acquire basic design data for a micro counter-rotating hydraulic turbine, including output power, head, and efficiency characteristics for various numbers of runner vanes. Moreover, the influences of pressure and of tangential and axial velocity distributions on turbine performance are also investigated.

Rates for Handwashing Adherence Before and After Nursing Contact in Intensive Care Units (중환자실 간호사의 간호행위 전.후 손씻기 수행율 비교)

  • Kim, Young-Jung;Kim, Hee-Seung;Chang, Yun-Young
    • Journal of Korean Academy of Fundamentals of Nursing / v.18 no.2 / pp.195-200 / 2011
  • Purpose: The purpose of this study was to assess rates of handwashing adherence before and after nursing contact in intensive care units (ICU). Methods: The participants were 90 nurses working in the intensive care units of an 800-bed university-affiliated hospital in Gyeonggi Province and a 2000-bed university-affiliated hospital in Seoul. Time for handwashing was calculated using the average number of handwashings during an 8-hour day shift. Nursing contact was based on indications as defined by the Centers for Disease Control and Prevention (CDC, 2002). Data were analyzed using frequency, percent, t-test, and $\chi^2$-test. Results: During an 8-hour day shift, the average number of times that hands were washed was 25.0. The rates were significantly lower before nursing contact than after nursing contact when it involved sectioning, observation of or contact with a wound, cleaning an enteric feeding bag, physical exam, use of gloves, or contact with contaminants. Conclusions: Because handwashing rates were significantly lower before nursing contacts than after them, there is a need to develop strategies to address this deficiency in handwashing.

Indolent B-Cell Lymphoid Malignancy in the Spleen of a Man Who Handled Benzene: Splenic Marginal Zone Lymphoma

  • Lee, Jihye;Kang, Young Joong;Ahn, Jungho;Song, Seng-Ho
    • Safety and Health at Work / v.8 no.3 / pp.315-317 / 2017
  • We present the case of a 45-year-old man with a history of benzene exposure who developed splenic marginal zone lymphoma. For 6 years, he had worked in an enclosed space cleaning instruments with benzene. He was diagnosed with splenic marginal zone lymphoma 19 years after retirement. During his time of working in the laboratory in the 1980s, working environments were not monitored for hazardous materials. We indirectly estimated the cumulative level of past benzene exposure using job-exposure matrices and technical assumptions. Care must be taken in investigating the relevance of occupational benzene exposure in the occurrence of indolent B-cell lymphoma. Because of the long latency period and because occupational measurement data do not exist for the period during the patient's exposure, the epidemiological impact of benzene exposure may be underestimated.

A Study on the Development of Agricultural and Stockbreeding Products Information System Using IOT Based Connected System (IOT 기반 Connected System을 이용한 농축산물정보시스템 구축)

  • Lee, Sung-Ha;Park, Chul-Ju
    • Korean Journal of Artificial Intelligence / v.5 no.2 / pp.26-42 / 2017
  • This study recognized that prompt and accurate monitoring is limited when an accident occurs, and that correct information on the egg production stages, such as the dates of laying, cleaning, and refrigeration, cannot be identified, because barcode-based eggshell codes show only numbers identifying the city and province and the name of the producer. To address this problem, this study proposed, in part, RFID (Radio Frequency Identification) technology and an IoT-based Connected System. The proposed system shares data with related agencies, with the agricultural and livestock product information system running as the main server, and its database information is provided by farmhouses, distributors, and sellers. Through various media, such as a webpage or mobile application built to provide the relevant information, customers can search for and obtain information about the agricultural and livestock products they want. Since information on the entire process is open to the public, information ranging from basic details to additional items such as hazardous elements can be viewed.

The Changes of the Garbage Problem Importance through the Number of Articles, Column Headings and Contents of Dong A Ilbo (동아일보 기사 수, 단수, 내용을 통한 쓰레기 문제의 중요도 변천분석 : 1920-1990년사이)

  • 신경주
    • Journal of the Korean housing association / v.13 no.3 / pp.1-9 / 2002
  • Desolation of the earth due to environmental pollution is rising as a worldwide problem and concern. At this point we need to look into the problem and set a direction for the future. In order to reveal the change in garbage problems in our country's civil life, the researcher analyzed 369 garbage-related articles from the first edition of Dong A Ilbo up to 1990. The following results were drawn from the garbage-related articles, organized by year and era (10 years). 1) The number of articles by year rose in 1921, after the first publication of an article on the garbage problem. In the 1930s, the number of articles increased drastically in 1937. From then on, the number of articles declined until the early 1970s but rose again from 1978. 2) The yearly change in articles was a mere 1.2 columns between 1920 and 1960. In the 1970s, relative importance increased and over 5 columns were published. Articles rose in the 1980s with over 3.4 and 5 columns. 3) The contents of the articles can be classified into cleaning problems, collecting and transporting, expenses, and recycling. Garbage disposal problems continued until the 1970s. Regarding garbage collecting problems, the form and location of collecting containers were discussed. Laws were revised after garbage disposal areas were discussed in the 1920s. Expenses were levied from the 1930s, and rising costs and double charging became issues. Garbage recycling began in the 1920s and continued until the 1990s.