• Title/Summary/Keyword: 스몰데이터

Search Result 12, Processing Time 0.027 seconds

Constructing a Knowledge Graph for Improving Quality and Interlinking Basic Information of Cultural and Artistic Institutions (문화예술기관 기본정보의 품질개선과 연계를 위한 지식그래프 구축)

  • Euntaek Seon;Haklae Kim
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.4
    • /
    • pp.329-349
    • /
    • 2023
  • With the rapid development of information and communication technology, the speed of data production has increased rapidly, and this is represented by the concept of big data. Discussions on quality and reliability are also underway for big data whose data scale has rapidly increased in a short period of time. On the other hand, small data is minimal data of excellent quality and means data necessary for a specific problem situation. In the field of culture and arts, data of various types and topics exist, and research using big data technology is being conducted. However, research on whether basic information about culture and arts institutions is accurately provided and utilized is insufficient. The basic information of an institution can be an essential basis used in most big data analysis and becomes a starting point for identifying an institution. This study collected data dealing with the basic information of culture and arts institutions to define common metadata and constructed small data in the form of a knowledge graph linking institutions around common metadata. This can be a way to explore the types and characteristics of culture and arts institutions in an integrated way.

Processing Method of Mass Small File Using Hadoop Platform (하둡 플랫폼을 이용한 대량의 스몰파일 처리방법)

  • Kim, Chang-Bok;Chung, Jae-Pil
    • Journal of Advanced Navigation Technology
    • /
    • v.18 no.4
    • /
    • pp.401-408
    • /
    • 2014
  • Hadoop is composed with MapReduce programming model for distributed processing and HDFS distributed file system. Hadoop is suitable framework for big data processing, but processing of mass small files have many problems. The processing of mass small file in hadoop have problems to created one mapper per one file, and it have problems to needed many memory for store of meta information of file. This paper have comparison evaluation processing method of mass small file with various method in hadoop platform. The processing of general compression format is inadequate because of processing by one mapper regardless of data size. The processing of sequence and hadoop archive file is removed memory problem of namenode by compress and combine of small file. Hadoop archive file is faster then sequence file about combine time of small file. The processing using CombineFileInputFormat class is needed not combine of small file, and it have similar speed big data processing method.

A Study on Elementary Education Examples for Data Science using Entry (엔트리를 활용한 초등 데이터 과학 교육 사례 연구)

  • Hur, Kyeong
    • Journal of The Korean Association of Information Education
    • /
    • v.24 no.5
    • /
    • pp.473-481
    • /
    • 2020
  • Data science starts with small data analysis and includes machine learning and deep learning for big data analysis. Data science is a core area of artificial intelligence technology and should be systematically reflected in the school curriculum. For data science education, The Entry also provides a data analysis tool for elementary education. In a big data analysis, data samples are extracted and analysis results are interpreted through statistical guesses and judgments. In this paper, the big data analysis area that requires statistical knowledge is excluded from the elementary area, and data science education examples focusing on the elementary area are proposed. To this end, the general data science education stage was explained first, and the elementary data science education stage was newly proposed. After that, an example of comparing values of data variables and an example of analyzing correlations between data variables were proposed with public small data provided by Entry, according to the elementary data science education stage. By using these Entry data-analysis examples proposed in this paper, it is possible to provide data science convergence education in elementary school, with given data generated from various subjects. In addition, data science educational materials combined with text, audio and video recognition AI tools can be developed by using the Entry.

Load Balancing Scheme for Heterogeneous Cellular Networks Using e-ICIC (eICIC 가 적용된 이종 셀룰러 망을 위한 부하 분산 기법)

  • Hong, Myung-Hoon;Park, Seung-Young
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39A no.5
    • /
    • pp.280-292
    • /
    • 2014
  • Recently, heterogeneous networks consisting of small-cells on top of traditional macro-cellular network has attracted much attention, because traditional macro-cellular network is not suitable to support more demanding mobile data traffic due to its limitation of spatial reuse. However, due to the transmit power difference between macro- and small-cells, most users are associated with macro-cells rather than small-cells. To solve this problem, enhanced inter-cell interference coordination (eICIC) has been introduced. Particularly, in eICIC, the small-cell coverage is forcibly expanded to associate more users with small-cells. Then, to avoid cross-tier interference from macro-cells, these users are allowed to receive the data during almost blank subframe (ABS) in which macro-cells almost remain silent. However, this approach is not sufficient to balance the load between macro- and small-cells because it only expands the small-cell coverage. In this paper, we propose a load balance scheme improving proportional fairness for heterogeneous networks employing eICIC. In particular, the proposed scheme combines the greedy-based user association and the ABS rate determination in a recursive manner to perform the load balance.

사물인터넷 기반의 비즈니스 어프로치

  • Kim, Hakyong
    • Review of KIISC
    • /
    • v.25 no.2
    • /
    • pp.5-11
    • /
    • 2015
  • 최근 수 년 사이에 사물인터넷(Internet of Things)에 대한 학계 및 산업계의 관심이 높다. 그러나, 아직까지 사물인터넷 기반의 제품이나 서비스 혹은 관련 기술에 대한 깊이 있는 연구 결과를 찾는 것은 쉬운 일이 아니다. 사물인터넷이 새로운 기술을 의미하기 보다는 기존에 연구 개발된 기술들을 바탕으로 다양한 디바이스들을 유기적으로 연결함으로써 새로운 가치나 서비스를 만들어내는 개념이기 때문이다. 문제는 이처럼 새로운 가치나 서비스를 만드는 과정이 산업계 종사자들에게조차 친숙한 것이 아니며, 그러다 보니 그러한 과정에서 발생하는 기술적이고 학술적인 부분들에 대해서도 체계적으로 연구되지 못하고 있다는 것이다. 따라서, 본 논문에서는 사물인터넷 기반의 비즈니스 어프로치를 6 가지 측면에서 제안함으로써 사물인터넷 시장의 활성화와 그에 따른 관련 기술 및 학술적 연구의 단초를 제공하고자 한다. 6 가지 비즈니스 어프로치는 다음과 같다. (1) 애프터마켓형 제품을 출시하라. (2) 네트워크 효과를 이용하라. (3) 새로운 기능을 제공하기 보다는 구체적인 혜택을 제공하라. (4) 디바이스를 서비스와 연계하라. (5) 스몰데이터를 이용하라. 그리고 (6) 데이터를 합성하라. 마지막으로, 사물인터넷 개념을 바탕으로 서비스를 생성하는 과정에서 고려해야 할 사항들에 대해 소개하며 마무리하고자 한다.

Using Big Data and Small Data to Understand Linear Parks - Focused on the 606 Trail, USA and Gyeongchun Line Forest, Korea - (빅데이터와 스몰데이터로 본 선형공원 - 시카고 606 트레일과 서울 경춘선 숲길을 중심으로 -)

  • Sim, Ji-Soo;Oh, Chang Song
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.5
    • /
    • pp.28-41
    • /
    • 2020
  • This study selects two linear parks representing each culture and reveals the differences between them using a visitor survey as small data and social media analytics as big data based on the three components of the model of landscape perception. The 606 in Chicago, U.S., and the Gyeongchun Line in Seoul, Korea, are representative parks built on railroads. A total of 505 surveys were collected from these parks. The responses were analyzed using descriptive statistics, principal component analysis, and linear regression. Also, more than 20,000 tweets which mentioned two linear parks respectively were collected. By using those tweets, the authors conducted the clustering analysis and draw the bigram network diagram for identifying and comparing the placeness of each park. The result suggests that more diverse design concept links to less diversity in behavior; that half of the park users use the park as a shortcut; and that same physical exercise provides different benefits depending on the park. Social media analysis showed the 606 is more closely related to the neighborhoods rather than the Gyeongchun Line Forest. The Gyeongchun Line Forest was a more event-related place than the 606.

이동통신에서의 주파수 공동 사용

  • Yu, Heung-Ryeol
    • Information and Communications Magazine
    • /
    • v.31 no.11
    • /
    • pp.79-86
    • /
    • 2014
  • 지속되는 무선 데이터 트래픽 증가 추세의 대처 방안 중 새로운 주파수 발굴에는 많은 시간과 비용이 수반되어, 이를 해결하기 위한 방안으로 유럽 및 미국에서 주파수 공동 사용의 도입이 추진되고 있다. 주파수 공동 사용은 둘 이상의 주파수 이용자가 특정 주파수를 공동 사용 조건에 따라 사용하는 것을 의미하며, 지역, 시간에 따라 제한적으로 사용하고 있는 공공용 주파수 대역을 활용할 계획이다. 현재 유럽은 2.3 GHz 대역, 미국은 3.5 GHz 대역을 스몰 셀로 주파수 공동 활용하기 위해 제도 정비 및 표준화를 추진 중이다. 이러한 국제적인 활용 동향은 향후 해당 대역의 국내 활용 방안 정립에 참고가 될 수 있을 것으로 보인다.

Caching Strategy Adopting Delayed Offloading Scheme with User Mobility in Cellular Network (셀룰러 네트워크에서 딜레이드 오프로딩 스키마를 적용한 사용자 이동성 고려 캐싱 기법)

  • Choi, Yoonjeong;Lim, Yujin
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.83-86
    • /
    • 2021
  • 비디오 컨텐츠 사용이 증가하면서, 사용자가 요구한 파일을 제시간 안에 전달하는 문제가 중요해졌다. 사용자와 가까운 곳에 파일을 캐싱해 두고 필요할 때 다운받으면 파일을 보다 빨리 전달할 수 있는데 사용자가 움직일 경우 이동성을 고려해야 한다. 본 논문에서는 사용자의 이동 경로와 파일의 인기도를 함께 고려해 딜레이드 오프로딩(delayed offloading) 스키마를 적용한 환경에서 마이크로 기지국(micro base station, MBS)에서 다운받는 데이터 크기를 최소로 만들어 비용을 최소화 하는 캐싱 기법을 제안한다. 실험을 통해 타알고리즘에 비해 MBS 로부터 다운받는 양을 줄이고 스몰 셀 기지국(small cell base station, SBS)에서 다운받을 성공 확률을 높이는데 효과가 있다는 것을 보였다.

A Location-Aided Cooperative Transmission Method in Mobile Ad-hoc Wireless Sensor Networks (모바일 Ad-hoc 무선 센서 네트워크에서 위치도움 협력 전송 방법)

  • Son, Dong-Hwan;Lee, Joo-Sang;An, Beongku;Kong, Hyung-Yun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.8 no.2
    • /
    • pp.23-28
    • /
    • 2008
  • In this paper, we propose location-aided cooperative routing protocol (LACARP) for supporting power saving and stable route lifetime in mobile ad-hoc wireless sensor networks. The main ideas and features of the proposed routing protocol are as follows. First, the definition of the area of route search using location-based information to support power saving transmission. Second, the expect zone-based establishment of routing route within the area of route search. Third, the cooperative-aided transmission method. In the operation of data transmission over the established rout the datas are transmitted via both the established route and cooperative route aided by neighbor nodes. The performance evaluation using OPNET(Optimized Network Engineering Tool) shows the LACARP can improve the packet delivery ratio and power saving transmission efficiently.

  • PDF

Index management technique using Small block in storage device based on NAND flash memory

  • Lee, Seung-Woo;Oh, Se-Jin
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.10
    • /
    • pp.1-14
    • /
    • 2020
  • In this paper, we propose to solve the problem of increasing system memory usage due to an increase in the number of mapping information management when using a NAND flash memory-based storage device in an existing sector-based file system. The proposed technique is to store only mapping information in page units based on index blocks and manage them in block units. To this end, the proposed technique uses a sequential offset for storing and managing a plurality of mapping information in one page in a small block, and a reverse offset for a spare page corresponding to a change in mapping information in the block. Through this, the proposed technique has the advantage that the number of block-unit deletions is less than that of the existing technique, and the system memory usage required for mapping information management is low. Reduced by about 32%.