• Title/Summary/Keyword: Network Selection System

Search Result 595, Processing Time 0.02 seconds

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.21-44
    • /
    • 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.

Study on the Selecting of Suitable Sites for Integrated Riparian Eco-belts Connecting Dam Floodplains and Riparian Zone - Case Study of Daecheong Reservoir in Geum-river Basin - (댐 홍수터와 수변구역을 연계한 통합형 수변생태벨트 적지 선정방안 연구 - 금강 수계 대청호 사례 연구 -)

  • Bahn, Gwonsoo;Cho, Myeonghyeon;Kang, Jeonkyeong;Kim, Leehyung
    • Journal of Wetlands Research
    • /
    • v.23 no.4
    • /
    • pp.327-341
    • /
    • 2021
  • The riparian eco-belt is an efficient technique that can reduce non-point pollution sources in the basin and improve ecological connectivity and health. In Korea, a legal system for the construction and management of riparian eco-belts is in operation. However, it is currently excluded that rivers and floodplains in dam reservoir that are advantageous for buffer functions such as control of non-point pollutants and ecological habitats. Accordingly, this study presented and analyzed a plan to select a site for an integrated riparian ecol-belt that comprehensively evaluates the water quality and ecosystem characteristics of each dam floodplain and riparian zone for the Daecheong Dam basin in Geum River watershed. First, the Daecheong Dam basin was divided into 138 sub-basin with GIS, and the riparian zone adjacent to the dam floodplain was analyzed. Sixteen evaluation factors related to the ecosystem and water quality impact that affect the selection of integrated riparian eco-belt were decided, and weights for the importance of each factor were set through AHP analysis. The priority of site suitability was derived by conducting an integrated evaluation by applying weights to sub-basin by floodplains and riparian zone factors. In order to determine whether the sites derived through GIS site analysis are sutiable for actual implementation, five sites were inspected according to three factors: land use, pollution sources, and ecological connectivity. As a result, it was confirmed that all sites were appropriate to apply integrated riparian ecol-belt. It is judged that the riparian eco-belt site analysis technique proposed through this study can be applied as a useful tool when establishing an integrated riparian zone management policy in the future. However, it might be necessary to experiment various evaluation factors and weights for each item according to the characteristics and issues of each dam. Additional research need to be conducted on elaborated conservation and restoration strategies considering the Green-Blue Network aspect, evaluation of ecosystem services, and interconnection between related laws and policy and its improvements.

A Study on the Selection of Base Port and Establishment of International Cooperation System for Seafarer Rotation In case of Emergency - Focusing on the Service Network of HMM - (비상 시 선원교대를 위한 거점항만 선정과 국제협력 방안 - HMM 정기선을 중심으로 -)

  • Kim, Bo-ram;Lee, Hye-jin
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.27 no.2
    • /
    • pp.275-285
    • /
    • 2021
  • COVID-19 is threatening the safety of ships and seafarers by delaying seafarer rotation. Shipping companies and governments have a blindspot in case of the onboard environment of seafarers. An effective, alternative plan should be devised to eliminate the possibility of human accidents in an emergency that threatens the safety of seafarers. According to the survey of former and current seafarers, the most important factor in boarding life was safety, and the most necessary thing during emergencies was to secure smooth seafarer rotation rather than improve wages and welfare. By analyzing the major routes of national shipping companies by continent, ports with a large number of calls and a high Air Connectivity Index were selected as the base port. In addition, the route was designed for effective, domestic seafarer rotation during international shipping. Other countries must be consulted to establish a travel route linking ships, ports, and airports for the safe return of sailors to their home countries during an emergency. In addition, it is necessary to work together for the seafarers who are in trouble of seafarer rotation through cooperation with the International Maritime Organization(IMO). Starting with this, the government should have a monitoring system for the return and non-return routes as well as the number of seafarers on board. If such a system is established, it will be able to determine the response direction of our country's policy in case of an emergency. Along with the shipping company's ef orts to improve the treatment of seafarers, national and social attention will be needed to review domestic laws and improve awareness about seafarers.

An Analysis of Accessibility to Hydrogen Charging Stations in Seoul Based on Location-Allocation Models (입지배분모형 기반의 서울시 수소충전소 접근성 분석)

  • Sang-Gyoon Kim;Jong-Seok Won;Yong-Beom Pyeon;Min-Kyung Cho
    • Journal of the Society of Disaster Information
    • /
    • v.20 no.2
    • /
    • pp.339-350
    • /
    • 2024
  • Purpose: This study analyzes accessibility of 10 hydrogen charging stations in Seoul and identifies areas that were difficult to access. The purpose is to re-analyze accessibility by adding a new location in terms of equity and safety of location placement, and then draw implications by comparing the improvement effects. Method: By applying the location-allocation model and the service area model based on network analysis of the ArcGIS program, areas with weak access were identified. The location selection method applied the 'Minimize Facilities' method in consideration of the need for rapid arrival to insufficient hydrogen charging stations. The limit distance for arrival within a specific time was analyzed by applying the average vehicle traffic speed(23.1km/h, Seoul Open Data Square) in 2022 to three categories: 3,850m(10minutes), 5,775m(15minutes), 7,700m(20minutes). In order to minimize conflicts over the installation of hydrogen charging stations, special standards of the Ministry of Trade, Industry and Energy applied to derive candidate sites for additional installation of hydrogen charging stations among existing gas stations and LPG/CNG charging stations. Result: As a result of the analysis, it was confirmed that accessibility was significantly improved by installing 5 new hydrogen charging stations at relatively safe gas stations and LPG/CNG charging stations in areas where access to the existing 10 hydrogen charging stations is weak within 20 minutes. Nevertheless, it was found that there are still areas where access remains difficult. Conclusion: The location allocation model is used to identify areas where access to hydrogen charging stations is difficult and prioritize installation, decision-making to select locations for hydrogen charging stations based on scientific evidence can be supported.

Image Watermarking for Copyright Protection of Images on Shopping Mall (쇼핑몰 이미지 저작권보호를 위한 영상 워터마킹)

  • Bae, Kyoung-Yul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.147-157
    • /
    • 2013
  • With the advent of the digital environment that can be accessed anytime, anywhere with the introduction of high-speed network, the free distribution and use of digital content were made possible. Ironically this environment is raising a variety of copyright infringement, and product images used in the online shopping mall are pirated frequently. There are many controversial issues whether shopping mall images are creative works or not. According to Supreme Court's decision in 2001, to ad pictures taken with ham products is simply a clone of the appearance of objects to deliver nothing but the decision was not only creative expression. But for the photographer's losses recognized in the advertising photo shoot takes the typical cost was estimated damages. According to Seoul District Court precedents in 2003, if there are the photographer's personality and creativity in the selection of the subject, the composition of the set, the direction and amount of light control, set the angle of the camera, shutter speed, shutter chance, other shooting methods for capturing, developing and printing process, the works should be protected by copyright law by the Court's sentence. In order to receive copyright protection of the shopping mall images by the law, it is simply not to convey the status of the product, the photographer's personality and creativity can be recognized that it requires effort. Accordingly, the cost of making the mall image increases, and the necessity for copyright protection becomes higher. The product images of the online shopping mall have a very unique configuration unlike the general pictures such as portraits and landscape photos and, therefore, the general image watermarking technique can not satisfy the requirements of the image watermarking. Because background of product images commonly used in shopping malls is white or black, or gray scale (gradient) color, it is difficult to utilize the space to embed a watermark and the area is very sensitive even a slight change. In this paper, the characteristics of images used in shopping malls are analyzed and a watermarking technology which is suitable to the shopping mall images is proposed. The proposed image watermarking technology divide a product image into smaller blocks, and the corresponding blocks are transformed by DCT (Discrete Cosine Transform), and then the watermark information was inserted into images using quantization of DCT coefficients. Because uniform treatment of the DCT coefficients for quantization cause visual blocking artifacts, the proposed algorithm used weighted mask which quantizes finely the coefficients located block boundaries and coarsely the coefficients located center area of the block. This mask improves subjective visual quality as well as the objective quality of the images. In addition, in order to improve the safety of the algorithm, the blocks which is embedded the watermark are randomly selected and the turbo code is used to reduce the BER when extracting the watermark. The PSNR(Peak Signal to Noise Ratio) of the shopping mall image watermarked by the proposed algorithm is 40.7~48.5[dB] and BER(Bit Error Rate) after JPEG with QF = 70 is 0. This means the watermarked image is high quality and the algorithm is robust to JPEG compression that is used generally at the online shopping malls. Also, for 40% change in size and 40 degrees of rotation, the BER is 0. In general, the shopping malls are used compressed images with QF which is higher than 90. Because the pirated image is used to replicate from original image, the proposed algorithm can identify the copyright infringement in the most cases. As shown the experimental results, the proposed algorithm is suitable to the shopping mall images with simple background. However, the future study should be carried out to enhance the robustness of the proposed algorithm because the robustness loss is occurred after mask process.