• Title/Summary/Keyword: 불법 사이트

Search Result 43, Processing Time 0.02 seconds

An Automated Technique for Illegal Site Detection using the Sequence of HTML Tags (HTML 태그 순서를 이용한 불법 사이트 탐지 자동화 기술)

  • Lee, Kiryong;Lee, Heejo
    • Journal of KIISE
    • /
    • v.43 no.10
    • /
    • pp.1173-1178
    • /
    • 2016
  • Since the introduction of BitTorrent protocol in 2001, everything can be downloaded through file sharing, including music, movies and software. As a result, the copyright holder suffers from illegal sharing of copyright content. In order to solve this problem, countries have enacted illegal share related law; and internet service providers block pirate sites. However, illegal sites such as pirate bay easily reopen the site by changing the domain name. Thus, we propose a technique to easily detect pirate sites that are reopened. This automated technique collects the domain names using the google search engine, and measures similarity using Longest Common Subsequence (LCS) algorithm by comparing the tag structure of the source web page and reopened web page. For evaluation, we colledted 2,383 domains from google search. Experimental results indicated detection of a total of 44 pirate sites for collected domains when applying LCS algorithm. In addition, this technique detected 23 pirate sites for 805 domains when applied to foreign pirate sites. This experiment facilitated easy detection of the reopened pirate sites using an automated detection system.

Effecient Techniques to Block Copyright Infringement Illegal Streaming Sites (저작권 침해 불법 스트리밍 사이트 차단을 위한 효율적인 기법)

  • Kim, Chan-hee;Yu, Ho-jei;Kim, Seo-yeon;Oh, Soo-hyun
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.32 no.5
    • /
    • pp.837-844
    • /
    • 2022
  • In proportion to the rapid development of information and communication technology, the damage to copyright infringement is also increasing. In particular, as the OTT platform market has grown significantly in recent years, the speed and distribution of pirated copies that infringe copyright are increasing rapidly compared to the past. Accordingly, the country is trying to prevent copyright infringement by detecting and blocking illegal streaming sites, but it is difficult to expect great results due to the fast production of illegal streaming sites. Therefore, in this paper, we analyze the causes of rapid production of blocked illegal streaming sites, track and analyze 58 illegal streaming sites, and propose ways to effectively block illegal streaming sites based on the analysis results.

Development of an Intelligent Illegal Gambling Site Detection Model Based on Tag2Vec (Tag2vec 기반의 지능형 불법 도박 사이트 탐지 모형 개발)

  • Song, ChanWoo;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.211-227
    • /
    • 2022
  • Illegal gambling through online gambling sites has become a significant social problem. The development of Internet technology and the spread of smartphones have led to the proliferation of illegal gambling sites, so now illegal online gambling has become accessible to anyone. In order to mitigate its negative effect, the Korean government is trying to detect illegal gambling sites by using self-monitoring agents or reporting systems such as 'Nuricops.' However, it is difficult to detect all illegal sites due to limitations such as a lack of staffing. Accordingly, several scholars have proposed intelligent illegal gambling site detection techniques. Xu et al. (2019) found that fake or illegal websites generally have unique features in the HTML tag structure. It implies that the HTML tag structure can be important for detecting illegal sites. However, prior studies to improve the model's performance by utilizing the HTML tag structure in the illegal site detection model are rare. Against this background, our study aimed to improve the model's performance by utilizing the HTML tag structure and proposes Tag2Vec, a modified version of Doc2Vec, as a methodology to vectorize the HTML tag structure properly. To validate the proposed model, we perform the empirical analysis using a data set consisting of the list of harmful sites from 'The Cheat' and normal sites through Google search. As a result, it was confirmed that the Tag2Vec-based detection model proposed in this study showed better classification accuracy, recall, and F1_Score than the URL-based detection model-a comparative model. The proposed model of this study is expected to be effectively utilized to improve the health of our society through intelligent technology.

A Study on the Classification Model of Overseas Infringing Websites based on Web Hierarchy Similarity Analysis using GNN (GNN을 이용한 웹사이트 Hierarchy 유사도 분석 기반 해외 침해 사이트 분류 모델 연구)

  • Ju-hyeon Seo;Sun-mo Yoo;Jong-hwa Park;Jin-joo Park;Tae-jin Lee
    • Convergence Security Journal
    • /
    • v.23 no.2
    • /
    • pp.47-54
    • /
    • 2023
  • The global popularity of K-content(Korean Wave) has led to a continuous increase in copyright infringement cases involving domestic works, not only within the country but also overseas. In response to this trend, there is active research on technologies for detecting illegal distribution sites of domestic copyrighted materials, with recent studies utilizing the characteristics of domestic illegal distribution sites that often include a significant number of advertising banners. However, the application of detection techniques similar to those used domestically is limited for overseas illegal distribution sites. These sites may not include advertising banners or may have significantly fewer ads compared to domestic sites, making the application of detection technologies used domestically challenging. In this study, we propose a detection technique based on the similarity comparison of links and text trees, leveraging the characteristic of including illegal sharing posts and images of copyrighted materials in a similar hierarchical structure. Additionally, to accurately compare the similarity of large-scale trees composed of a massive number of links, we utilize Graph Neural Network (GNN). The experiments conducted in this study demonstrated a high accuracy rate of over 95% in classifying regular sites and sites involved in the illegal distribution of copyrighted materials. Applying this algorithm to automate the detection of illegal distribution sites is expected to enable swift responses to copyright infringements.

불법유해정보 법.제도 동향 분석

  • Yun, Yeo-Saeng;Yu, Jin-Ho
    • Review of KIISC
    • /
    • v.22 no.3
    • /
    • pp.25-36
    • /
    • 2012
  • 기존 불법유해정보 분류체계를 비교 분석하여 재정립하고, 국내외 불법유해정보 법 제도 현황을 살펴 보았다. 이와 함께 불법유해정보 접근 차단 방안에 대한 이용자 설문 결과를 기초로 불법유해정보 차단에 대한 정책적 제언을 다음과 같이 제시하고자 한다. 먼저 기존 유해정보차단 프로그램의 문제점인 메모리 사용량 증가에 따른 컴퓨터 성능 저하현상을 개선할 수 있는 기술적인 대책이 마련되어야 하며, 청소년 이용자의 보호자 또는 학부모가 사이트별로 제한할 수 있는 기능을 추가하여 다중 필터링 시스템 환경을 조성해야 한다. 또한 기존의 불법유해정보 신고 프로그램은 신고주소, 신고제목, 증거자료 입력 등 복잡한 구성으로 인해 효율성이 떨어지므로, 신고를 원하는 사이트를 이미지화 하여 바로 저장 및 전송이 가능한 형태로 신고 프로그램을 제작하여 신고완료까지의 시간을 단축해야 할 것이다. 기존의 주민등록번호 입력 방식에서 개인식별번호를 이용한 i-PIN 도입을 의무화하고, 기존 i-PIN 사용자의 전환사용을 통해 불편함을 최소화하여 개인정보유출 방지를 위한 i-PIN 사용을 의무화해야 한다. 마지막으로 '자율 등급 서비스' 이외에도 제3의 기관을 통한 '제3자 등급 서비스'를 동시에 사용하여 정보제공자의 부정확한 등급 표시의 문제점을 보완하도록 해야 한다.

User Study for Legalization of Pirate Comics Market (만화 온라인 불법복제물 시장의 양성화를 위한 이용자 연구)

  • Hwang, Sun Tae;Jin Jeon, Eun-Young
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.5
    • /
    • pp.550-559
    • /
    • 2015
  • Despite of the long history of comic industry in Korea, self-motivated sound ecosystem is not determinded due to the pirated contents. The file-sharing literature has focused mainly on how pirated contents could be eradicated, with less attention to the effective strategy. This research is focusing more on pirate comic market within the framework of users. The target user is people who pay for using file sharing site. The research methods are a online survey and a written interview. The goal of this research is to analyze the correlation between morality and websites uses, the user convenience of file-sharing site and the main factors for attracting user to legal portal site. According to our results, legal portal sites needs to improve purchasing factors such as diversity of contents, improvement of payment system and offer of a detailed information.

Study on Preventing Copyrights Infringement through Blocking Advertisements of Illegal Copyrighted Websites (불법 저작물 사이트의 광고 차단을 통한 저작권 침해 방지 연구 - 자금 추적 기반 방식을 중심으로)

  • Shin, Myeong-Seob;Yong, Mi-Ran;Lee, Yeong-Ju
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.7
    • /
    • pp.331-341
    • /
    • 2020
  • Recently the government has succeeded in shutting down the Illegal Copyrighted Websites by cracking down on the operators of the websites. But this only caused 'the Balloon Effect', similar websites were created and users shifted to the new websites. 'Follow the money' is drawing attention as a way to complement the effect of policies. It tracks the commercialization scheme and fund flows of the Illegal Copyrighted Websites and blocks the supply and publication of advertisements, which are the main source of revenue. This approach aims at self-closure of Illegal websites by blocking the revenue source. In this study, we have selected and analyzed overseas cases that adopted these measures. Many countries had different policies and campaigns, but three things are common: non-punishment measures, partnership based on voluntary participation, pursuing a variety of purposes other than protecting the copyright industry. In Korea, the reason public-private Partnerships was not properly established had been caused by the difference of views between them. Advertisers and agencies need to expand their awareness that illegal advertisements can have adverse effects such as brand image damage and enormous economic losses. Also campaigns and conferences related with the policy should be held to prevent copyright infringement through mutual understanding and cooperation.

Detection Technique of Suspected Piracy Sites based on Image Black List (이미지 블랙리스트 기반 저작권 침해 의심 사이트 탐지 기법)

  • Kim, Eui-Jin;Jung, In-Su;Song, Yu-Rae;Kwak, Jin
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.148-150
    • /
    • 2021
  • 저작권 콘텐츠의 해외 진출과 함께, 국내·외 저작권 시장 규모가 증가하고 있다. 이와 동시에 등장한 저작권 침해사이트는 메인 페이지에 저작권 침해사이트를 대표하는 이미지를 게시하는 특징이 있다. 이러한 저작권 침해사이트는 음악, 영화, 드라마 등의 저작권 콘텐츠를 불법 유통시키며 저작권 시장에 피해를 입히고 있다. 공공기관에서는 저작권 침해를 방지하기 위해 저작권 침해사이트를 차단하는 등의 대응을 하고 있지만, 저작권 침해사이트의 생성 속도에 비해 침해 여부 판단 속도가 상대적으로 느려서 차단에 어려움이 존재한다. 따라서, 본 논문에서는 저작권 침해사이트의 대표 이미지를 활용한 이미지 블랙리스트에 기반하여 저작권 침해 의심 사이트 탐지 기법을 제안하고자 한다.

Study of Policy through Big data Analysis about Gambling News (사행산업 관련 뉴스의 빅데이터 분석을 통한 정책 연구)

  • Moon, HyeJung;Kim, SungKyung
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2016.11a
    • /
    • pp.190-193
    • /
    • 2016
  • 본 연구는 사행산업의 분야인 복권, 체육진흥투표권, 경마, 카지노에 대해 언론에서는 어떻게 다루어지고 있는지를 1990년부터 2015년까지의 뉴스데이터를 빅데이터 분석 방법 중 테스트의 의미연결망 분석을 통해 밝혀보고자 하는 연구이다. 이 논문은 의미망 분석을 통해 기사의 빈도와 연결성을 프레이밍과 시민관심 정도로 재조명 하여 기사에 대한 언론보도자의 의도와 시민의 인식차이를 밝혔고, 이를 통해 정책적 특성과 개혁과제를 탐색하였다. 분석결과 복권의 경우 당첨번호, 당첨금, 조작의혹 등 당첨에 대한 부분이 주제인 '사회문제' 형태였으며, 체육진흥투표권의 경우에는 사업입찰, 불법사이트, 발매대상 등 주로 사업추진과 불법사이트에 대한 '의무정보' 종류였고, 경마의 경우 사업장, 홍보, 기사 등으로 사업홍보나 광고 관련 뉴스이었고, 마지막으로 카지노의 경우에는 불법, 도박장, 외국인 등 '주요정보'에 해당하는 논문이었다. 시대에 따라 1990년대에는 카지노, 2000년대에는 복권, 2010년대에는 경마에 대한 기사보도가 많아졌으며, 이에 대한 시민의 반응도 사업비리, 당첨, 시민운동 등의 차이가 있었다. 마지막으로 기사의 빈도와 연결성이 나타내는 프레이밍 정도와 시민의 관심은 '1. 홍보광고, 2. 의무정보, 3. 사회이슈, 4. 주요정보' 네 가지로 구분되었으며 이 중 사고, 비리 등 주요기사로 구분되는 사회문제가 주요 공공의제로 형성되는 것을 확인할 수 있었다.

  • PDF

A Study on the Current State of Illegal Distribution of Literary Works on Internet Cafes and Homepages (인터넷 카페와 홈페이지의 어문저작물 불법 전송 실태에 관한 연구)

  • Kwack, Dong-Chul
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.4 s.54
    • /
    • pp.209-231
    • /
    • 2004
  • The purpose of this study is to examine the current state of illegal distribution of literary works on internet cafes and homepages, and figure out how to protect the rights of copyright holders. For this study, examined were the cafes and menus of homepages of major Websites on the Internet, where illegal copying and delivery of literary works could happen. For each Website, the volumes of the entire collection, the number of literary works held, the maximum and average number of transactions were investigated, and literary works categorized according to genres were analyzed. The result shows that the strict legislation and regulation by government or copyright organizations could hinder the positive distribution of awareness about the copyright : but, still strongly needed is the promotion and education of the importance of copyright to help the public understand better. Providers of portal services should take a proper step to strengthen the training of and systematic support for copyright-related issues for both operators and users of cafes and homepages on Websites.