• Title/Summary/Keyword: web pages

Search Result 553, Processing Time 0.023 seconds

A WWW Images Automatic Annotation Based On Multi-cues Integration (멀티-큐 통합을 기반으로 WWW 영상의 자동 주석)

  • Shin, Seong-Yoon;Moon, Hyung-Yoon;Rhee, Yang-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.4
    • /
    • pp.79-86
    • /
    • 2008
  • As the rapid development of the Internet, the embedded images in HTML web pages nowadays become predominant. For its amazing function in describing the content and attracting attention, images become substantially important in web pages. All these images consist a considerable database. What's more, the semantic meanings of images are well presented by the surrounding text and links. But only a small minority of these images have precise assigned keyphrases. and manually assigning keyphrases to existing images is very laborious. Therefore it is highly desirable to automate the keyphrases extraction process. In this paper, we first introduce WWW image annotation methods, based on low level features, page tags, overall word frequency and local word frequency. Then we put forward our method of multi-cues integration image annotation. Also, show multi-cue image annotation method is more superior than other method through an experiment.

  • PDF

Mining Search Keywords for Improving the Accuracy of Entity Search (엔터티 검색의 정확성을 높이기 위한 검색 키워드 마이닝)

  • Lee, Sun Ku;On, Byung-Won;Jung, Soo-Mok
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.9
    • /
    • pp.451-464
    • /
    • 2016
  • Nowadays, entity search such as Google Product Search and Yahoo Pipes has been in the spotlight. The entity search engines have been used to retrieve web pages relevant with a particular entity. However, if an entity (e.g., Chinatown movie) has various meanings (e.g., Chinatown movies, Chinatown restaurants, and Incheon Chinatown), then the accuracy of the search result will be decreased significantly. To address this problem, in this article, we propose a novel method that quantifies the importance of search queries and then offers the best query for the entity search, based on Frequent Pattern (FP)-Tree, considering the correlation between the entity relevance and the frequency of web pages. According to the experimental results presented in this paper, the proposed method (59% in the average precision) improved the accuracy five times, compared to the traditional query terms (less than 10% in the average precision).

Development of Network-Based Online GPS Baseline Processing System (네트워크 기반 온라인 GPS 기선해석 시스템 개발)

  • Kim, Su-Kyung;Bae, Tae-Suk
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.14 no.2
    • /
    • pp.138-146
    • /
    • 2011
  • With the increased use of GPS in the field of various applications including surveying, the request for fast and precise positional information has increased. Several countries such as USA, Canada, and Australia have already been operating Internet-based automatic GPS data analysis system using e-mail and FTP. Expanding GPS market, it is necessary to establish automatic GPS baseline processing system that is accessible via Internet. The system developed in this study is operating on the web, and it allows the users to access easily regardless of time and place. The main processing engines are Bernese V5.0 and PAGES. They process user data with three GPS CORS(Continuously Operating Reference Station), and then send the report to the users through e-mail. This system allows users to process high accurate GPS data easily. It is expected that this system will be used for various GPS applications such as monitoring large-scale structures and providing spatial information services in private sector.

A Study of E-mail and Personal Homepage as a Marketing Promotion Tool in the Hotel Industry (호텔에서 마케팅 도구로써 이메일과 개인 홈페이지의 활용방안에 관한 연구)

  • Chung Hyun-Young
    • The Journal of the Korea Contents Association
    • /
    • v.4 no.4
    • /
    • pp.11-19
    • /
    • 2004
  • With the help of information technology the number of email users and personal home page owners are increasing. Marketers have much interest in using the email and personal home pages as a marketing promotion tool which can provide potential customers with messages they want to send. Marketers can facilitate the promotion efforts once if the profiles of potential customers' information can be databased by sending proper messages to the targeted market. Because of the merit of email and personal home page hotel firms are expected to adopt the information applications in their promotions for customers. This study proposes the Possibilities of email and personal home pages as a marketing promotion tool in the hotel industry and discusses problems to be overcome.

  • PDF

A Study on the Design of Hypertext-Based Linear Displays for an Online Thesaurus (하이퍼텍스트를 이용한 온라인 시소러스의 선형배열 설계에 관한 연구)

  • Choi Jae-Hwang
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.33 no.3
    • /
    • pp.109-126
    • /
    • 1999
  • The purpose of this study is to design hypertext-based linear displays for an online thesaurus in librarianship and information science with the aid of ISO and ANSI/NISO thesaurus standards. This study starts with the assumptions that hypertext-based online thesauri would provide a convenient and useful subject retrieval tool to both indexers and searchers of information and become starting point for the study of thesauri searching patterns, which were difficult with printed thesauri. For this study, thesaurus of librarianship and information science was stored in MS ACCESS 97 as a relational database and, for the conjunction of a relational database with World Wide Web, technics of ASP(Active Server Pages) were applied under Windows NT operation.

  • PDF

Implementation of a Large-scale Web Query Processing System Using the Multi-level Cache Scheme (계층적 캐시 기법을 이용한 대용량 웹 검색 질의 처리 시스템의 구현)

  • Lim, Sung-Chae
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.14 no.7
    • /
    • pp.669-679
    • /
    • 2008
  • With the increasing demands of information sharing and searches via the web, the web search engine has drawn much attention. Although many researches have been done to solve technical challenges to build the web search engine, the issue regarding its query processing system is rarely dealt with. Since the software architecture and operational schemes of the query processing system are hard to elaborate, we here present related techniques implemented on a commercial system. The implemented system is a very large-scale system that can process 5-million user queries per day by using index files built on about 65-million web pages. We implement a multi-level cache scheme to save already returned query results for performance considerations, and the multi-level cache is managed in 4-level cache storage areas. Using the multi-level cache, we can improve the system throughput by a factor of 4, thereby reducing around 70% of the server cost.

An Adaptive Web Surfing System for Supporting Autonomous Navigation (자동항해를 지원하는 적응형 웹 서핑 시스템)

  • 국형준
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.4
    • /
    • pp.439-446
    • /
    • 2004
  • To design a user-adaptive web surfing system, we nay take the approach to divide the whole process into three phases; collecting user data, processing the data to construct and improve the user profile, and adapting to the user by applying the user profile. We have designed three software agents. Each privately works in each phase and they collaboratively support adaptive web surfing. They are IIA(Interactive Interface Agent), UPA(User Profile Agent), and ANA(Autonomous Navigation Agent). IIA provides the user interface, which collects data and performs mechanical navigation support. UPA processes the collected user data to build and update the user profile while user is web-surfing. ANA provides an autonomous navigation mode in which it automatically recommends web pages that are selected based on the user profile. The proposed approach and design method, through extensions and refinements, may be used to build a practical adaptive web surfing system.

Phishing Detection Methodology Using Web Sites Heuristic (웹사이트 특징을 이용한 휴리스틱 피싱 탐지 방안 연구)

  • Lee, Jin Lee;Park, Doo Ho;Lee, Chang Hoon
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.4 no.10
    • /
    • pp.349-360
    • /
    • 2015
  • In recent year, phishing attacks are flooding with services based on the web technology. Phishing is affecting online security significantly day by day with the vulnerability of web pages. To prevent phishing attacks, a lot of anti-phishing techniques has been made with their own advantages and dis-advantages respectively, but the phishing attack has not been eradicated completely yet. In this paper, we have studied phishing in detail and categorize a process of phishing attack in two parts - Landing-phase, Attack-phase. In addition, we propose an phishing detection methodology based on web sites heuristic. To extract web sites features, we focus on URL and source codes of web sites. To evaluate performance of the suggested method, set up an experiment and analyze its results. Our methodology indicates the detection accuracy of 98.9% with random forest algorithm. The evaluation of proof-of-concept reveals that web site features can be used for phishing detection.

HTML Text Extraction Using Tag Path and Text Appearance Frequency (태그 경로 및 텍스트 출현 빈도를 이용한 HTML 본문 추출)

  • Kim, Jin-Hwan;Kim, Eun-Gyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1709-1715
    • /
    • 2021
  • In order to accurately extract the necessary text from the web page, the method of specifying the tag and style attributes where the main contents exist to the web crawler has a problem in that the logic for extracting the main contents. This method needs to be modified whenever the web page configuration is changed. In order to solve this problem, the method of extracting the text by analyzing the frequency of appearance of the text proposed in the previous study had a limitation in that the performance deviation was large depending on the collection channel of the web page. Therefore, in this paper, we proposed a method of extracting texts with high accuracy from various collection channels by analyzing not only the frequency of appearance of text but also parent tag paths of text nodes extracted from the DOM tree of web pages.

The Effects of Self-regulated Learning Strategies Using WEB on students′ Academic Achievements and Learning Attitudes in the Middle school Mathematics. -Focused on the Chapter ″Function″ of the First Grade- (중학교 수학에서 WEB을 이용한 자기주도적 학습이 학생들의 학업성취도 및 학습태도에 미치는 영향 - 1학년 함수 단원을 중심으로 -)

  • 이덕호;이관희
    • Journal of the Korean School Mathematics Society
    • /
    • v.4 no.2
    • /
    • pp.75-84
    • /
    • 2001
  • The purpose of this research is to promote the academic achievement motivation and improve problem solving ability in Mathematics. In addition I hope to explore a new teaching method and facilitate students interest in mathmatics. If the teachers utilize an Internet Web Page and exchang information, the interaction activities will allow them to collect and analyse a variety of data. As this teaching method assists students motivation to get the effects of self-regulated learning strategies of students using the internet and their academic achievements and learning attitudes can be explored. The information will be gathered after the students participate in classes which were taught through the Edunet Homepage and the Department of Mathematics Homepage of KongJu National University. The Internet pages focused on the "Function" chapter of the first grade text for students attending middle school. The students were divided into two groups, experimental and comparative. Each group is composed of three levels, high, middle, and low. In the post experimental phase, two tests were administered which measured achievement ability and the learning attitude of the students. The results of the tests were then compared and analyzed. The results were as follows: First, the study demonstrated that self-regulated Learning Starategies towards Academic Achievements and Learning Attitudes were more effective than traditional teaching methods. These methods were significantly effective in the middle level and low level groups. The study demonstrated little to no improvement in the high level groups

  • PDF