Search | Korea Science

An Implementation and Performance Evaluation of Fast Web Crawler with Python

Kim, Cheong Ghil
- Journal of the Semiconductor & Display Technology
- /
- v.18 no.3
- /
- pp.140-143
- /
- 2019
The Internet has been expanded constantly and greatly such that we are having vast number of web pages with dynamic changes. Especially, the fast development of wireless communication technology and the wide spread of various smart devices enable information being created at speed and changed anywhere, anytime. In this situation, web crawling, also known as web scraping, which is an organized, automated computer system for systematically navigating web pages residing on the web and for automatically searching and indexing information, has been inevitably used broadly in many fields today. This paper aims to implement a prototype web crawler with Python and to improve the execution speed using threads on multicore CPU. The results of the implementation confirmed the operation with crawling reference web sites and the performance improvement by evaluating the execution speed on the different thread configurations on multicore CPU.
PDF KSCI

Improving Malicious Web Code Classification with Sequence by Machine Learning

Paik, Incheon
- IEIE Transactions on Smart Processing and Computing
- /
- v.3 no.5
- /
- pp.319-324
- /
- 2014
Web applications make life more convenient. Many web applications have several kinds of user input (e.g. personal information, a user's comment of commercial goods, etc.) for the activities. On the other hand, there are a range of vulnerabilities in the input functions of Web applications. Malicious actions can be attempted using the free accessibility of many web applications. Attacks by the exploitation of these input vulnerabilities can be achieved by injecting malicious web code; it enables one to perform a variety of illegal actions, such as SQL Injection Attacks (SQLIAs) and Cross Site Scripting (XSS). These actions come down to theft, replacing personal information, or phishing. The existing solutions use a parser for the code, are limited to fixed and very small patterns, and are difficult to adapt to variations. A machine learning method can give leverage to cover a far broader range of malicious web code and is easy to adapt to variations and changes. Therefore, this paper suggests the adaptable classification of malicious web code by machine learning approaches for detecting the exploitation user inputs. The approach usually identifies the "looks-like malicious" code for real malicious code. More detailed classification using sequence information is also introduced. The precision for the "looks-like malicious code" is 99% and for the precise classification with sequence is 90%.
https://doi.org/10.5573/IEIESPC.2014.3.5.319 인용 PDF KSCI

Operational Scheme for Large Scale Web Server Cluster Systems (대규모 웹서버 클러스터 시스템의 운영방안 연구)

Park, Jin-Won
- Journal of the Korea Society for Simulation
- /
- v.22 no.3
- /
- pp.71-79
- /
- 2013
Web server cluster systems are widely used, where a large number of PC level servers are interconnected via network. This paper focuses on forecasting an appropriate number of web servers which can serve four different classes of user requests, simple web page viewing, knowledge query, motion picture viewing and motion picture uploading. Two ways of serving different classes of web service requests are considered, commonly used web servers and service dedicated web servers. Computer simulation experiments are performed in order to find a good way of allocating web servers among different classes of web service requests, maintaining certain levels of resource utilization and response time.
https://doi.org/10.9709/JKSS.2013.22.3.071 인용 PDF KSCI

Hybrid Intelligent Web Recommendation Systems Based on Web Data Mining and Case-Based Reasoning

Kim, Jin-Sung
- Journal of the Korean Institute of Intelligent Systems
- /
- v.13 no.3
- /
- pp.366-370
- /
- 2003
In this research, we suggest a hybrid intelligent Web recommendation systems based on Web data mining and case-based reasoning (CBR). One of the important research topics in the field of Internet business is blending artificial intelligence (AI) techniques with knowledge discovering in database (KDD) or data mining (DM). Data mining is used as an efficient mechanism in reasoning for association knowledge between goods and customers＇ preference. In the field of data mining, the features, called attributes, are often selected primary for mining the association knowledge between related products. Therefore, most of researches, in the arena of Web data mining, used association rules extraction mechanism. However, association rules extraction mechanism has a potential limitation in flexibility of reasoning. If there are some goods, which were not retrieved by association rules-based reasoning, we can＇t present more information to customer. To overcome this limitation case, we combined CBR with Web data mining. CBR is one of the AI techniques and used in problems for which it is difficult to solve with logical (association) rules. A Web-log data gathered in real-world Web shopping mall was given to illustrate the quality of the proposed hybrid recommendation mechanism. This Web shopping mall deals with remote-controlled plastic models such as remote-controlled car, yacht, airplane, and helicopter. The experimental results showed that our hybrid recommendation mechanism could reflect both association knowledge and implicit human knowledge extracted from cases in Web databases.
https://doi.org/10.5391/JKIIS.2003.13.3.366 인용 PDF KSCI

Application of Evaluation Criteria for Web sites to Sexuality Education (인터넷상의 성교육 사이트 평가기준의 적용)

Kang, Nam-Mi;Hyun, Tai-Sun;Lee, Pil-Ryang;Kim, Jin
- Women's Health Nursing
- /
- v.7 no.3
- /
- pp.373-381
- /
- 2001
Web sites on the internet are excellent resources for the younger generation to gain information related to sexuality education. The potential benefits of the information of sexuality education on web sites are obvious. But the information of sexuality education on web sites could also result in potentially negative effects. Yet the quality of the information of sexuality education on web sites is variable and difficult to assess. There is no rating criteria for quality assessment of the information on web sites. The rating criteria for quality assessment of information of sexuality education were investigated and reviewed. Among the criteria, best 15 items to evaluate the information of sexuality education on web sites were selected and identified in this study. 15 items were categorized to reliability ( 3 items ), content ( 6 items ), goal ( 2 items ), design & technology ( 4 items ). This 15-items questionnaires is considered as commonly implementable criteria for the information of sexuality education on web sites in Korean. 20 web sites related to sexualtiy education were evaluated and the results were discussed.
PDF

Factors of Consumer' s Digital Content Selection : Focusing on Web-toon (소비자들의 디지털컨텐츠 선택 요인 : 웹툰을 중심으로)

Oh, Yongmin;Jung, Hunsik;Boo, Jeman
- Journal of Korean Society of Industrial and Systems Engineering
- /
- v.42 no.3
- /
- pp.217-231
- /
- 2019
The purpose of this study is to analyze the factors influencing consumers' selection of web-toon service through AHP (Analytic Hierarchy Process) analysis and to provide the strategy of web-toon service. To accomplish this study, theories, existing research and references related to AHP were sufficiently examined and selected the factors in the selection criteria. Surveys from consumers who used the web-toon service were conducted with selected factors. Through this, the results were analyzed by AHP analysis to find out the weighting values and the differences were examined and analyzed. The highest weighting factor in the first layer that consists of web-toon service was cinematic quality. The cinematic quality was the most important factor in the selection criteria of customers who use the web-toon service regardless of their preferred genre. Furthermore, it was confirmed that the weighting value or ranking changed in the second layer by genre. In this study, the effective basis of strategy were suggested by ranking the quantitative selection factors according to the preferred genre of consumers using web-toon services. In addition, This research provides some practical implications. That is, the web-toon service provider can easily recognize and respond to the customer's requirements, which factors are important when the customer selects a specific genre from the web-toon genre.
https://doi.org/10.11627/jkise.2019.42.3.217 인용 PDF KSCI

Web-based SpecCharts Specification Environment for HW/SW Codesign (HW/SW 통합설계를 위한 웹 기반의 SpecCharts 기술 환경)

김승권;김종훈
- Journal of Korea Multimedia Society
- /
- v.3 no.6
- /
- pp.661-673
- /
- 2000
In this paper, we propose a Web-based HW/SW Codesign Environment with Distributed Architecture (WebCEDA), then design and implement SpecCharts Specification Environment(ScSE) for specifying systems in WebCEDA. WebCEDA has 3-tier client/server architecture than can remedy disadvantages of existing codesign tools, such as platform dependency, difficulty of extension, absence of collaboraton environment. ScSE includes web interface, SpecCharts editor, HW/SW codesin application sever and SpecCharts translator. To verify the operation of ScSE, we specify several example system using SpecCharts editor, then translate it to VHDL using SpecCharts translator and simulate the translated VHDL codes on synopsys. As the results, we know that ScSE has correct operations, also obtain the following advantages, the reduction in system complexity and the natural abstract design.
PDF

A Research on Managing Assurance Level for Guaranteeing Quality of Web Services (웹 서비스 품질보장을 위한 보증수준 유지방안 연구)

Lee, Young-Kon;Kim, Eun-Ju
- The KIPS Transactions:PartD
- /
- v.14D no.3 s.113
- /
- pp.319-328
- /
- 2007
As the coverage of Web services become wider and the number of implementation cases is growing, the importance of applying the Web services quality model to real world is increased. For maintaining the level of Web services qualify, it should be required to study on assurance method of Web services qualify level. Assurance for Web services, which is newly proposed by OASIS TC, means the totality of activities for managing the quality level of them. For managing Web service quality, Web service associates could usually use SLA(Service Level Agreement) method in which a service consumer contracts for some service level with a service provider and gives for penalty or pays incentives according to the result of evaluation of services. But, there are some difficulties in applying SLA to Web services, because Web services have publicity, multiple users, and 3rd party for management. So, we need a new assurance method for Web service by considering the characteristics of Web services. This paper provides the new concept of committed assurance level for Web services. This concept can be defined as the set of maximum level of quality expected by each user, which provide the consistent view of Web service quality. This paper presents the method for duality associates to preserve some quality level of Web service by using this concept.
https://doi.org/10.3745/KIPSTD.2007.14-D.3.319 인용 PDF KSCI

Comparative Study of Web Accessibility for Visually Impaired People in Scientific and Technical Retrieval System and Web Contents (시각장애인을 위한 과학기술정보검색 시스템 및 콘텐츠의 웹접근성 평가)

Park, Mi-Young;Ahn, In-Ja;Park, Hyei-Soo;Kim, In-Hee
- Journal of the Korean BIBLIA Society for library and Information Science
- /
- v.21 no.3
- /
- pp.123-137
- /
- 2010
This paper evaluates web accessibility using an evaluative measure of K-WAH 3.0. It targets systems and contents in scientific and technological information, based on a guide for 'Internet Web Contents Accessibility 1.0'. The result indicates that web accessibility of informational system is lower than that of informational contents. Based on this fact, this paper identifies the main characteristics and differences of web accessibilities in informational system and contents, and discusses possible improvements. the result of the evaluation found the scientific system sites is more superior than contents sites in web accessibility. And the scientific system and contents sites except NDSL was lest 'understandable' and 'Robust' in observance. Therefore web page that should bring "easily understandable" and "reducing outstanding side effects" was required.
https://doi.org/10.14699/kbiblia.2010.21.3.123 인용 PDF

A Study on the Ontology Languages and Application Systems for the Semantic Web (시맨틱웹을 위한 온톨로지 언어와 구현사례 연구)

Jeong, Do-Heon
- Journal of Information Management
- /
- v.34 no.3
- /
- pp.87-109
- /
- 2003
Continual attempts to accumulate and apply information eventually gave birth to the concept of the "Semantic Web". Thus, the "Semantic Web" can be defined as a product of mankind's desire to standardize information. At the same time, the term provides "a method that standardizes mankind's concept of linguistical expression", and can be noted as an effort to combine such methods into a standard web environment that may materialize to form a catalogue. This study introduced RDF schema, ontology languages for the semantic web, and ontology-based systems. The purpose of the study was to construct a system based on the semantic web environment's ontology by utilizing the ontology schema derived from the facettype Art and Architecture Thesaurus(AAT). The aforementioned ontology schema is based on the Web Ontology Language(OWL), which is being widely considered the standard ontology language for the W3C-centered semantic web environment.
https://doi.org/10.1633/JIM.2003.34.3.087 인용 PDF

Search Result 4,902, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)