• Title/Summary/Keyword: Scraping method

Search Result 25, Processing Time 0.027 seconds

Analysis on Topic Trends and Topic Modeling of KSHSM Journal Papers using Text Mining (텍스트마이닝을 활용한 보건의료산업학회지의 토픽 모델링 및 토픽트렌드 분석)

  • Cho, Kyoung-Won;Bae, Sung-Kwon;Woo, Young-Woon
    • The Korean Journal of Health Service Management
    • /
    • v.11 no.4
    • /
    • pp.213-224
    • /
    • 2017
  • Objectives : The purpose of this study was to analyze representative topics and topic trends of papers in Korean Society and Health Service Management(KSHSM) Journal. Methods : We collected English abstracts and key words of 516 papers in KSHSM Journal from 2007 to 2017. We utilized Python web scraping programs for collecting the papers from Korea Citation Index web site, and RStudio software for topic analysis based on latent Dirichlet allocation algorithm. Results : 9 topics were decided as the best number of topics by perplexity analysis and the resultant 9 topics for all the papers were extracted using Gibbs sampling method. We could refine 9 topics to 5 topics by deep consideration of meanings of each topics and analysis of intertopic distance map. In topic trends analysis from 2007 to 2017, we could verify 'Health Management' and 'Hospital Service' were two representative topics, and 'Hospital Service' was prevalent topic by 2011, but the ratio of the two topics became to be similar from 2012. Conclusions : We discovered 5 topics were the best number of topics and the topic trends reflected the main issues of KSHSM Journal, such as name revision of the society in 2012.

THE EFFECTS OF WAVELENGTH AND INTENSITY OF VISIBLE LIGHT ON THE CURING OF VISIBLE LIGHT CURED COMPOSITE RESIN (가시광선의 파장과 광도가 광중합형 복합레진의 경화에 미치는 영향)

  • Lee, Chae-Gyeong;Hur, Bok
    • Restorative Dentistry and Endodontics
    • /
    • v.14 no.1
    • /
    • pp.149-159
    • /
    • 1989
  • The purpose of this study was to assess the effects of wavelength and intensity of light curing units on the curing of composite resin. The wavelength and intensity of nine units were evaluated with Optical Multichannel Analyzer and Radiometer. Two-part split stainless steel mold with a cylindrical hole-3.0mm in diameter, 6.0mm in hgieht-was prepared. After placing a Mylar strip between two parts, 100 specimens were made by inserting each of four composite resins into the mold and irradiating for 20 seconds with five light units alternatively. The curing depths were measured by scraping method and evaluated by two-way ANOVA. And Vicker's hardness measurements were made on the longitudinally sectioned surface at 0.5mm interval. The results were as follows: 1. Visilux 2 showed a narrow spectral band within the effective wavelength in initiating polymerization and the highest intensity. Translux showed the diffuse spectrum of wavelength and the lower light intensity. 2. Visilux 2 showed the highest curing effect in any composite resin and then followed by Optilux, Efos 35, Heliomat and Translux. (p < 0.01) 3. Durafill showed the deepest curing depth in any light unit and then followed by Bisfil M, Silux and Heliosit. (p < 0.01). 4. Maximum hardness values showed 0.1mm and 0.5mm under top surface and then gradually decreased with depth.

  • PDF

Study on the Anti-angiogenic Activity of Ethanol Extract of Bojungbangam-tang (보정방암탕 에타놀층의 혈관형성 저해작용에 관한 연구)

  • Lee Eun-Ok;Shim Beom-Sang;Surh Young-Joon;Jeon Byung-Hun;Ahn Kyoo-Seok;Kim Sung-Hoon
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.20 no.1
    • /
    • pp.15-19
    • /
    • 2006
  • The anti-angiogenic activity of ethanol extract of Bojungbangam-tang, a new herbal prescription composed of nine crude drugs, was evaluated in human umbilical vein endothelial cells (HUVECs). HPLC profile revealed that five major compunds such as apioliquiritin, narirutin, hesperidin, liquiritin and glycyrrhizin. Ethanol extract of Bojungbangam-tang (EBJT) did not showed any significant cytotoxicity against HUVECs up to 200 ug/ml. EBJT significantly inhibited basic fibroblast growth factor (bFGF)-induced HUVECs proliferation to 69% at 200 ${\mu}g/ml$. Migration using window scraping method and tube formation in bFGF stimulated HUVECs were also significantly suppressed by EBJT in a dose-dependent manner. Taken together, these results suggest that Bojungbangam-tang can be a potent prescription for angiogenesis related disease.

Korean Web Content Extraction using Tag Rank Position and Gradient Boosting (태그 서열 위치와 경사 부스팅을 활용한 한국어 웹 본문 추출)

  • Mo, Jonghoon;Yu, Jae-Myung
    • Journal of KIISE
    • /
    • v.44 no.6
    • /
    • pp.581-586
    • /
    • 2017
  • For automatic web scraping, unnecessary components such as menus and advertisements need to be removed from web pages and main contents should be extracted automatically. A content block tends to be located in the middle of a web page. In particular, Korean web documents rarely include metadata and have a complex design; a suitable method of content extraction is therefore needed. Existing content extraction algorithms use the textual and structural features of content blocks because processing visual features requires heavy computation for rendering and image processing. In this paper, we propose a new content extraction method using the tag positions in HTML as a quasi-visual feature. In addition, we develop a tag rank position, a type of tag position not affected by text length, and show that gradient boosting with the tag rank position is a very accurate content extraction method. The result of this paper shows that the content extraction method can be used to collect high-quality text data automatically from various web pages.

A Study on Unstructured text data Post-processing Methodology using Stopword Thesaurus (불용어 시소러스를 이용한 비정형 텍스트 데이터 후처리 방법론에 관한 연구)

  • Won-Jo Lee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.6
    • /
    • pp.935-940
    • /
    • 2023
  • Most text data collected through web scraping for artificial intelligence and big data analysis is generally large and unstructured, so a purification process is required for big data analysis. The process becomes structured data that can be analyzed through a heuristic pre-processing refining step and a post-processing machine refining step. Therefore, in this study, in the post-processing machine refining process, the Korean dictionary and the stopword dictionary are used to extract vocabularies for frequency analysis for word cloud analysis. In this process, "user-defined stopwords" are used to efficiently remove stopwords that were not removed. We propose a methodology for applying the "thesaurus" and examine the pros and cons of the proposed refining method through a case analysis using the "user-defined stop word thesaurus" technique proposed to complement the problems of the existing "stop word dictionary" method with R's word cloud technique. We present comparative verification and suggest the effectiveness of practical application of the proposed methodology.

Further Modifications to the Mobile Nylon Bag Technique to Determine Nutrient Digestibility for Swine

  • Thacker, P.A.;Qiao, S.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.14 no.8
    • /
    • pp.1149-1156
    • /
    • 2001
  • Previous studies conducted with swine have reported that the mobile nylon bag technique (MNBT) does not always accurately predict in vivo nutrient digestibilities. Therefore, in this study, the MNBT was modified so that nutrient digestibilities would more closely resemble those from conventional (Con) digestibility studies obtained using the indicator method. A total of 19 feeds were tested including five cereal grains, five legumes, three high protein sources and six mixed diets. The principle changes to the MNBT included the use of a fecal collection harness which minimized the number of bags lost. In addition, previous protocols involved pooling of bags within pig while in the present experiment all bags were analyzed separately to increase the precision of the test. Finally, chemical analyses were done using the entire nylon bag plus residue rather than opening.the bags and scraping out the contents. With the exception of the barley sample (p=0.01), dry matter digestibility (DMD) coefficients obtained with the MNBT were not significantly different from those obtained with the indicator method. The linear regression equation relating the MNBT to the indicator method was Con DMD=-O.77+1.02 MNBT DMD ($r^2=0.93$: p<0.0001). There was no significant (p>0.05) difference in gross energy digestibility (GED) coefficients determined using the MNBT or the indicator method for any of the 19 feeds. The regression line equation relating the MNBT to the indicator method was Con GED=-5.68+1.06 MNBT GED ($r^2=0.94$: p<0.0001). The MNBT was less effective in predicting in vivo crude protein digestibility (CPD) than it was in predicting dry matter and energy digestibility. Differences greater than five percentage units were observed for two of the legumes, Kabuli chickpeas (p=0.02) and the extruded pea-canola seed mixture (p=0.01) as well as for three of the mixed diets including the unheated hulled barley-based diet (p=0.01), the unheated hulless-barley based diet (p=0.08) and the barley-soybean meal based diet (p=0.008). The regression equation relating the MNBT to the indicator method was Con CPD=5.75 + 0.90 MNBT CPO ($r^2=0.76$; p<0.0001). This study indicates that the modified MNBT can be used for the rapid determination of dry matter and energy digestibility in a wide variety of ingredients. For the measurement of crude protein digestibility, the technique produces results similar to conventional digestibility studies for cereal grains and high protein feeds but tends to overestimate protein digestibility for legumes and mixed diets.

A Study of cut off effect of ultraviolet in sunglasses lens coated with nickel-ferrite thin film NxFe3-xO4 (니켈페라이트 박막 NxFe3-xO4를 이용한 선글라스 렌즈의 자외선 차단효과에 대한 연구)

  • Ha, T.W.;Lee, Y.H.;Choi, K.S.;Cha, J.W.
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.8 no.2
    • /
    • pp.25-29
    • /
    • 2003
  • Nickel-ferrite $Ni_xFe_{3-x}O_4$ thin films with several composition for Ni on glass substrate was prepared by ferrite plating method in order to make sunglass which cut off ultraviolet and shield electromagnetic field. It has single phase of polycrystalline spinel structure and has gloss as mirror and has high hardness which is no scratch while scraping by using nail. The transmittance of nickel-ferrite thin film is lowered to zero below 400 nm manifestly. And it shows that the nickel-ferrite thin film in nickel composition rate x = 0.09 was most cut oil ultraviolet when compared with goods of other company in the cut off effect of ultraviolet. Therefore, sunglasses coated with $Ni_xFe_{3-x}O_4$ thin film can be used in removing ultraviolet and electromagnetic field.

  • PDF

Detection of Human Papillomavirus DNA in Routine Cervical Scraping Samples: Use for a National Cervical Cancer Screening Program in a Developing Nation

  • Othman, Norodiyah;Othman, Nor Hayati
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.5
    • /
    • pp.2245-2249
    • /
    • 2014
  • Background: Human papillomavirus is a well-established cause of the development of a variety of epithelial lesions in the cervix. However, as yet, incorporation of HPV testing into cervical cancer screening either as an adjunct or stand alone test is limited due to its cost. We therefore here ascertained the presence and type specificity of human papilloma virus (HPV) DNA in routine cervical scrapings. Materials and Methods: Cervical scrapings were collected from women attending clinics for routine Pap smear screening. HPV-DNA was detected by PCR using MY09/11 and GP5+/GP6+ primer sets and genotyping was accomplished by cycle-sequencing. Results: A total of 635 women were recruited into the study with $mean{\pm}SD$ age of $43{\pm}10.5$ years. Of these 92.6% (588/635) were reported as within normal limits (WNL) on cytology. The presence of HPV infection detected by nested MY/GP+-PCR was 4.4% (28/635). The overall prevalence of high-risk HPV (HR-HPV) in abnormal Pap smears was 53.8% (7/13). HPVs were also seen in 3.1% (18/588) of smears reported as WNL by cytology and 5.9% (2/34) in smears unsatisfactory for evaluation. Conclusions: The overall percentage of HPV positivity in routine cervical screening samples is comparable with abnormal findings in cytology. Conventional Pap smear 'missed' a few samples. Since HPV testing is expensive, our results may provide valuable information for strategising implementation of effective cervical cancer screening in a country with limited resources like Malaysia. If Pap smear coverage could be improved, HPV testing could be used as an adjunct method on cases with ambiguous diagnoses.

Control of Apple Valsa Canker by Localized Spraying with Neoasozin Solution, an Arsenic Fungicide (네오아소진의 국부처리에 의한 사과나무 부란병의 방제)

  • 엄재열;손형락
    • Korean Journal Plant Pathology
    • /
    • v.11 no.1
    • /
    • pp.9-16
    • /
    • 1995
  • Undiluted neoasozin solution (6.5% a.i.), an arsenic fungicide, was sprayed on 169 cankers of apple trees from early March to September in 1987 twice at intervals of one week without scraping off the affected barks. Among the treated cankers, 79.9% ceased to grow within 1∼7 weeks, 13.0% showed partial development, and 7.1% grew continuously to girdle the branches. The partially developed cankers, however, could also be cured by an additional spray after slightly piercing at the edge of cankers to facilitate the penetration of the chemical. When the canker growth was blocked, cankers were encircled by cracks developed at the marginal area of the cankers. If the cracks developed once, very few cankers grew beyond them. The above results suggest that the crack development may be the consequence of the host defense activity to wall off the pathogen. In addition to the curative efficacy, the neoasozin solution inhibited sporulation of the pathogenic fungus almost completely. However, the pathogen survived for more than three months in some cankers that externally appeared to be cured, suggesting that an indirect mode of action of the chemical against apple Valsa canker seems to be still more persuasive than the direct fungicidal effect. In the final examination conducted in the mid April of the next year, 72.7% of the cankers were completely cured by the two successive neoasozin treatments. Moreover the cure rate became 83.1% if that of partially developed cankers which were also completely cured by an additional treatment was also taken into account. Since 1989 when this method was widely applied in apple orchards in Korea, apple Valsa canker has been effectively controlled to reach a tolerable level.

  • PDF

Topic Modeling on Research Trends of Industry 4.0 Using Text Mining (텍스트 마이닝을 이용한 4차 산업 연구 동향 토픽 모델링)

  • Cho, Kyoung Won;Woo, Young Woon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.7
    • /
    • pp.764-770
    • /
    • 2019
  • In this research, text mining techniques were used to analyze the papers related to the "4th Industry". In order to analyze the papers, total of 685 papers were collected by searching with the keyword "4th industry" in Korea Journal Index(KCI) from 2016 to 2019. We used Python-based web scraping program to collect papers and use topic modeling techniques based on LDA algorithm implemented in R language for data analysis. As a result of perplexity analysis on the collected papers, nine topics were determined optimally and nine representative topics of the collected papers were extracted using the Gibbs sampling method. As a result, it was confirmed that artificial intelligence, big data, Internet of things(IoT), digital, network and so on have emerged as the major technologies, and it was confirmed that research has been conducted on the changes due to the major technologies in various fields related to the 4th industry such as industry, government, education field, and job.