• Title/Summary/Keyword: Processing


X-tree Diff: An Efficient Change Detection Algorithm for Tree-structured Data (X-tree Diff: 트리 기반 데이터를 위한 효율적인 변화 탐지 알고리즘)

  • Lee, Suk-Kyoon;Kim, Dong-Ah
    • The KIPS Transactions:PartC / v.10C no.6 / pp.683-694 / 2003
  • We present X-tree Diff, a change detection algorithm for tree-structured data. Our work is motivated by the need to monitor a massive volume of web documents and detect suspicious changes, known as defacement attacks on web sites. In this context, the algorithm must be very efficient in both speed and memory use. X-tree Diff uses a special ordered labeled tree, the X-tree, to represent XML/HTML documents. X-tree nodes have a special field, tMD, which stores a 128-bit hash value representing the structure and data of the subtree, so that identical subtrees from the old and new versions can be matched. During this process, X-tree Diff applies the Rule of Delaying Ambiguous Matchings: it performs exact matching only where a node in the old version has a one-to-one correspondence with a node in the new version, and delays all other matchings. This drastically reduces the possibility of wrong matchings. X-tree Diff propagates these exact matchings upwards in Step 2 and obtains more matchings downwards from the roots in Step 3. In Step 4, the nodes to be inserted or deleted are determined. We also show that X-tree Diff runs in O(n) time, where n is the number of nodes in the X-trees, in the worst case as well as in the average case. This result is better than that of the BULD Diff algorithm, which is O(n log(n)) in the worst case. We tested X-tree Diff on real data, about 11,000 home pages from about 20 web sites, instead of synthetic documents manipulated for experimentation. Currently, the X-tree Diff algorithm is used in a commercial hacking detection system called WIDS (Web-Document Intrusion Detection System), which finds changes that have occurred in registered web sites and reports suspicious changes to users.
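
The hash-based matching step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Node` class, the function names, and the choice of MD5 as the 128-bit hash are assumptions made for illustration, and only the first matching step (unique, one-to-one subtree matches, with ambiguous ones delayed) is shown.

```python
import hashlib
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Node:
    """Hypothetical X-tree-like node: a label, optional text, ordered children."""
    label: str
    value: str = ""
    children: List["Node"] = field(default_factory=list)
    tmd: bytes = b""  # 128-bit digest of the whole subtree (assumed to be MD5 here)


def compute_tmd(node: Node) -> bytes:
    """Hash the node's label, value, and its children's digests, in order."""
    h = hashlib.md5()
    h.update(node.label.encode("utf-8"))
    h.update(b"\x00")  # separator so ("ab", "") and ("a", "b") hash differently
    h.update(node.value.encode("utf-8"))
    for child in node.children:
        h.update(compute_tmd(child))
    node.tmd = h.digest()
    return node.tmd


def index_by_tmd(root: Node) -> Dict[bytes, List[Node]]:
    """Group every node of a tree by its subtree digest."""
    index: Dict[bytes, List[Node]] = defaultdict(list)
    stack = [root]
    while stack:
        node = stack.pop()
        index[node.tmd].append(node)
        stack.extend(node.children)
    return index


def match_unique_subtrees(old_root: Node, new_root: Node) -> List[Tuple[Node, Node]]:
    """Match subtrees whose digest occurs exactly once in each tree.

    Digests that occur more than once on either side are ambiguous and are
    skipped here, mirroring the idea of delaying ambiguous matchings.
    """
    compute_tmd(old_root)
    compute_tmd(new_root)
    old_index = index_by_tmd(old_root)
    new_index = index_by_tmd(new_root)
    matches = []
    for digest, old_nodes in old_index.items():
        new_nodes = new_index.get(digest, [])
        if len(old_nodes) == 1 and len(new_nodes) == 1:
            matches.append((old_nodes[0], new_nodes[0]))
    return matches


old = Node("html", children=[Node("p", "a"), Node("p", "b"), Node("p", "b")])
new = Node("html", children=[Node("p", "a"), Node("p", "b"), Node("p", "b"), Node("p", "c")])
pairs = match_unique_subtrees(old, new)
```

In this toy input only the `p`/`"a"` subtree is matched: the two identical `p`/`"b"` subtrees are ambiguous and left for later steps, and the roots differ because a child was added. Each node is visited a constant number of times, which is in the spirit of the linear running time the abstract reports.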

A Study on the Effect of Using Sentiment Lexicon in Opinion Classification (오피니언 분류의 감성사전 활용효과에 대한 연구)

  • Kim, Seungwoo;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.20 no.1 / pp.133-148 / 2014
  • Recently, with the advent of various information channels, the volume of data has continued to grow. The main cause of this phenomenon can be found in the significant increase of unstructured data, as the use of smart devices enables users to create data in the form of text, audio, images, and video. Among the various types of unstructured data, users' opinions and a variety of information are clearly expressed in text data such as news, reports, papers, and various articles. Thus, active attempts have been made to create new value by analyzing these texts. The representative techniques used in text analysis are text mining and opinion mining. These share certain important characteristics; for example, they not only use text documents as input data, but also use many natural language processing techniques such as filtering and parsing. Therefore, opinion mining is usually recognized as a sub-concept of text mining, or, in many cases, the two terms are used interchangeably in the literature. Suppose that the purpose of a certain classification analysis is to predict a positive or negative opinion contained in some documents. If we focus on the classification process, the analysis can be regarded as a traditional text mining case. However, if we observe that the target of the analysis is a positive or negative opinion, the analysis can be regarded as a typical example of opinion mining. In other words, two methods (i.e., text mining and opinion mining) are available for opinion classification. Thus, in order to distinguish between the two, a precise definition of each method is needed. In this paper, we found that it is very difficult to distinguish between the two methods clearly with respect to the purpose of analysis and the type of results. We conclude that the most definitive criterion to distinguish text mining from opinion mining is whether an analysis utilizes any kind of sentiment lexicon.
We first established two prediction models, one based on opinion mining and the other on text mining. Next, we compared the main processes used by the two prediction models. Finally, we compared their prediction accuracy on 2,000 movie reviews. The results revealed that the prediction model based on opinion mining showed higher average prediction accuracy than the text mining model. Moreover, in the lift chart generated by the opinion mining based model, the prediction accuracy for the documents with strong certainty was higher than that for the documents with weak certainty. Most of all, opinion mining has a meaningful advantage in that it can reduce learning time dramatically, because a sentiment lexicon generated once can be reused in a similar application domain. Additionally, the classification results can be clearly explained by using a sentiment lexicon. This study has two limitations. First, the results of the experiments cannot be generalized, mainly because the experiment is limited to a small number of movie reviews. Additionally, various parameters in the parsing and filtering steps of the text mining may have affected the accuracy of the prediction models. However, this research contributes a performance comparison of text mining analysis and opinion mining analysis for opinion classification. In future research, a more precise evaluation of the two methods should be made through intensive experiments.
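
To make the distinguishing criterion concrete, here is a minimal sketch of lexicon-based opinion classification. The lexicon entries, their polarity weights, and the function names are hypothetical and are not taken from the paper; a real sentiment lexicon would be far larger and built from domain data.

```python
# Hypothetical toy sentiment lexicon: word -> polarity weight (assumed values).
SENTIMENT_LEXICON = {
    "great": 1.0,
    "wonderful": 1.0,
    "good": 0.5,
    "bad": -0.5,
    "boring": -1.0,
    "terrible": -1.0,
}

PUNCTUATION = ".,!?;:\"'()"


def lexicon_score(text, lexicon=SENTIMENT_LEXICON):
    """Sum the polarity weights of all lexicon words found in the text."""
    tokens = (token.strip(PUNCTUATION) for token in text.lower().split())
    return sum(lexicon.get(token, 0.0) for token in tokens)


def classify_opinion(text, lexicon=SENTIMENT_LEXICON):
    """Label a document by the sign of its lexicon score."""
    score = lexicon_score(text, lexicon)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


label = classify_opinion("A great, wonderful movie!")
```

Because the lexicon is a plain lookup table, it can be reused on another review set without retraining, and each prediction can be explained by listing the words that contributed to the score. These are the two advantages the abstract attributes to the opinion mining model.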

Determination of S-Allyl-L-cystein, Diallyl Disulfide, and Total Amino Acids of Black Garlic after Spontaneous Short-term Fermentation (자가숙성발효 후 흑마늘의 S-Allyl-L-cystein, Diallyl Disulfide 및 Total Amino Acids 분석)

  • Kim, Mun-Su;Kim, Min-Ju;Bang, Woo-Suk;Kim, Keun-Sung;Park, Sung-Soo
    • Journal of the Korean Society of Food Science and Nutrition / v.41 no.5 / pp.661-665 / 2012
  • Garlic (Allium sativum L.) is one of the oldest cultivated plants and has been used throughout the world as a food supplement and a folk medicine for thousands of years. Raw garlic has been processed into a variety of commercial garlic products for consumer convenience. The latest new processing technology, 'spontaneous short-term fermentation', has been developed to process raw garlic into black garlic. The physiologically active effects of garlic have been attributed to its organosulfur compounds. In this study, the proximate compositions and the total amino acid content of raw Namhae garlic and black garlic were determined. The two major organosulfur compounds of garlic, $S$-allyl-L-cysteine (SAC), and diallyl-disulfide (DADS), were also analyzed using RP-HPLC. The proximate compositions were not different between raw and black garlic. The amount of 13 amino acids was greater in black garlic than in raw garlic among a total of 17 amino acids considered. The black garlic had 2-fold higher levels of SAC and 30-fold higher levels of DADS than the raw garlic. Therefore, it is suggested that consuming black garlic produced by spontaneous short-term fermentation is more effective than consuming raw garlic, in order for consumers to take more physiologically active organosulfur compounds (SAC and DADS), which are the compounds that are good for consumer health.

Measurement and Quality Control of MIROS Wave Radar Data at Dokdo (독도 MIROS Wave Radar를 이용한 파랑관측 및 품질관리)

  • Jun, Hyunjung;Min, Yongchim;Jeong, Jin-Yong;Do, Kideok
    • Journal of Korean Society of Coastal and Ocean Engineers / v.32 no.2 / pp.135-145 / 2020
  • Wave observation methods are broadly divided into direct observation, which measures the water surface elevation using a wave buoy or pressure gauge, and remote-sensing observation. The wave buoy and pressure gauge can produce high-quality wave data but carry a high risk of instrument damage and loss, and a high maintenance cost in offshore areas. On the other hand, remote observation methods such as radar are easy to maintain because the equipment is installed on land, but their accuracy is somewhat lower than that of direct observation. This study investigates the data quality of the MIROS Wave and Current Radar (MWR) installed at Dokdo and improves the quality of the remote wave observation data using observation data from the wave buoy (CWB) operated by the Korea Meteorological Administration. We developed and applied three types of wave data quality control: 1) the combined use (Optimal Filter) of the filters designed by MIROS (Reduce Noise Frequency, Phillips Check, Energy Level Check), 2) the Spike Test Algorithm (Spike Test) developed by OOI (Ocean Observatories Initiative), and 3) a new filter (H-Ts QC) using the relationship between significant wave height and period. As a result, the MWR wave observation data processed with the three quality control methods are reasonably reliable for significant wave height. On the other hand, there are still some errors in the significant wave period, so improvements are required. Also, since the MWR wave observation data differ somewhat from the CWB data for high waves of over 3 m, further research is necessary, including the collection and analysis of long-term remote wave observation data and filter development.

Studies on the Drought-Resistance of Major Food Crops I. Effect of Water Stress on the Plant Height, Seedling Dry Weight, Relative Turgidity, Protein and Reducing Sugar in Barley and Wheat Seedling Stage (주요작물의 한발저항성에 관한 연구 제1보 맥류 유묘기의 수분부족이 초장, 유묘건물종, 엽침소, 상대팽압도, 단백질 및 환원당에 미치는 영향)

  • 최원열;민경수;김용환
    • KOREAN JOURNAL OF CROP SCIENCE / v.26 no.4 / pp.304-310 / 1981
  • In order to observe the degree and response of drought-resistance and its physiological mechanism in barley and wheat, 5 species (16 cultivars) were tested for changes in plant height, seedling dry weight, chlorophyll content, leaf relative turgidity, soluble protein, reducing sugar, and seedling growth under water stress, imposed by withholding watering for 8 days starting 10 days after emergence (at the 3rd leaf stage). The average rate of decrease across all cultivars was 15% in plant height, 24% in seedling dry weight, 32% in chlorophyll content, 27% in leaf relative turgidity, and 27% in protein. In contrast, the reducing sugar content under water stress was about 4-fold higher than that of the control. Among the cultivars, rye showed the lowest rate of decrease in seedling dry weight, while Baegdong, Mokpo #55, and the 3 two-row barley cultivars showed the highest. The rate of decrease in the 5 species was in the order of rye < wheat < covered barley < naked barley < two-row barley. For the rate of decrease in chlorophyll content, rye, Cheonggaemil, and Olmil formed the lowest group, and Milyang #12, Bangsa #6, Hyangmaeg, and Sacheon #4 the highest. For the rate of decrease in leaf relative turgidity, the lowest group (22-25%) was rye, Cheonggaemil, and Dongbori #1, while the highest group (30-33%) was Baegdong and the 3 two-row barley cultivars. For the rate of decrease in soluble protein, the lowest group (14-17%) was Chogwang, Geurumil, Dongbori #1, and Mokpo #55, and the highest was the 3 two-row barley cultivars. The ratio of reducing sugar under water stress to that of the control was 4- to 5-fold in rye and wheat, and about 2-fold in naked barley and the 3 two-row barley cultivars. The ratio of increase in the 5 species was in the order of rye > wheat > covered barley > naked barley > two-row barley.
In terms of the physiological and adaptive metabolism during the process leading to drought-resistance, the degree of drought-resistance of the 5 species to water stress at the seedling stage was in the order of rye > wheat > covered barley > naked barley > two-row barley.

Quality of Jeonbuk-originated Brand Rice Compared with Other Domestic Brands and Imported Market Rice (전라북도 브랜드 쌀과 국내 및 수입 유통쌀의 품질 특성 비교)

  • Song, Young-Eun;Cho, Seong-Hyun;Kwon, Young-Rip;Choi, Dong-Chil
    • KOREAN JOURNAL OF CROP SCIENCE / v.53 no.4 / pp.347-352 / 2008
  • This study was carried out to evaluate the quality of Jeonbuk-originated brand rice by comparing it with other domestic brand rices and imported market rices. The variety "Ilmi" accounted for the major portion of brand rices in the Jeonbuk region; a small portion consisted of variety-mixed brands, Shindongjin, Kosihikari, and Hitomebore. Head rice ratio and mechanical taste values were not significantly different between high-quality Jeonbuk-originated brand rice and the other domestic brand rices. The protein, moisture, and amylose contents of the rice were also not significantly different between them. The quality of high-quality Jeonbuk-originated brand rice was as good as that of other domestic brand rices and did not change over the period. Foreign rice imported from the United States, China (including parboiled rice), and Thailand was also compared with domestic rice cultivated in Jeonbuk province. The major components related to the palatability of rice differed by country of origin. The protein content of domestic rice (6.1%) was similar to that of United States rice and lower than those of Chinese and Thai rice. The head rice ratio of the domestic rice was 92%, similar to those of United States and Chinese rice, but the Chinese parboiled rice was completely cracked during processing. The setback viscosity of domestic rice, which is related to retrogradation, was lower than those of the imported rices except United States rice. The Ad (adhesiveness)/H (hardness) ratio was higher in the domestic and United States rice.

Mammalian Reproduction and Pheromones (포유동물의 생식과 페로몬)

  • Lee, Sung-Ho
    • Development and Reproduction / v.10 no.3 / pp.159-168 / 2006
  • Rodents and many other mammals have two chemosensory systems that mediate responses to pheromones: the main olfactory system (MOS) and the accessory olfactory system (AOS). The chemosensory neurons associated with the MOS are located in the main olfactory epithelium, while those associated with the AOS are located in the vomeronasal organ (VNO). Pheromonal odorants access the lumen of the VNO via canals in the roof of the mouth, and are largely thought to be nonvolatile. The main pheromone receptor proteins consist of two superfamilies, V1Rs and V2Rs, that are structurally distinct and unrelated to the olfactory receptors expressed in the main olfactory epithelium. These two types of receptors are seven-transmembrane-domain G-protein-coupled proteins (V1R with $G_{\alpha i2}$, V2R with $G_{\alpha o}$). V2Rs are co-expressed with nonclassical MHC Ib genes (M10 and 8 other M1 family proteins). Another important molecular component of VNO neurons is TrpC2, a cation channel protein of the transient receptor potential (TRP) family that is thought to have a crucial role in signal transduction. There are four types of pheromones in mammalian chemical communication: primers, signalers, modulators, and releasers. Responses to these chemosignals can vary substantially within and between individuals. This variability can stem from the modulating effects of steroid hormones and/or non-steroid factors such as neurotransmitters on olfactory processing. Such modulation frequently augments or facilitates the effects that prevailing social and environmental conditions have on the reproductive axis. The best example is the pregnancy block effect (Bruce effect), caused by testosterone-dependent major urinary proteins (MUPs) in male mouse urine. Intriguingly, mouse GnRH neurons receive pheromone signals from both odor and pheromone relays in the brain and may also receive common odor signals.
Though it is quite controversial, recent studies reveal a complex interplay between reproduction and other functions in which GnRH neurons appear to integrate information from multiple sources and modulate a variety of brain functions.

Quality Characteristics of Puffed Snacks Made from High-amylose Rice Varieties Containing Resistance Starch (저항전분 함유 고아밀로스 품종의 현미로 제조한 팽화 과자의 품질특성)

  • Lee, Kyung Ha;Park, Jiyoung;Lee, Seuk Ki;Lee, Yu-Young;Lee, Byung-Won;Park, Hye Young;Choi, Hye Sun;Cho, Donghwa;Han, Sang-Ik;Oh, Sea-Kwan
    • KOREAN JOURNAL OF CROP SCIENCE / v.62 no.4 / pp.285-292 / 2017
  • We investigated the physicochemical properties of puffed snacks made from intermediate and high amylose rice varieties. The intermediate amylose rice variety 'Sindongjin' and the high amylose rice varieties newly developed for food processing, 'Dodamssal' and 'Goami4', were tested in this study. The crude fat and crude protein contents of the rice cultivars ranged from 1.47 to 3.08% and from 6.30 to 7.63%, respectively. The resistant starch and amylose contents of Dodamssal and Goami4 were higher than those of Sindongjin. The hardness of rice was the highest in Sindongjin and Dodamssal. The hardness of the puffed snacks decreased by 72.07% for Sindongjin, 88.21% for Dodamssal, and 66.67% for Goami4 compared to the raw rice samples. In the sensory evaluation, Dodamssal obtained the highest scores in taste, texture, and overall acceptability of puffed snacks. The results of this study indicate that Dodamssal is a suitable variety for puffed snacks. Also, the physicochemical properties of Dodamssal were improved by the extrusion process. Therefore, Dodamssal can be used for the industrial production of puffed snacks.

A Study on Establishment of the Optimum Mountain Meteorological Observation Network System for Forest Fire Prevention (산불 방지를 위한 산악기상관측시스템 구축방안)

  • Lee, Si-Young;Chung, Il-Ung;Kim, Sang-Kook
    • Korean Journal of Agricultural and Forest Meteorology / v.8 no.1 / pp.36-44 / 2006
  • In this study, we constructed a forest fire danger map in the Yeongdong area of Gangwon-do and Northeastern area of Gyeongsangbuk-do using a forest fire rating model and geographical information system (GIS). We investigated the appropriate positions of the automatic weather station (AWS) and a comprehensive network solution (a system including measurement, communication and data processing) for the establishment of an optimum mountain meteorological observation network system (MMONS). Also, we suggested a possible plan for combining the MMONS with unmanned monitoring camera systems and wireless relay towers operated by local governments and the Korea Forest Service for prevention of forest fire.

Processing and Quality Control of Flux Data at Gwangneung Forest (광릉 산림의 플럭스 자료 처리와 품질 관리)

  • Lim, Hee-Jeong;Lee, Young-Hee
    • Korean Journal of Agricultural and Forest Meteorology / v.10 no.3 / pp.82-93 / 2008
  • In order to ensure a standardized data analysis of the eddy covariance measurements, Hong and Kim's quality control program has been updated and used to process eddy covariance data measured at two levels on the main flux tower at the Gwangneung site from January to May 2005. The updated program removes outliers automatically for $CO_2$ and latent heat fluxes. The flag system consists of four quality groups (G, D, B and M). During the study period, missing data made up about 25% of the total records. After quality control, about 60% of the data were of good quality. The number of records in the G group was larger at 40 m than at 20 m. This is because the 20 m level was within the roughness sublayer, where the presence of the canopy directly influences the character of the turbulence. About 60% of the bad data were due to low wind speed. Energy balance closure at this site was about 40% during the study period. The large imbalance is attributed partly to the combined effects of the neglected heat storage terms, inaccuracy of the ground heat flux, and advection due to the local wind system near the surface. The analysis of wind direction indicates that the frequent occurrence of positive momentum flux was closely associated with the mountain-valley wind system at this site. The negative $CO_2$ flux at night was examined in terms of averaging time. The results show that when the averaging time is longer than 10 min, the magnitude of the calculated $CO_2$ fluxes increases rapidly, suggesting that the 30 min $CO_2$ flux is severely influenced by mesoscale motion or nonstationarity. A proper choice of averaging time needs to be considered to obtain accurate turbulent fluxes during nighttime.
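
The dependence of the flux on averaging time can be illustrated with a minimal sketch of block-averaged eddy covariance. It is not the quality control program described in the abstract: the function names are hypothetical, and despiking, coordinate rotation, density corrections, and quality flags are all omitted. The flux over one averaging block is taken as the mean product of the fluctuations of vertical wind speed `w` and scalar concentration `c` about their block means.

```python
from statistics import fmean


def covariance_flux(w, c):
    """Eddy covariance over one block: mean of w'c' about the block means."""
    w_mean = fmean(w)
    c_mean = fmean(c)
    products = [(wi - w_mean) * (ci - c_mean) for wi, ci in zip(w, c)]
    return fmean(products)


def block_fluxes(w, c, block_len):
    """Split two equally long series into full blocks and return one flux per block.

    block_len is the averaging time expressed in samples; a trailing partial
    block is dropped.
    """
    if len(w) != len(c):
        raise ValueError("w and c must have the same length")
    if block_len < 1:
        raise ValueError("block_len must be at least 1")
    fluxes = []
    for start in range(0, len(w) - block_len + 1, block_len):
        fluxes.append(
            covariance_flux(w[start:start + block_len], c[start:start + block_len])
        )
    return fluxes


# Tiny synthetic series (assumed values) in which w and c fluctuate in phase.
w_series = [1.0, -1.0, 1.0, -1.0]
c_series = [2.0, 0.0, 2.0, 0.0]
short_blocks = block_fluxes(w_series, c_series, 2)
one_block = block_fluxes(w_series, c_series, 4)
```

Calling `block_fluxes` on the same record with different `block_len` values gives a simple way to see how slow, non-turbulent variations enter the flux as the averaging window grows, which is the effect the abstract reports for nighttime $CO_2$ fluxes.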