• Title/Summary/Keyword: Generate Data

Search Result 3,065, Processing Time 0.036 seconds

Domain Knowledge Based Approach for Design Optimization of Arch Dams Using Genetic Algorithms

  • Dongsu Kim;Sangik Lee;Jonghyuk Lee;Byung-hun Seo;Yejin Seo;Dongwoo Kim;Yerim Jo;Won Choi
    • International conference on construction engineering and project management
    • /
    • 2024.07a
    • /
    • pp.1321-1321
    • /
    • 2024
  • Concrete arch dams, unlike conventional concrete gravity dams, have thin arch-shaped cross sections and must be designed considering a three-dimensional shape. In particular, double-curvature arch dams, which have arch-shaped vertical and horizontal sections, require careful consideration during design due to their unique shape. Although stress analysis is complex, and various factors need to be considered during the design, these dams offer economic advantages as they require less material. Consequently, numerous double-curvature arch dams have been constructed worldwide, and ongoing research focuses on optimizing their shapes. In this study, an efficient optimization algorithm was developed for the shape optimization of concrete arch dams with double-curvature using genetic algorithms and improved population initializing technique. The developed technique utilized domain knowledge in the field of arch dams to generate an excellent initial population. To assess the relevance of domain knowledge, an investigation was conducted on the accumulated knowledge and empirical formulas from literature. Two pieces of domain knowledge can be gleaned from the iterative structural design experiences associated with arch dams. First, it concerns the thickness of the central cantilever of an arch dam. For minimum tensile stress, it is best to make the thickness as thin as possible at the dam crest and gradually become thicker as it goes down. The second aspect concerns the sliding stability of the arch dam, which depends on the central angle of the horizontal section. This angel is important for stability because the plane arch serves to transfer the hydraulic load from the reservoir to both abutments. Also, preliminary design formulas for arch dams from a manual written by the United States Bureau of Reclamation (USBR) were used. On the other hand, since domain knowledge is based on engineering experiences and data from existing dams, its usability should be verified by comparing it with the results of design optimization performed by classic genetic algorithms. To validate the performance of the optimization algorithm with the improved population initialization technique, a test site with an existing dam was selected, and algorithmic application tests were conducted. Stress analysis is performed for each design iteration, evaluating constraints and calculating fitness as the objective function. The results confirmed that the algorithm developed in this study exhibits superior performance in terms of average fitness and convergence rate compared to classic genetic algorithms.

Basic Research on the Possibility of Developing a Landscape Perceptual Response Prediction Model Using Artificial Intelligence - Focusing on Machine Learning Techniques - (인공지능을 활용한 경관 지각반응 예측모델 개발 가능성 기초연구 - 머신러닝 기법을 중심으로 -)

  • Kim, Jin-Pyo;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.51 no.3
    • /
    • pp.70-82
    • /
    • 2023
  • The recent surge of IT and data acquisition is shifting the paradigm in all aspects of life, and these advances are also affecting academic fields. Research topics and methods are being improved through academic exchange and connections. In particular, data-based research methods are employed in various academic fields, including landscape architecture, where continuous research is needed. Therefore, this study aims to investigate the possibility of developing a landscape preference evaluation and prediction model using machine learning, a branch of Artificial Intelligence, reflecting the current situation. To achieve the goal of this study, machine learning techniques were applied to the landscaping field to build a landscape preference evaluation and prediction model to verify the simulation accuracy of the model. For this, wind power facility landscape images, recently attracting attention as a renewable energy source, were selected as the research objects. For analysis, images of the wind power facility landscapes were collected using web crawling techniques, and an analysis dataset was built. Orange version 3.33, a program from the University of Ljubljana was used for machine learning analysis to derive a prediction model with excellent performance. IA model that integrates the evaluation criteria of machine learning and a separate model structure for the evaluation criteria were used to generate a model using kNN, SVM, Random Forest, Logistic Regression, and Neural Network algorithms suitable for machine learning classification models. The performance evaluation of the generated models was conducted to derive the most suitable prediction model. The prediction model derived in this study separately evaluates three evaluation criteria, including classification by type of landscape, classification by distance between landscape and target, and classification by preference, and then synthesizes and predicts results. As a result of the study, a prediction model with a high accuracy of 0.986 for the evaluation criterion according to the type of landscape, 0.973 for the evaluation criterion according to the distance, and 0.952 for the evaluation criterion according to the preference was developed, and it can be seen that the verification process through the evaluation of data prediction results exceeds the required performance value of the model. As an experimental attempt to investigate the possibility of developing a prediction model using machine learning in landscape-related research, this study was able to confirm the possibility of creating a high-performance prediction model by building a data set through the collection and refinement of image data and subsequently utilizing it in landscape-related research fields. Based on the results, implications, and limitations of this study, it is believed that it is possible to develop various types of landscape prediction models, including wind power facility natural, and cultural landscapes. Machine learning techniques can be more useful and valuable in the field of landscape architecture by exploring and applying research methods appropriate to the topic, reducing the time of data classification through the study of a model that classifies images according to landscape types or analyzing the importance of landscape planning factors through the analysis of landscape prediction factors using machine learning.

Studies of Molecular Breeding Technique Using Genome Information on Edible Mushrooms

  • Kong, Won-Sik;Woo, Sung-I;Jang, Kab-Yeul;Shin, Pyung-Gyun;Oh, Youn-Lee;Kim, Eun-sun;Oh, Min-Jee;Park, Young-Jin;Lee, Chang-Soo;Kim, Jong-Guk
    • 한국균학회소식:학술대회논문집
    • /
    • 2015.05a
    • /
    • pp.53-53
    • /
    • 2015
  • Agrobacterium tumefaciens-mediated transformation(ATMT) of Flammulina velutipes was used to produce a diverse number of transformants to discover the functions of gene that is vital for its variation color, spore pattern and cellulolytic activity. Futhermore, the transformant pool will be used as a good genetic resource for studying gene functions. Agrobacterium-mediated transformation was conducted in order to generate intentional mutants of F. velutipes strain KACC42777. Then Agrobacterium tumefaciens AGL-1 harboring pBGgHg was transformed into F. velutipes. This method is use to determine the functional gene of F. velutipes. Inverse PCR was used to insert T-DNA into the tagged chromosomal DNA segments and conducting sequence analysis of the F. velutipes. But this experiment had trouble in diverse morphological mutants because of dikaryotic nature of mushroom. It needed to make monokaryotic fruiting varients which introduced genes of compatible mating types. In this study, next generation sequencing data was generated from 28 strains of Flammulina velutipes with different phenotypes using Illumina Hiseq platform. Filtered short reads were initially aligned to the reference genome (KACC42780) to construct a SNP matrix. And then we built a phylogenetic tree based on the validated SNPs. The inferred tree represented that white- and brown- fruitbody forming strains were generally separated although three brown strains, 4103, 4028, and 4195, were grouped with white ones. This topological relationship was consistently reappeared even when we used randomly selected SNPs. Group I containing 4062, 4148, and 4195 strains and group II containing 4188, 4190, and 4194 strains formed early-divergent lineages with robust nodal supports, suggesting that they are independent groups from the members in main clades. To elucidate the distinction between white-fruitbody forming strains isolated from Korea and Japan, phylogenetic analysis was performed using their SNP data with group I members as outgroup. However, no significant genetic variation was noticed in this study. A total of 28 strains of Flammulina velutipes were analyzed to identify the genomic regions responsible for producing white-fruiting body. NGS data was yielded by using Illumina Hiseq platform. Short reads were filtered by quality score and read length were mapped on the reference genome (KACC42780). Between the white- and brown fruitbody forming strains. There is a high possibility that SNPs can be detected among the white strains as homozygous because white phenotype is recessive in F. velutipes. Thus, we constructed SNP matrix within 8 white strains. SNPs discovered between mono3 and mono19, the parental monokaryotic strains of 4210 strain (white), were excluded from the candidate. If the genotypes of SNPs detected between white and brown strains were identical with those in mono3 and mono19 strains, they were included in candidate as a priority. As a result, if more than 5 candidates SNPs were localized in single gene, we regarded as they are possibly related to the white color. In F. velutipes genome, chr01, chr04, chr07,chr11 regions were identified to be associated with white fruitbody forming. White and Brown Fruitbody strains can be used as an identification marker for F. veluipes. We can develop some molecular markers to identify colored strains and discriminate national white varieties against Japanese ones.

  • PDF

Gaussian Filtering Effects on Brain Tissue-masked Susceptibility Weighted Images to Optimize Voxel-based Analysis (화소 분석의 최적화를 위해 자화감수성 영상에 나타난 뇌조직의 가우시안 필터 효과 연구)

  • Hwang, Eo-Jin;Kim, Min-Ji;Jahng, Geon-Ho
    • Investigative Magnetic Resonance Imaging
    • /
    • v.17 no.4
    • /
    • pp.275-285
    • /
    • 2013
  • Purpose : The objective of this study was to investigate effects of different smoothing kernel sizes on brain tissue-masked susceptibility-weighted images (SWI) obtained from normal elderly subjects using voxel-based analyses. Materials and Methods: Twenty healthy human volunteers (mean $age{\pm}SD$ = $67.8{\pm}6.09$ years, 14 females and 6 males) were studied after informed consent. A fully first-order flow-compensated three-dimensional (3D) gradient-echo sequence ran to obtain axial magnitude and phase images to generate SWI data. In addition, sagittal 3D T1-weighted images were acquired with the magnetization-prepared rapid acquisition of gradient-echo sequence for brain tissue segmentation and imaging registration. Both paramagnetically (PSWI) and diamagnetically (NSWI) phase-masked SWI data were obtained with masking out non-brain tissues. Finally, both tissue-masked PSWI and NSWI data were smoothed using different smoothing kernel sizes that were isotropic 0, 2, 4, and 8 mm Gaussian kernels. The voxel-based comparisons were performed using a paired t-test between PSWI and NSWI for each smoothing kernel size. Results: The significance of comparisons increased with increasing smoothing kernel sizes. Signals from NSWI were greater than those from PSWI. The smoothing kernel size of four was optimal to use voxel-based comparisons. The bilaterally different areas were found on multiple brain regions. Conclusion: The paramagnetic (positive) phase mask led to reduce signals from high susceptibility areas. To minimize partial volume effects and contributions of large vessels, the voxel-based analysis on SWI with masked non-brain components should be utilized.

Evaluation of Future Turbidity Water and Eutrophication in Chungju Lake by Climate Change Using CE-QUAL-W2 (CE-QUAL-W2를 이용한 충주호의 기후변화에 따른 탁수 및 부영양화 영향평가)

  • Ahn, So Ra;Ha, Rim;Yoon, Sung Wan;Kim, Seong Joon
    • Journal of Korea Water Resources Association
    • /
    • v.47 no.2
    • /
    • pp.145-159
    • /
    • 2014
  • This study is to evaluate the future climate change impact on turbidity water and eutrophication for Chungju Lake by using CE-QUAL-W2 reservoir water quality model coupled with SWAT watershed model. The SWAT was calibrated and validated using 11 years (2000~2010) daily streamflow data at three locations and monthly stream water quality data at two locations. The CE-QUAL-W2 was calibrated and validated for 2 years (2008 and 2010) water temperature, suspended solid, total nitrogen, total phosphorus, and Chl-a. For the future assessment, the SWAT results were used as boundary conditions for CE-QUAL-W2 model run. To evaluate the future water quality variation in reservoir, the climate data predicted by MM5 RCM(Regional Climate Model) of Special Report on Emissions Scenarios (SRES) A1B for three periods (2013~2040, 2041~2070 and 2071~2100) were downscaled by Artificial Neural Networks method to consider Typhoon effect. The RCM temperature and precipitation outputs and historical records were used to generate pollutants loading from the watershed. By the future temperature increase, the lake water temperature showed $0.5^{\circ}C$ increase in shallow depth while $-0.9^{\circ}C$ in deep depth. The future annual maximum sediment concentration into the lake from the watershed showed 17% increase in wet years. The future lake residence time above 10 mg/L suspended solids (SS) showed increases of 6 and 17 days in wet and dry years respectively comparing with normal year. The SS occupying rate of the lake also showed increases of 24% and 26% in both wet and dry year respectively. In summary, the future lake turbidity showed longer lasting with high concentration comparing with present behavior. Under the future lake environment by the watershed and within lake, the future maximum Chl-a concentration showed increases of 19 % in wet year and 3% in dry year respectively.

Elucidation of Dishes High in N-Nitrosamines Using Total Diet Study Data (총식이조사 자료를 이용한 음식별 니트로사민 함량 분포 규명)

  • Choi, Seul Ki;Lee, Youngwon;Seo, Jung-eun;Park, Jong-eun;Lee, Jee-yeon;Kwon, Hoonjeong
    • Journal of Food Hygiene and Safety
    • /
    • v.33 no.5
    • /
    • pp.361-368
    • /
    • 2018
  • N-nitrosamines are probable or possible human carcinogens, which are produced by the reaction between secondary amines and nitrogen oxide in the acidic environment or by heating. Common risk assessment procedure involves the comparison between exposures expressed in the unit, mg/kg body weight/day and the Health-Based Reference dose expressed in the same unit. This procedure is suitable for the policy decision-making and is considered as inappropriate for the consumers to get information about their dietary decision-making. Therefore, the distributions of NDMA (N-nitrosodimethylamine), NDBA (N-nitrosodibutylamine), the six N-nitrosamines (NDMA, NDBA, NDEA (N-nitrosodiethylamine), NPYR (N-nitrosopyrrolidine), NPIP (N-nitrosopiperidine), and NMOR (N-nitrosomorpholine) in the menus grouped based on the presence of main ingredients and cooking methods were analyzed to generate consumer-friendly information regarding food contaminants. Recipes and intakes were taken from 2014 to 2016 KNHANES (The Korean National Health and Nutrition Examination Survey) and only the data from ages of 7 years or older were used. The contamination data were collected from the 2014~2016 Total Diet Study and all the analysis were performed using R software. Rockfish, eel, anchovy broth and pollock were mainly exposed to N-nitrosamines. In terms of cooking methods, soups and stews appeared to contain the highest amount of N-nitrosamines. Cereals, fruits, and dairy products in the ingredient categories, and rice dishes and rice combined with others in recipe categories had the lowest level exposure to N-nitrosamines. In case of N-nitrosamines, unlike other cooking related food contaminants, boiled dishes such as soups and stews and dishes mainly consisting of fishes and shellfishes had highest level of exposure, showing a large discrepancy with the previous thought of processed meat is the main source of N-nitrosamines.

Research about feature selection that use heuristic function (휴리스틱 함수를 이용한 feature selection에 관한 연구)

  • Hong, Seok-Mi;Jung, Kyung-Sook;Chung, Tae-Choong
    • The KIPS Transactions:PartB
    • /
    • v.10B no.3
    • /
    • pp.281-286
    • /
    • 2003
  • A large number of features are collected for problem solving in real life, but to utilize ail the features collected would be difficult. It is not so easy to collect of correct data about all features. In case it takes advantage of all collected data to learn, complicated learning model is created and good performance result can't get. Also exist interrelationships or hierarchical relations among the features. We can reduce feature's number analyzing relation among the features using heuristic knowledge or statistical method. Heuristic technique refers to learning through repetitive trial and errors and experience. Experts can approach to relevant problem domain through opinion collection process by experience. These properties can be utilized to reduce the number of feature used in learning. Experts generate a new feature (highly abstract) using raw data. This paper describes machine learning model that reduce the number of features used in learning using heuristic function and use abstracted feature by neural network's input value. We have applied this model to the win/lose prediction in pro-baseball games. The result shows the model mixing two techniques not only reduces the complexity of the neural network model but also significantly improves the classification accuracy than when neural network and heuristic model are used separately.

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

  • Jeong, Hanjo;Park, Byeonghwa
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.1-13
    • /
    • 2015
  • As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has made. Lots of social media on the Internet generate unstructured or semi-structured data every second and they are often made by natural or human languages we use in daily life. Many words in human languages have multiple meanings or senses. In this result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, resulting in incorrect search results which are far from users' intentions. Even though a lot of progress in enhancing the performance of search engines has made over the last years in order to provide users with appropriate results, there is still so much to improve it. Word sense disambiguation can play a very important role in dealing with natural language processing and is considered as one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-base, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method which automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries and avoids expensive sense tagging processes. It experiments the effectiveness of the method based on Naïve Bayes Model, which is one of supervised learning algorithms, by using Korean standard unabridged dictionary and Sejong Corpus. Korean standard unabridged dictionary has approximately 57,000 sentences. Sejong Corpus has about 790,000 sentences tagged with part-of-speech and senses all together. For the experiment of this study, Korean standard unabridged dictionary and Sejong Corpus were experimented as a combination and separate entities using cross validation. Only nouns, target subjects in word sense disambiguation, were selected. 93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. Sejong Corpus was easily merged with Korean standard unabridged dictionary because Sejong Corpus was tagged based on sense indices defined by Korean standard unabridged dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added in the named entity dictionary of Korean morphological analyzer. By using the extended named entity dictionary, term vectors were extracted from the input sentences and then term vectors for the sentences were created. Given the extracted term vector and the sense vector model made during the pre-processing stage, the sense-tagged terms were determined by the vector space model based word sense disambiguation. In addition, this study shows the effectiveness of merged corpus from examples in Korean standard unabridged dictionary and Sejong Corpus. The experiment shows the better results in precision and recall are found with the merged corpus. This study suggests it can practically enhance the performance of internet search engines and help us to understand more accurate meaning of a sentence in natural language processing pertinent to search engines, opinion mining, and text mining. Naïve Bayes classifier used in this study represents a supervised learning algorithm and uses Bayes theorem. Naïve Bayes classifier has an assumption that all senses are independent. Even though the assumption of Naïve Bayes classifier is not realistic and ignores the correlation between attributes, Naïve Bayes classifier is widely used because of its simplicity and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research need to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence. Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.

Time-Lapse Crosswell Seismic Study to Evaluate the Underground Cavity Filling (지하공동 충전효과 평가를 위한 시차 공대공 탄성파 토모그래피 연구)

  • Lee, Doo-Sung
    • Geophysics and Geophysical Exploration
    • /
    • v.1 no.1
    • /
    • pp.25-30
    • /
    • 1998
  • Time-lapse crosswell seismic data, recorded before and after the cavity filling, showed that the filling increased the velocity at a known cavity zone in an old mine site in Inchon area. The seismic response depicted on the tomogram and in conjunction with the geologic data from drillings imply that the size of the cavity may be either small or filled by debris. In this study, I attempted to evaluate the filling effect by analyzing velocity measured from the time-lapse tomograms. The data acquired by a downhole airgun and 24-channel hydrophone system revealed that there exists measurable amounts of source statics. I presented a methodology to estimate the source statics. The procedure for this method is: 1) examine the source firing-time for each source, and remove the effect of irregular firing time, and 2) estimate the residual statics caused by inaccurate source positioning. This proposed multi-step inversion may reduce high frequency numerical noise and enhance the resolution at the zone of interest. The multi-step inversion with different starting models successfully shows the subtle velocity changes at the small cavity zone. The inversion procedure is: 1) conduct an inversion using regular sized cells, and generate an image of gross velocity structure by applying a 2-D median filter on the resulting tomogram, and 2) construct the starting velocity model by modifying the final velocity model from the first phase. The model was modified so that the zone of interest consists of small-sized grids. The final velocity model developed from the baseline survey was as a starting velocity model on the monitor inversion. Since we expected a velocity change only in the cavity zone, in the monitor inversion, we can significantly reduce the number of model parameters by fixing the model out-side the cavity zone equal to the baseline model.

  • PDF

Spectral Induced Polarization Characteristics of Rocks in Gwanin Vanadiferous Titanomagnetite (VTM) Deposit (관인 함바나듐 티탄철광상 암석의 광대역 유도분극 특성)

  • Shin, Seungwook
    • Geophysics and Geophysical Exploration
    • /
    • v.24 no.4
    • /
    • pp.194-201
    • /
    • 2021
  • Induced polarization (IP) effect is known to be caused by electrochemical phenomena at interface between minerals and pore water. Spectral induced polarization (SIP) method is an electrical survey to localize subsurface IP anomalies while injecting alternating currents of multiple frequencies into the ground. This method was effectively applied to mineral exploration of various ore deposits. Titanomagnetite ores were being produced by a mining company located in Gonamsan area, Gwanin-myeon, Pocheon-si, Gyeonggi-do, South Korea. Because the ores contain more than 0.4 w% vanadium, the ore deposit is called as Gwanin vanadiferous titanomagnetite (VTM) deposit. The vanadium is the most important of materials in production of vanadium redox flow batteries, which can be appropriately used for large-scale energy storage system. Systematic mineral exploration was conducted to identify presence of hidden VTM orebodies and estimate their potential resources. In geophysical exploration, laboratory geophysical measurement of rock samples is helpful to generate reliable property models from field survey data. Therefore, we performed laboratory SIP data of the rocks from the Gwanin VTM deposit to understand SIP characteristics between ores and host rocks and then demonstrate the applicability of this method for the mineral exploration. Both phase and resistivity spectra of the ores sampled from underground outcrop and drilling cores were different of those of the host rocks consisting of monzodiorite and quartz monzodiorite. Because the phase and resistivity at frequencies below 100 Hz are mainly dependent on the SIP characteristics of the rocks, we calculated mean values of the ores and the host rocks. The average phase values at 0.1 Hz were ores: -369 mrad and host rocks: -39 mrad. The average resistivity values at 0.1 Hz were ores: 16 Ωm and host rocks: 2,623 Ωm. Because the SIP characteristics of the ores were different of those of the host rocks, we considered that the SIP survey is effective for the mineral exploration in vanadiferous titanomagnetite deposits and the SIP characteristics are useful for interpreting field survey data.