Deep learning for scene visualization and sentence-based image synthesis

Deep learning and data mining is a subset of machine learning. This project requires to study mainly in field of deep learning and data mining. The research question to be addressed is solve deep learning for scene visualization and sentence-based image synthesis through image classification and ima...

Full description

Saved in:
Bibliographic Details
Main Author: Beh, Teck Sian
Format: Final Year Project / Dissertation / Thesis
Published: 2023
Subjects:
Online Access:http://eprints.utar.edu.my/5988/1/fyp_IA_2023_BTS.pdf
http://eprints.utar.edu.my/5988/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utar-eprints.5988
record_format eprints
spelling my-utar-eprints.59882024-01-02T15:20:01Z Deep learning for scene visualization and sentence-based image synthesis Beh, Teck Sian T Technology (General) TA Engineering (General). Civil engineering (General) TD Environmental technology. Sanitary engineering TN Mining engineering. Metallurgy Deep learning and data mining is a subset of machine learning. This project requires to study mainly in field of deep learning and data mining. The research question to be addressed is solve deep learning for scene visualization and sentence-based image synthesis through image classification and image captioning using language python and anaconda navigator. Image classification is a part of project that has many practical applications in different fields, ranging from object recognition, medical imaging, content moderation, and quality control. Image captioning generator is simple which take an image and try to generate a caption that matches the gist of that image closely as possible, which include whole meaning of one picture in just one sentence, which saves times. The image captioning between NLP and computer vision and work in coordination to make image captioning possible and the attention mechanism came to rescue. The methodology and techniques included in the project are research-based project, which in the research process. Research methods and tools to be used were language python and anaconda navigator to launch the jupyter notebook and google colab. Besides that, the dataset was gotten from Kaggle which is Flickr8k Dataset to launch the progress. The platform uses to run the datasets is Jupyter notebook and google colab to run the coding input and give output to judge validity and generality of results. The projects image processing contributes to computer vision applications, such as object detection, classification, and tracking. Scene visualization allows computer understand objects, environment and sentence-based image synthesis enabled computers generate images from textual descriptions. User can random insert picture and system will detect the images given with suitable text description. This project used to generate visual instructions for robots to perform tasks and create more realistic and immersive gaming environments. For advertising and marketing, these techniques can used to generate personalized ads or product recommendations based on customer preferences. For example, sentence-based image synthesis can used to create custom product images based on user input or social media data. These neural networks try to mimic how the human brain functions. Using a public dataset as training data, a deep learning method called CNN used to detect and segment multiple targets in two-dimensional (2D) elemental images for integral imaging system. A range of applications are embracing these techniques to build virtual scenes by verbal description in tandem with advancement of computer graphics, natural language processing, and computing power. Image captioning with start an image and pass it through a pre-trained ImageNet model like inception v3 and produce output feature vectors. Inception v3 vii is a large network with many pooling, convolution, and fully connected layers which have higher accuracy in the ImageNet dataset which knows as transfer learning for layer output. 2023-06 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/5988/1/fyp_IA_2023_BTS.pdf Beh, Teck Sian (2023) Deep learning for scene visualization and sentence-based image synthesis. Final Year Project, UTAR. http://eprints.utar.edu.my/5988/
institution Universiti Tunku Abdul Rahman
building UTAR Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tunku Abdul Rahman
content_source UTAR Institutional Repository
url_provider http://eprints.utar.edu.my
topic T Technology (General)
TA Engineering (General). Civil engineering (General)
TD Environmental technology. Sanitary engineering
TN Mining engineering. Metallurgy
spellingShingle T Technology (General)
TA Engineering (General). Civil engineering (General)
TD Environmental technology. Sanitary engineering
TN Mining engineering. Metallurgy
Beh, Teck Sian
Deep learning for scene visualization and sentence-based image synthesis
description Deep learning and data mining is a subset of machine learning. This project requires to study mainly in field of deep learning and data mining. The research question to be addressed is solve deep learning for scene visualization and sentence-based image synthesis through image classification and image captioning using language python and anaconda navigator. Image classification is a part of project that has many practical applications in different fields, ranging from object recognition, medical imaging, content moderation, and quality control. Image captioning generator is simple which take an image and try to generate a caption that matches the gist of that image closely as possible, which include whole meaning of one picture in just one sentence, which saves times. The image captioning between NLP and computer vision and work in coordination to make image captioning possible and the attention mechanism came to rescue. The methodology and techniques included in the project are research-based project, which in the research process. Research methods and tools to be used were language python and anaconda navigator to launch the jupyter notebook and google colab. Besides that, the dataset was gotten from Kaggle which is Flickr8k Dataset to launch the progress. The platform uses to run the datasets is Jupyter notebook and google colab to run the coding input and give output to judge validity and generality of results. The projects image processing contributes to computer vision applications, such as object detection, classification, and tracking. Scene visualization allows computer understand objects, environment and sentence-based image synthesis enabled computers generate images from textual descriptions. User can random insert picture and system will detect the images given with suitable text description. This project used to generate visual instructions for robots to perform tasks and create more realistic and immersive gaming environments. For advertising and marketing, these techniques can used to generate personalized ads or product recommendations based on customer preferences. For example, sentence-based image synthesis can used to create custom product images based on user input or social media data. These neural networks try to mimic how the human brain functions. Using a public dataset as training data, a deep learning method called CNN used to detect and segment multiple targets in two-dimensional (2D) elemental images for integral imaging system. A range of applications are embracing these techniques to build virtual scenes by verbal description in tandem with advancement of computer graphics, natural language processing, and computing power. Image captioning with start an image and pass it through a pre-trained ImageNet model like inception v3 and produce output feature vectors. Inception v3 vii is a large network with many pooling, convolution, and fully connected layers which have higher accuracy in the ImageNet dataset which knows as transfer learning for layer output.
format Final Year Project / Dissertation / Thesis
author Beh, Teck Sian
author_facet Beh, Teck Sian
author_sort Beh, Teck Sian
title Deep learning for scene visualization and sentence-based image synthesis
title_short Deep learning for scene visualization and sentence-based image synthesis
title_full Deep learning for scene visualization and sentence-based image synthesis
title_fullStr Deep learning for scene visualization and sentence-based image synthesis
title_full_unstemmed Deep learning for scene visualization and sentence-based image synthesis
title_sort deep learning for scene visualization and sentence-based image synthesis
publishDate 2023
url http://eprints.utar.edu.my/5988/1/fyp_IA_2023_BTS.pdf
http://eprints.utar.edu.my/5988/
_version_ 1787140946940395520
score 13.15806