Unit DEEP LEARNING AND ROBOT PERCEPTION
- Course
- Computer engineering and robotics
- Study-unit Code
- A003170
- Curriculum
- Data science
- Teacher
- Gabriele Costante
- Teachers
-
- Gabriele Costante
- Hours
- 48 ore - Gabriele Costante
- CFU
- 6
- Course Regulation
- Coorte 2022
- Offered
- 2023/24
- Learning activities
- Caratterizzante
- Area
- Ingegneria informatica
- Sector
- ING-INF/04
- Type of study-unit
- Opzionale (Optional)
- Type of learning activities
- Attività formativa monodisciplinare
- Language of instruction
- Italian
- Contents
- Image filtering, keypoint detection and feature extraction. Keypoint matching and Image Transformations. Epipolar geometry and stereo reconstruction. Visual odometry and visual SLAM. Deep Learning and convolutional neural networks. Guided exercises on Pytorch for the development of solutions based on convolutional neural networks.
- Reference texts
- Richard Szeliski. “Computer Vision: Algorithms and Applications”, Springer -
Ian Goodfellow, Yoshua Bengio, Aaron Courville. “Deep Learning”, MIT Press - Christopher Bishop. “Pattern Recognition and Machine Learning”, Springer; Richard Hartley, Andrew Zisserman. “Multiple view geometry”, Cambridge University Press – Ramesh Jain, Rangachar Kasturi, Brian G. Schunk. “Machine Vision”, McGraw-Hill
Teacher's handouts available on UNISTUDIUM - PIATTAFORMA DI E-LEARNING DELL'UNIVERSITÀ DEGLI STUDI DI PERUGIA (https://www.unistudium.unipg.it/unistudium/ ) - Educational objectives
- - Basic knowledge of: methodologies for the extraction of information of various semantic levels from images; techniques for the feature extraction and image descriptors computation; techniques for transformation estimation between pairs of images; strategies for 3D reconstruction from stereo images; principles and intuition behind Visual odometry and Visual SLAM; characteristics, advantages and solutions based on convolutional neural networks; methodologies and solution based on reinforcement learning.
- Skills: capability to choose technologies and functional blocks of image analysis systems for various applications; capability to use convolutional neural networks and to design deep learning architectures (in particular, using the Pytorch framework)
- The teaching also contributes to the following learning outcomes: capability to elaborate and/or apply original ideas in different applications; capability to solve problems in new and/or interdisciplinary environments; capability to motivate the design choices made, highlighting possible critical issues; capability to integrate knowledge from different sources and manage complexity. - Prerequisites
- In order to understand the content presented and achieve the learning objectives it is useful to have a basic knowledge of Linear Algebra and a good knowledge of Machine Learning and programming. Suggested teaching: "Machine Learning and Data Mining".
- Teaching methods
- The teaching is organized as follows:- frontal lectures in the classroom;- seminars- guided exercises at the Computer Science Laboratory on the design of solutions for deep learning and deep reinforcement learning problems with the Pytorch framework. Teaching support tools: blackboard and PC+projector, PC.
- Other information
- I Semester (more details at link http: //www.ing.unipg.it/it/didattica/studiare-nei-nostri-corsi/orario-delle-lezioni ).
- Learning verification modality
- The verification of learning is performed with an oral exam and a practical test (project).
The oral test includes a discussion of the submitted project and two questions on topics chosen by the teacher, and lasts about 45 minutes.
The exam aims to verify the student's understanding of the fundamental theoretical tools provided by the teaching, as well as the relationship between these tools and the general topics of information engineering.
The project aims to verify the student capabilities to design solutions based on deep learning methodologies for computer vision and robotic applications by using the Pytorch framework.
The project needs to be sent to the teacher the day before the examination date.
Reservations for examinations are made via the SOL portal: https://www.segreterie.unipg.it/.
For information on support services for students with disabilities and/or DSA visit http://www.unipg.it/disabilita-e-dsa. - Extended program
- Introduction to Computer Vision.
Image Filtering.
Image Features. Feature Descriptor and Matching.
Image Transforms.
Stereo geometry. Visual odometry.
Machine learning and Computer Vision: Introduction.
Neural Network and Deep Learning.
Convolutional Neural Network (CNN).
CNN Architectures.
Recurrent Neural Network: RNN –LSTM –GRU. CNN in Computer vision applications (Detection, segmentation, recognition).
Generative Models (Generative adversarial network (GAN)) and CNN unsupervised applications.
Laboratory Session: Framework for deep learning, CNN Architecture design, CNN application examples. - Obiettivi Agenda 2030 per lo sviluppo sostenibile