Unit COMPUTER VISION AND ROBOT PERCEPTION

Course
Computer engineering and robotics
Study-unit Code
A002337
Curriculum
Robotics
Teacher
Gabriele Costante
Teachers
  • Gabriele Costante
Hours
  • 48 ore - Gabriele Costante
CFU
6
Course Regulation
Coorte 2021
Offered
2021/22
Learning activities
Caratterizzante
Area
Ingegneria informatica
Academic discipline
ING-INF/04
Type of study-unit
Obbligatorio (Required)
Type of learning activities
Attività formativa monodisciplinare
Language of instruction
Italian
Contents
Image filtering, keypoint detection and feature extraction. Keypoint matching and Image Transformations. Bag of visual words for image retrieval. Epipolar geometry and stereo reconstruction. Visual odometry and visual SLAM. Deep Learning and convolutional neural networks. Guided exercises on Pytorch for convolutional neural networks design.
Reference texts
Richard Szeliski. “Computer Vision: Algorithms and Applications”, Springer -
Ian Goodfellow, Yoshua Bengio, Aaron Courville. “Deep Learning”, MIT Press - Christopher Bishop. “Pattern Recognition and Machine Learning”, Springer; Richard Hartley, Andrew Zisserman. “Multiple view geometry”, Cambridge University Press – Ramesh Jain, Rangachar Kasturi, Brian G. Schunk. “Machine Vision”, McGraw-Hill.
Teacher's handouts available on UNISTUDIUM - PIATTAFORMA DI E-LEARNING DELL'UNIVERSITÀ DEGLI STUDI DI PERUGIA (https://www.unistudium.unipg.it/unistudium/ )
Educational objectives
- Basic knowledge of: methodologies for the extraction of information of various semantic levels from images; techniques for the feature extraction and image descriptors computation; techniques for trasnformation estimation between pairs of images; algorithms for the image classification and retrieval; strategies for 3D reconstruction from stereo images; principles and intuition behind Visual odometry and Visual SLAM; characteristics, advantages and solutions based on convolutional neural networks. Skills: capability to choose technologies and functional blocks of image analysis systems for various applications; capability to use convolutional neural networks and to design deep learning architectures for image analysis (in particular, using the Pytorch framework) - The teaching also contributes to the following learning outcomes: capability to elaborate and/or apply original ideas in different applications; capability to solve problems in new and/or interdisciplinary environments; capability to motivate the design choices made, highlighting possible critical issues; capability to integrate knowledge from different sources and manage complexity.
Prerequisites
In order to understand the content presented and achieve the learning objectives it is useful to have a basic knowledge of Linear Algebra and a good knowledge of Machine Learning and programming. Suggested teaching: "Machine Learning and Data Mining".
Teaching methods
The teaching is organized as follows:- frontal lectures in the classroom;- seminars- guided exercises at the Computer Science Laboratory on the design of convolutional neural networks with the Pytorch framework. During laboratory lectures, the students are distributed on 16 workstations. Teaching support tools: blackboard and PC+projector, PC.
Other information
II Semester (more details at link http: //www.ing.unipg.it/it/didattica/studiare-nei-nostri-corsi/orario-delle-lezioni ).
Learning verification modality
The verification of learning is performed with an oral exam and a practical test (project).
The oral test includes a discussion of the submitted project and two questions on topics chosen by the teacher, and lasts about 45 minutes.
The exam aims to verify the student's understanding of the fundamental theoretical tools provided by the teaching, as well as the relationship between these tools and the general topics of information engineering.
The project is used to verify the student capabilities to design solutions based on convolutional neural networks for computer vision applications by using the Pytorch framework.
The project needs to be sent to the teacher the day before the examination date.
Reservations for examinations are made via the SOL portal: https://www.segreterie.unipg.it/.

For information on support services for students with disabilities and/or DSA visit http://www.unipg.it/disabilita-e-dsa.
Extended program
Introduction to Computer Vision. Image Filtering. Image Features. Feature Descriptor and Matching. Image Transforms. RANSAC-Hough transform. Bag of visual words. Stereo geometry. Visual odometry. Machine learning and Computer Vision: Introduction, Nearest Neighbor, Linear Classifier, SVM, Neural Network. Deep Learning. Convolutional Neural Network (CNN). CNN Architectures. Recurrent Neural Network: RNN –LSTM –GRU. CNN in Computer vision applications (Detection, segmentation, recognition). Generative Models (Generative adversarial network (GAN)) and CNN unsupervised applications. Laboratory Session: Framework for deep learning, CNN Architecture design, CNN application examples.
Condividi su