Unit COMPUTER VISION AND ROBOT PERCEPTION

Course

Computer engineering and robotics

Study-unit Code

A002337

Curriculum

Robotics

Teacher

Gabriele Costante

Teachers

Gabriele Costante

Hours

48 ore - Gabriele Costante

CFU

Course Regulation

Coorte 2021

Offered

2021/22

Learning activities

Caratterizzante

Area

Ingegneria informatica

Academic discipline

ING-INF/04

Type of study-unit

Obbligatorio (Required)

Type of learning activities

Attività formativa monodisciplinare

Language of instruction

Italian

Contents

Image filtering, keypoint detection and feature extraction. Keypoint matching and Image Transformations. Bag of visual words for image retrieval. Epipolar geometry and stereo reconstruction. Visual odometry and visual SLAM. Deep Learning and convolutional neural networks. Guided exercises on Pytorch for convolutional neural networks design.

Reference texts

Richard Szeliski. “Computer Vision: Algorithms and Applications”, Springer -
Ian Goodfellow, Yoshua Bengio, Aaron Courville. “Deep Learning”, MIT Press - Christopher Bishop. “Pattern Recognition and Machine Learning”, Springer; Richard Hartley, Andrew Zisserman. “Multiple view geometry”, Cambridge University Press – Ramesh Jain, Rangachar Kasturi, Brian G. Schunk. “Machine Vision”, McGraw-Hill.
Teacher's handouts available on UNISTUDIUM - PIATTAFORMA DI E-LEARNING DELL'UNIVERSITÀ DEGLI STUDI DI PERUGIA (https://www.unistudium.unipg.it/unistudium/ )

Educational objectives

- Basic knowledge of: methodologies for the extraction of information of various semantic levels from images; techniques for the feature extraction and image descriptors computation; techniques for trasnformation estimation between pairs of images; algorithms for the image classification and retrieval; strategies for 3D reconstruction from stereo images; principles and intuition behind Visual odometry and Visual SLAM; characteristics, advantages and solutions based on convolutional neural networks. Skills: capability to choose technologies and functional blocks of image analysis systems for various applications; capability to use convolutional neural networks and to design deep learning architectures for image analysis (in particular, using the Pytorch framework) - The teaching also contributes to the following learning outcomes: capability to elaborate and/or apply original ideas in different applications; capability to solve problems in new and/or interdisciplinary environments; capability to motivate the design choices made, highlighting possible critical issues; capability to integrate knowledge from different sources and manage complexity.

Prerequisites

In order to understand the content presented and achieve the learning objectives it is useful to have a basic knowledge of Linear Algebra and a good knowledge of Machine Learning and programming. Suggested teaching: "Machine Learning and Data Mining".

Teaching methods

The teaching is organized as follows:- frontal lectures in the classroom;- seminars- guided exercises at the Computer Science Laboratory on the design of convolutional neural networks with the Pytorch framework. During laboratory lectures, the students are distributed on 16 workstations. Teaching support tools: blackboard and PC+projector, PC.

Other information

II Semester (more details at link http: //www.ing.unipg.it/it/didattica/studiare-nei-nostri-corsi/orario-delle-lezioni ).

Learning verification modality

The verification of learning is performed with an oral exam and a practical test (project).
The oral test includes a discussion of the submitted project and two questions on topics chosen by the teacher, and lasts about 45 minutes.
The exam aims to verify the student's understanding of the fundamental theoretical tools provided by the teaching, as well as the relationship between these tools and the general topics of information engineering.
The project is used to verify the student capabilities to design solutions based on convolutional neural networks for computer vision applications by using the Pytorch framework.
The project needs to be sent to the teacher the day before the examination date.
Reservations for examinations are made via the SOL portal: https://www.segreterie.unipg.it/.

For information on support services for students with disabilities and/or DSA visit http://www.unipg.it/disabilita-e-dsa.

Extended program

Introduction to Computer Vision. Image Filtering. Image Features. Feature Descriptor and Matching. Image Transforms. RANSAC-Hough transform. Bag of visual words. Stereo geometry. Visual odometry. Machine learning and Computer Vision: Introduction, Nearest Neighbor, Linear Classifier, SVM, Neural Network. Deep Learning. Convolutional Neural Network (CNN). CNN Architectures. Recurrent Neural Network: RNN –LSTM –GRU. CNN in Computer vision applications (Detection, segmentation, recognition). Generative Models (Generative adversarial network (GAN)) and CNN unsupervised applications. Laboratory Session: Framework for deep learning, CNN Architecture design, CNN application examples.