Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Search for study opportunitiesNEW

Connect with the world's best universities and choose your course of study

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

Image Pyramids and Scale Space Analysis in Computer Vision - Prof. Marshall Tappen, Study notes of Computer Science

University of Central Florida (UCF)Computer Science

Prof. Marshall Tappen

Image pyramids, a representation of an image using a collection of images with decreasing resolution. The importance of scale space analysis in computer vision and introduces gaussian and laplacian pyramids. It also covers applications of image pyramids in texture synthesis and noise removal. Based on a lecture from the computer vision course, cap5415, at the university of texas at austin, fall 2009.

Typology: Study notes

Pre 2010

Uploaded on 11/08/2009

koofers-user-gdy-1 🇺🇸

10 documents

1 / 43

Related documents

Stereo Vision: Triangulation and Image Rectification - Prof. Marshall Tappen

Computer Vision Lecture 1: Course Overview and Convolution - Prof. Marshall Tappen

Scaled Representations and Gaussian Pyramids in Computer Vision

Computer Vision Lecture 3: Periodicity, Sampling, Practicalities in CAP 5415 - Prof. Marsh

Lecture Notes on Computer Vision - Binary Operations | CAP 5415

Problem Set 2 for Computer Vision Course CAP5415 at University of Central Florida - Prof.

Basic Equation - Lecture Notes - Introduction to Computer Vision | CAP 5415

Camera and Geometry Lecture: Pinhole Cameras, Lenses, and Image Formation - Prof. Marshall

Aperture Problem-Introduction to Computer Vision-Lecture 07-Computer Science

(1)

Feature Detection in Computer Vision: Finding Good Points for Matching and Edge Detection

Mean-Shift and Normalized Cuts for Image Segmentation: A Comprehensive Study - Prof. Marsh

Computer Vision and Image Analysis

Computer Vision: Intro to Image Recognition and Face Detection

Computer Vision And Deep Learning: Convolutional Neural Network

Computer Vision: Understanding Images through Image Processing and Analysis

Edge in 1D-Introduction to Computer Vision-Lecture 05-Computer Science

(1)

More Linear Processing - Lecture Notes | CAP 5415

Basic Multi-View Geometry - Lecture Notes | CAP 5415

Multiresolution Image Processing: Image Pyramids and Subband Coding - Prof. S. Todorovic

Lecture Slides on Recap Pyramids - Computer Vision | CAP 5415

Pedestrian Traffic Flow Analysis: A Computer Vision Project - Prof. Linda Wills

Image-Based Volumetric Modeling: Visual Hull and Space Carving

Computer Vision Lecture: Image Projection, Binary Analysis, and Morphological Operators -

Understanding Distance and Paths in Image Analysis: Image Spaces & Riemannian Manifolds -

Linear Algebra for Computer Vision: Notation, Basics, and Applications - Prof. Ramalingam

Computer Vision: Understanding Images and Video Data

CSE152 Assignment 1: Computer Vision - Perspective Transform and Image Manipulation

Computer Vision Lecture 6: Finding Lines in Images using Hough Transform

Understanding Computer Vision: Image Processing, CNN, and Applications

Sample Exam 2 with Answer Key - Image Analysis and Computer Vision | ECE 438

(1)

Partial preview of the text

Download Image Pyramids and Scale Space Analysis in Computer Vision - Prof. Marshall Tappen and more Study notes Computer Science in PDF only on Docsity! CAP5415: Computer Vision Lecture 4: More on Sampling, Image Pyramids, Image Statistics, Denoising Fall 2009 Image Pyramids ● I've motivated the DFT as a transformation that allows you to see different aspects of the data ● Today we will look at a different transformation that gives you easier access to different kinds of information ● Specifically we will look at scale information How about now? EX A e New Task ¢ How many whiskers does the zebra have? e> " Now you need high-resolution info =) (128 x 128 Image) ● Each level of the pyramid represents the image if it were blurred with a Gaussian. The levels vary by the std. dev. of the Gaussians When would it be useful? ● Useful whenever you need to work at multiple scales – Looking for an object that could be close or far ● Can eliminate distractions What's wrong with the Gaussian pyramid? ● It is redundant ● Each level contains all of the low- frequencies that are available at the lower levels. Frequency view ● Each level captures a band of spatial frequencies Level 1 Level 2 Level 3 What about orientation? First component of layer | Laplacian Pyramid Oriented Pyramid Application: Texture Synthesis ¢ Basic Problem wm Sample Texture Larger Sample Heeger and Bergen: SIGGRAPH95 ● Basic Assumption – Take two texture images – Decompose them into a steerable pyramid – If the histogram of the pyramid coefficients are similar, then the textures will appear similar ● Based on studies of human vision, see (Bergen and Adelson 88) or (Malik and Perona 89) Basic algorithm ● Decompose current image into steerable pyramid ● Modify each pyramid image so that its histogram matches the histogram of the corresponding image from the sample ● Invert the pyramid to recover the current texture image ● Repeat until the image converges Results Input Synthesized From (Heeger and Bergen 95) Failures From (Heeger and Bergen 95) Texture beyond histograms ● This approach, while interesting, ended up not being used widely in the graphics community ● Researchers found that they could generate more visually pleasing textures by replicating patches of texture in a smooth fashion – Efros and Leung – 1999 – Efros and Freeman – 2003 ● This approach is still very interesting from an analysis point of view We can use the steerable pyramid (From Adelson and Simoncelli - 1996) What do these histograms tell us? ● Pyramid coefficients from images: Usually zero, but big sometimes ● Noise coefficients: Usually close to zero, very rarely big (From Adelson and Simoncelli - 1996) Question ● What would my estimate of the coefficient be if: – It had a big value? – It had a small value (From Adelson and Simoncelli - 1996) Results (From Adelson and Simoncelli - 1996) Denoising ● One of the state-of-the-art algorithms (Portilla, Strela, Wainwright, and Simoncelli) extends this basic idea – Uses a better estimator of derivative values Fast Filtering ● All of these operations require a lot of filtering ● The FFT is one trick to making it faster. ● We can do more by choosing the filters correctly ● Technique called separable filtering Now, let's do a convolution * The result Result from 2 1-D convolution Original Filter What does this mean? ● Using associativity, we can get the same result by filtering with a 81x81 filter as with convolving by two 1x81 filters ● 81x81 filter=81*81=6561 additions and multiplications ● 2 1x81 filters = 2*81=162 operations ● Much faster

Documents

questions