CS 180: Intro to Computer Vision and Computational Photography, Fall 2024

Final Project: Lightfield Cameras

Ian Dong



Note

This report contains gifs that may not load properly when viewed as a PDF. Please click the link above to view the report online.

Overview

For this project, I reproduced the effects of a Lytro camera using real lightfield data via shifting and averaging. The images were captured over a plane orthogonal to the optical axis, which makes it possible to achieve the complex effects shown below.



Section I: Depth Refocusing

Depth Refocusing

For this part of the project, I took in all of the images and used the center one as the reference. The naive method would be to simply average these images without any shifting. This, however, produces an image that is sharp around far-away objects but blurry around nearby ones, since nearby objects move much more between views than distant ones. To address this, I shifted each image toward the reference and scaled the shift by a constant $c$: $$ (\text{shift}_x, \text{shift}_y) = c \cdot ((x_i, y_i) - (\text{center}_x, \text{center}_y)) $$ where $(x_i, y_i)$ is the grid position of image $i$ and $(\text{center}_x, \text{center}_y)$ is the grid position of the reference image. Here are some images and a gif with varying values of $c$.

$c = 0.15$
$c = 0.5$
Gif for $c \in [-0.5, 0.5]$

As shown in the gif, the depth gets refocused on different parts of the image based on the value of $c$. The far-away objects become blurry when $c$ is negative and the nearby objects become blurry when $c$ is positive.
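The shift-and-average step can be sketched in Python as follows. This is a minimal sketch, not the project's actual code: it assumes the sub-aperture views and their grid positions are already loaded into lists, and `refocus`, `images`, and `positions` are illustrative names.

```python
import numpy as np
from scipy.ndimage import shift as nd_shift

def refocus(images, positions, c):
    """Shift-and-average depth refocusing over a lightfield grid.

    images:    list of H x W x 3 arrays, one per sub-aperture view
    positions: list of (x, y) grid coordinates, one per view
    c:         refocusing constant; varying it moves the focal plane
    """
    positions = np.asarray(positions, dtype=float)
    center = positions.mean(axis=0)          # grid position of the reference view
    out = np.zeros_like(images[0], dtype=float)
    for img, pos in zip(images, positions):
        dx, dy = c * (pos - center)          # (shift_x, shift_y) = c * (pos - center)
        # shift rows by dy and columns by dx; order=1 gives bilinear interpolation
        out += nd_shift(img, (dy, dx, 0), order=1, mode='nearest')
    return out / len(images)
```

With `c = 0` this degenerates to the naive plain average; sweeping `c` over a range such as $[-0.5, 0.5]$ produces the refocusing gif.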



Section II: Aperture Adjustment

Aperture Adjustment

In this section, I simulated changing the aperture, which affects how much of the image is in focus. While keeping the same $c$ value, I varied a radius around the center of the grid and averaged only the images whose grid positions fell within that circle. Using fewer images mimics a smaller aperture: although fewer views contribute to the average, less of the scene away from the focal plane gets blurred, so the result looks more focused. Here are some of the images and a gif with varying radii.

Radius = 2
Radius = 6
Gif for Varying Radius

As shown in the gif, the regions at the focal depth stay sharp while the out-of-focus regions grow blurrier as the radius increases. This demonstrates the effect of changing the aperture.
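The radius filter can be sketched as follows. This is a hedged, self-contained sketch under the same assumptions as the refocusing step (each view comes with an (x, y) grid position); `aperture_average` is an illustrative name, not the project's actual function.

```python
import numpy as np
from scipy.ndimage import shift as nd_shift

def aperture_average(images, positions, c, radius):
    """Average only the views within `radius` of the grid center
    (after the refocusing shifts), simulating a synthetic aperture."""
    positions = np.asarray(positions, dtype=float)
    center = positions.mean(axis=0)
    # keep only views inside the circle of the given radius
    keep = np.linalg.norm(positions - center, axis=1) <= radius
    out, n = np.zeros_like(images[0], dtype=float), 0
    for img, pos, k in zip(images, positions, keep):
        if not k:
            continue
        dx, dy = c * (pos - center)
        out += nd_shift(img, (dy, dx, 0), order=1, mode='nearest')
        n += 1
    return out / n
```

A radius of 0 keeps only the center view (the sharpest possible "pinhole" image), while a radius covering the whole grid reproduces the full-aperture average.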



Section III: Summary

Learnings

It was very interesting to see how I could simulate the effects of a Lytro camera using real lightfield data. I learned how to shift and average images to achieve the desired effects and how to vary the shift to get different results. I also learned how to simulate changing the aperture to focus on different parts of the image.



Final Project: Gradient Domain Fusion

Overview

The second project that I worked on explored gradient-domain processing and how it can be used to seamlessly blend an object or texture from a source image into a target image. I used the Poisson blending algorithm to achieve this effect. The images below show the source and target images as well as the final blended image.



Section I: Toy Problem

Toy Problem

In the first part of the project, I reconstructed an image from its gradients. The insight is that we care much more about the gradients of an image than its overall intensity, so we can solve for pixel values that maximally preserve the source gradients, pinning down the absolute brightness with a single intensity constraint. I formulated this as a least squares problem minimizing the following three objectives: $$ \begin{align*} (v(x + 1, y) - v(x, y) &- (s(x + 1, y) - s(x, y)))^2 \\ (v(x, y + 1) - v(x, y) &- (s(x, y + 1) - s(x, y)))^2 \\ (v(1, 1) &- s(1, 1))^2 \end{align*} $$ where $s$ is the source image, $v$ is the reconstructed image, and $(x, y)$ ranges over the pixel coordinates. I sped up the solve by building the constraint matrix as a sparse matrix. For the toy problem, solving this least squares system recovered the original image.

Toy Story (Original)
Toy Story (Reconstructed)

As shown above, the reconstructed image is very similar to the original image. Both are a bit grainy because they were small and blown up to be easily viewed.
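The sparse least squares setup can be sketched as below. This is an illustrative, single-channel version; the loop-based constraint construction is written for clarity and would be vectorized in a real implementation, and `reconstruct` is a hypothetical name.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import lsqr

def reconstruct(s):
    """Recover an image from its gradients plus one pinned intensity."""
    h, w = s.shape
    idx = np.arange(h * w).reshape(h, w)   # one unknown per pixel
    rows, cols, vals, b = [], [], [], []
    eq = 0
    # horizontal gradients: v(x+1, y) - v(x, y) = s(x+1, y) - s(x, y)
    for y in range(h):
        for x in range(w - 1):
            rows += [eq, eq]; cols += [idx[y, x + 1], idx[y, x]]
            vals += [1.0, -1.0]; b.append(s[y, x + 1] - s[y, x]); eq += 1
    # vertical gradients: v(x, y+1) - v(x, y) = s(x, y+1) - s(x, y)
    for y in range(h - 1):
        for x in range(w):
            rows += [eq, eq]; cols += [idx[y + 1, x], idx[y, x]]
            vals += [1.0, -1.0]; b.append(s[y + 1, x] - s[y, x]); eq += 1
    # pin one intensity so the solution is unique: v(1, 1) = s(1, 1)
    rows.append(eq); cols.append(idx[0, 0]); vals.append(1.0)
    b.append(s[0, 0]); eq += 1
    A = sp.csr_matrix((vals, (rows, cols)), shape=(eq, h * w))
    v = lsqr(A, np.asarray(b))[0]
    return v.reshape(h, w)
```

The sparse `csr_matrix` is what makes the solve tractable: each equation touches at most two of the $h \cdot w$ unknowns, so a dense matrix would be almost entirely zeros.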



Section II: Poisson Blending

Poisson Blending

For the next part, I worked on Poisson blending. This was very similar to the toy problem above, but it solves only for the pixels inside the mask. First, I chose an image to place into the foreground of my target image. Since the interactive drawing tool wasn't quite working, I manually created the mask to match what I wanted the final image to look like. I also passed in the target image to make sure that the background of the final image stayed the same as the target outside of where I was placing the mask. Let $f$ be the final image, $t$ be the background image, and $s$ be the foreground image, where we are trying to place $s$ into $t$ to create $f$. There were three types of equations to solve: for a masked pixel $i$ with a masked neighbor $j$, preserve the source gradient, $f_i - f_j = s_i - s_j$; for a masked pixel $i$ with a neighbor $j$ outside the mask, use the known background value, $f_i - t_j = s_i - s_j$; and for every pixel outside the mask, copy the background directly, $f_i = t_i$.

As in the toy problem, I built the system as a sparse matrix to reduce the solve time, then used least squares to solve for the final image. Here are the results:

Hikers (Background)
Penguin (Foreground)
Penguin (Mask)
Hikers with Penguin
Apocalypse (Background)
Mona Lisa (Foreground)
Mona Lisa (Mask)
Apocalypse with Mona Lisa
Steph Shooting (Background)
Ian Face (Foreground)
Ian Face (Mask)
Ian Shooting
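The blending system described above can be sketched as follows. This is a minimal single-channel sketch, not the project's actual code: `poisson_blend` is an illustrative name, a real implementation would run it per color channel and vectorize the constraint construction, and it assumes the mask does not touch the image border.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def poisson_blend(s, t, mask):
    """Blend source `s` into target `t` inside boolean `mask`
    (single channel; all three arrays aligned and the same shape)."""
    h, w = t.shape
    ids = -np.ones((h, w), dtype=int)
    ys, xs = np.nonzero(mask)
    ids[ys, xs] = np.arange(len(ys))        # one unknown per masked pixel
    n = len(ys)
    A = sp.lil_matrix((n, n))
    b = np.zeros(n)
    for k, (y, x) in enumerate(zip(ys, xs)):
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if not (0 <= ny < h and 0 <= nx < w):
                continue
            A[k, k] += 1.0
            b[k] += s[y, x] - s[ny, nx]     # source gradient to preserve
            if mask[ny, nx]:
                A[k, ids[ny, nx]] -= 1.0    # neighbor is also an unknown
            else:
                b[k] += t[ny, nx]           # neighbor is fixed to the target
    v = spsolve(A.tocsr(), b)
    out = t.astype(float).copy()            # pixels outside the mask: f = t
    out[ys, xs] = v
    return out
```

Because the masked pixels on the boundary are tied to known target values, the blended region inherits the target's colors while keeping the source's gradients, which is what hides the seam.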


Section III: Bells and Whistles - Mixed Gradients

In this section, I implemented mixed gradients in the Poisson blending algorithm. The only difference is that, for each gradient constraint, I kept whichever of the source and target gradients had the larger magnitude instead of always using the source gradient. This helps to create a more seamless transition between the two images overlaid on top of each other. Here are the results:

Penguin without Mixed Gradients
Penguin with Mixed Gradients

As shown above, the image with mixed gradients has a more seamless transition between the two images, especially around the dirty snow, while the image without mixed gradients still shows seams and poorer blending along the sides. Mixed gradients are useful when there is a significant gradient change, such as harsh lines; they help smooth out the transition between the two images.
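The gradient selection rule is a one-line change to the constraint right-hand side; a sketch with an illustrative helper (not the project's actual code):

```python
import numpy as np

def mixed_gradient(s_i, s_j, t_i, t_j):
    """Return the source or target gradient, whichever has larger magnitude.

    s_i, s_j: source values at a pixel and its neighbor
    t_i, t_j: target values at the same pixel and neighbor
    """
    ds = s_i - s_j
    dt = t_i - t_j
    return np.where(np.abs(ds) >= np.abs(dt), ds, dt)
```

In the Poisson system, this value replaces the plain source gradient $s_i - s_j$ on the right-hand side of each constraint, so strong target edges (like the texture of the snow) survive the blend.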



Section IV: Summary

Learnings

For this project, I learned how to use gradient-domain processing to blend two images together. I reused images from project 2, where I had blended images with Laplacian and Gaussian pyramids. Poisson blending instead solves for pixel values that match specified gradients and boundary intensities, and I learned how to create stronger blends with mixed gradients.