CS 280A HW2 Report

1.1: Finite Difference Operator

To obtain the difference between adjacent pixels of an image $I$, we can define the filters $D_x = \begin{bmatrix} 1 & -1 \end{bmatrix}$ and $D_y = \begin{bmatrix} 1 \\ -1 \end{bmatrix}$ to compute the gradients in the x- and y-directions by convolving them with the original image.

[Figures 1.1.dx, 1.1.dy: $I * D_x$ and $I * D_y$]

Note that the pixel values of the convolved images lie in $[-1, 1]$, so I normalized them with the function $f(x) = \frac{x + 1}{2}$ for better visualization.

Next, the gradient magnitude image $I_{grad}$ can be computed as $I_{grad} = \sqrt{(I * D_x)^2 + (I * D_y)^2}$, where $*$ denotes convolution. Lastly, by applying a binary threshold, we obtain the binarized edge image $I_{bin}$, which shows the pixels with larger gradients.

[Figures 1.1.grad, 1.1.binary: $I_{grad}$ and $I_{bin}$]
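The pipeline above can be sketched as follows. This is a minimal sketch using scipy; the threshold value here is an illustrative assumption, not the one used for the figures.

```python
import numpy as np
from scipy.signal import convolve2d

def gradient_edges(I, threshold=0.25):
    """Finite-difference gradient magnitude and binarized edge image."""
    Dx = np.array([[1.0, -1.0]])    # horizontal finite difference
    Dy = np.array([[1.0], [-1.0]])  # vertical finite difference
    Ix = convolve2d(I, Dx, mode="same", boundary="symm")
    Iy = convolve2d(I, Dy, mode="same", boundary="symm")
    I_grad = np.sqrt(Ix**2 + Iy**2)               # gradient magnitude
    I_bin = (I_grad > threshold).astype(float)    # binary thresholding
    return I_grad, I_bin
```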

1.2: Derivative of Gaussian Filter

To remove the noise generated by the lawn in the image, I convolved the original image $I$ with a Gaussian filter $G$ to create a blurred image $I' = I * G$. Similarly, we can compute $I'_{grad}$ and $I'_{bin}$ by the method described in section 1.1.

[Figures 1.2.1.dx, 1.2.1.dy: $I' * D_x$ and $I' * D_y$]
[Figures 1.2.1.grad, 1.2.2.binary: $I'_{grad}$ and $I'_{bin}$]

By applying Gaussian blur to the image, I saw that the noise generated by the lawn is suppressed. This is mainly because the gradient regions produced by the lawn are small, so the Gaussian blur smooths them out and they are no longer considered "edges."

Notice that $I'_{grad} = \sqrt{(I * G * D_x)^2 + (I * G * D_y)^2}$. Since convolution is associative, we can compute $G * D_x$ and $G * D_y$ first; the resulting filter is called the Derivative of Gaussian (DoG) filter.

$$DoG_x = G * D_x, \qquad DoG_y = G * D_y$$

Now, compute $I'_{grad}$ with the DoG filters: $I'_{grad} = \sqrt{(I * DoG_x)^2 + (I * DoG_y)^2}$.
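The associativity argument can be checked numerically. The sketch below uses full-size convolutions so both orderings produce identically shaped outputs; the kernel size and $\sigma$ are arbitrary choices for illustration.

```python
import numpy as np
from scipy.signal import convolve2d

def gaussian_kernel(size, sigma):
    """2D Gaussian kernel built from the outer product of a 1D Gaussian,
    normalized to sum to 1."""
    ax = np.arange(size) - (size - 1) / 2.0
    g1 = np.exp(-ax**2 / (2 * sigma**2))
    G = np.outer(g1, g1)
    return G / G.sum()

# Associativity: (I * G) * Dx == I * (G * Dx)
rng = np.random.default_rng(0)
I = rng.random((32, 32))
G = gaussian_kernel(9, sigma=1.5)
Dx = np.array([[1.0, -1.0]])

lhs = convolve2d(convolve2d(I, G), Dx)  # blur first, then differentiate
DoGx = convolve2d(G, Dx)                # precomputed DoG filter
rhs = convolve2d(I, DoGx)               # single convolution with DoG
assert np.allclose(lhs, rhs)
```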

[Figures 1.2.2.dx, 1.2.2.dy: $I * DoG_x$ and $I * DoG_y$]
[Figures 1.2.2.grad, 1.2.2.binary: $I'_{grad}$ and $I'_{bin}$ (by DoG filter)]

This verifies that the $I'_{bin}$ images obtained from the two methods are essentially the same.

2.1: Image "Sharpening"

Based on the fact that the Gaussian filter behaves like a low-pass filter, we can obtain the high frequencies $I_{hf}$ of an image by subtraction.

$$I_{hf} = I - (I * G)$$

Since $I_{hf}$ contains the "details" of an image, adding them to the original image yields a "sharpened" image $I_{sharp}$.

$$I_{sharp} = I + \alpha I_{hf}$$

where $\alpha$ is a parameter controlling the extent of sharpening; $\alpha = 0$ keeps the image unchanged.

In the following examples, I chose α=3.
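Unsharp masking can be sketched as below; the Gaussian's $\sigma$ is an assumed value, not necessarily the one used for the figures.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def sharpen(I, alpha=3.0, sigma=2.0):
    """Unsharp masking: add back the high frequencies scaled by alpha."""
    low = gaussian_filter(I, sigma)  # low-pass component I * G
    high = I - low                   # high-frequency "details" I_hf
    return np.clip(I + alpha * high, 0.0, 1.0)
```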

[Figures: taj + 3 × taj_hf = sharpened taj (2.1.taj_hf, 2.1.sharpened)]
[Figures: campanile + 3 × campanile_hf = sharpened campanile (2.1.campanile_hf, 2.1.campanile)]

I picked an image of Sather Gate, blurred it first, and then tried to resharpen the blurred image.

sather_gate
Original
2.1.sather_blurred
Blurred
2.1.sather_blurred
Resharpened

Unfortunately, the resharpening process failed to "restore" the blurred image. The characters on Sather Gate are still blurry after resharpening.

2.2: Hybrid Images

To generate a hybrid image from two images $I_1$ and $I_2$, we can extract the low frequencies of one image and the high frequencies of the other and average them. In my implementation, instead of a simple average, I used a weighted average for better visual effect. Formally, it can be expressed as:

$$I_{hybrid} = \frac{a (I_1 * G) + b (I_2 - I_2 * G)}{a + b}$$

where $a$, $b$ are hand-tuned parameters.
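The weighted-average construction can be sketched as follows; the cutoff $\sigma$ values and the weights are illustrative assumptions, not the hand-tuned values used for the figures.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid(I1, I2, sigma1=5.0, sigma2=5.0, a=1.0, b=1.0):
    """Weighted blend of I1's low frequencies with I2's high frequencies."""
    low = gaussian_filter(I1, sigma1)        # I1 * G
    high = I2 - gaussian_filter(I2, sigma2)  # I2 - I2 * G
    return (a * low + b * high) / (a + b)
```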

For example, take the following images as I1, I2:

[Figures frieren_aligned, anya_aligned: $I_1$ (Frieren) and $I_2$ (Anya)]

I kept the low frequencies of Frieren and the high frequencies of Anya to generate the hybrid image "Frierenya":

2.2.frierenya
"Frierenya"

The image looks more like Frieren when you look far away, while it looks more like Anya when you look close.

Now, let's inspect the frequencies of these images by Fourier analysis.
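The spectra shown below were presumably computed as a centered log-magnitude Fourier transform; a minimal sketch:

```python
import numpy as np

def log_magnitude_spectrum(I):
    """Log-magnitude of the centered 2D Fourier transform, the standard
    frequency-domain visualization (DC component at the center)."""
    F = np.fft.fftshift(np.fft.fft2(I))
    return np.log1p(np.abs(F))
```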

[Figures low_original, low_freqs: Frieren's frequencies; low frequencies of Frieren]
[Figures high_original, high_freqs: Anya's frequencies; high frequencies of Anya]
[Figure hybrid_freqs: Frierenya's frequencies]

Similarly, I applied the merge process to two more pairs of images.

[Figures einstein_aligned, efros_aligned: $I_1$ (Albert Einstein) and $I_2$ (Alexei Efros)]
2.2.albert_efros
"Albert Efros"

The result seems acceptable, though Prof. Efros' collar aligns with Einstein's chin because of Einstein's large head (note that their eyes are aligned!).

The following is a failed example:

[Figures campanile_aligned, bigben_aligned: Campanile and Big Ben]
2.2.bigbenile
"???????"

To me, it looks like some strange texture on the surface of the Campanile; I can't discern Big Ben in this image.

2.3: Gaussian and Laplacian Stacks

To blend images, we first need to compute their Gaussian and Laplacian stacks. Let the image be $I$; its Gaussian stack is obtained by repeatedly convolving $I$ with a Gaussian filter $G$. A Gaussian stack of $I$ with $N$ levels can be expressed as:

$$\mathcal{I}_G = \{I_{G,i} \mid i = 0, 1, 2, \ldots, N-1\}$$

where

$$I_{G,0} = I, \qquad I_{G,i} = I_{G,i-1} * G$$

In my implementation, the kernel size of the Gaussian filter doubles each time the level increases by one, to capture the image's features at various scales.
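A minimal sketch of the Gaussian stack; here $\sigma$ doubles per level as a stand-in for the doubling kernel size described above.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gaussian_stack(I, N=5, sigma=1.0):
    """Gaussian stack: repeated blurring with no downsampling.
    Level 0 is the original image; the blur scale doubles per level."""
    stack = [I]
    for i in range(1, N):
        stack.append(gaussian_filter(stack[-1], sigma * 2 ** (i - 1)))
    return stack
```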

The Laplacian stack can be derived from the Gaussian stack. It is defined as:

$$\mathcal{I}_L = \{I_{L,i} \mid i = 0, 1, 2, \ldots, N-1\}$$

where

$$I_{L,i} = I_{G,i} - I_{G,i+1} \ \ (i < N-1), \qquad I_{L,N-1} = I_{G,N-1}$$

The original image can be reconstructed by summing the Laplacian stack.

$$\sum_{i=0}^{N-1} I_{L,i} = \left(\sum_{i=0}^{N-2} I_{L,i}\right) + I_{L,N-1} = (I_{G,0} - I_{G,N-1}) + I_{L,N-1} = (I_{G,0} - I_{G,N-1}) + I_{G,N-1} = I_{G,0} = I \tag{1}$$
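
The telescoping identity (1) can be checked directly: a Laplacian stack sums back to the first Gaussian level no matter how the Gaussian stack was built.

```python
import numpy as np

def laplacian_stack(g_stack):
    """Laplacian stack: differences of consecutive Gaussian levels,
    with the last Gaussian level kept as the low-frequency residual."""
    L = [g_stack[i] - g_stack[i + 1] for i in range(len(g_stack) - 1)]
    L.append(g_stack[-1])
    return L
```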

The following are the Laplacian stack examples of an apple and orange:

[Figures 2.3.g_apple, 2.3.l_apple: $\mathcal{I}_G$ and $\mathcal{I}_L$ for apple]

For the first five levels of the Laplacian stack (levels 0 through 4), since each pixel's value lies in $[-1, 1]$, I linearly transformed it into $[0, 1]$ for better visualization (as in section 1.1).

[Figures 2.3.g_orange, 2.3.l_orange: $\mathcal{I}_G$ and $\mathcal{I}_L$ for orange]

We also need a Gaussian stack of a mask to blend two images. In the "oraple" example, a mask that vertically divides the image is needed:

2.3.mask

The Laplacian stack of the blended image can be computed by the formula:

$$I^A_{L,i} = M_{G,i} \times I^B_{L,i} + (1 - M_{G,i}) \times I^C_{L,i} \tag{2}$$

where $I^A$ is the blended image, $I^B$ and $I^C$ are the images to be blended (the apple and orange in this example), $M$ is the mask, and $\times$ is element-wise multiplication.

Finally, $I^A$ can be reconstructed from $\mathcal{I}^A_L$ via formula (1).
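Putting formula (2) together with the stacks gives a compact blending sketch. For simplicity this uses a fixed $\sigma$ per level rather than the doubling schedule described above.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def blend(IB, IC, M, N=5, sigma=2.0):
    """Multiresolution blending of IB and IC with mask M (1 selects IB)."""
    def g_stack(x):
        s = [x]
        for _ in range(1, N):
            s.append(gaussian_filter(s[-1], sigma))
        return s

    def l_stack(g):
        return [g[i] - g[i + 1] for i in range(N - 1)] + [g[-1]]

    GM = g_stack(M.astype(float))                 # Gaussian stack of the mask
    LB, LC = l_stack(g_stack(IB)), l_stack(g_stack(IC))
    # formula (2): mask-weighted combination at every level
    LA = [GM[i] * LB[i] + (1 - GM[i]) * LC[i] for i in range(N)]
    return np.clip(sum(LA), 0.0, 1.0)             # formula (1): sum the stack
```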

Blending process

The first three rows are levels 0, 2, and 4. The last row is the summation of the stack. The columns are the different levels of $M_G \times \mathcal{I}^B_L$, $M_G \times \mathcal{I}^C_L$, and $\mathcal{I}^A_L$ (apple, orange, oraple).

For better visualization, the image in the last level of the Laplacian stack is added to the images in the first three rows. For instance, the top-left image is $M_{G,0} \times I^B_{L,0} + M_{G,N-1} \times I^B_{L,N-1}$. I did this because these images look more reasonable than images normalized by other methods.

The final result is:

2.3.orple

2.4: Multiresolution Blending

As described in section 2.3, we can blend arbitrary image pairs with an appropriate mask.

Mr. Bean

[Figures: real, anime, mask]

2.4.bean_process

2.4.bean_blended

Hell Rock's Kitchen

What makes Hell's Kitchen even more intense? Gordon "The Rock" Ramsay!

[Figures: hell, rock, mask]

2.4.hell_rock_process

2.4.hell_rock_kitchen

Sadly, since their poses are not exactly the same, the white region on the right shoulder is hard to remove with this method.