Sergei Mikhailovich Prokudin-Gorskii was a Russian photographer before color printing was available.
He took exposures of scenes with a red, green, and blue filter. Using image processing techniques,
we can align the glass plate images to produce a color image from the three channels!
Below, you can see three examples of these RGB glass plate negatives.
Here is a comparison of brute force searching for alignment on small images; all images were searched over [-15, 15] pixels.
The left is a naive combination of channels with no alignment.
The middle uses Euclidean distance to score green and red channels compared to the blue channel.
The right uses cropping to improve scoring, making the alignment more robust to noise at the edges.
Euclidean distance is the L2-norm, ie. \( (\sum |x^2|)^{\frac{1}{2}} \).
What happens if you use different p-norms?
Well, turns out not that much. These images were generated with 5% cropping.
Note that the images above are relatively small: the cathedral dimensions are 390 x 341 = 132,990 pixels.
Consider the large image of Emir: its dimensions are 3,702 x 3,209 = 11,879,718.
Not only is this approximately 90 times as many pixels (meaning that each brute-forced alignment does 90 times the work!),
a brute force search needs to search over more alignments.
Before, the small images searched over [-15, 15], a 4% shift in all cardinal directions, which works out to 900 different alignments.
A 4% shift on Emir would be [-150, 150], working out to 90,000 different alignments— 100 times as many!
This is mostly because of the "square-cube" law idea (except for line-square, 1d to 2d). When increasing the sides of the image by 10 times, the
amount of work is increased by 100.
In this case, we're affected even more! 10x to the image side length works out to 10,000x the work
(100x the alignments, each of which searches 100x pixels.)