Computational Cameras @ SIGGRAPH

31 August, 2009
Refocussing a single light-field photograph

Refocussing a single light-field photograph

Last time I blogged about the interesting talk given by Will Wright at this year’s SIGGRAPH conference. Will’s talk was a spell-binding overview of current visualization and entertainment trends, and the role that human psychology and aesthetics plays within it. However, SIGGRAPH also has a more facts-and-numbers side – there actually is an academic conference hiding between all the glitz and glamour. (The one with technical papers.)

Now when I say “conference” I immediately conjure up images of badly dressed nerds in exotic locations, getting all excited about mind-numbingly boring topics, and then getting drunk and trying to hit on the only attractive girl among the several hundred conference-goers. But in SIGGRAPH’s case, at least, the topics are not all that mind-numbingly boring. Because in addition to the academic quality enforced by the reviewers, there is also a strong emphasis on originality, and a “what can this be used for, and does it look cool?” criteria.

There were several very interesting sessions, some of which focused on

  • Realistic simulation of fluid, fire, and computer animation physics
  • Human-computer interaction (Holographic teleconferencing with eye tracking, 3D with focused-ultrasound force-feedback)
  • Smart image editing (Resizing images without visibly cropping or distorting the content, and automatic black-and-white to colour conversion)
  • Making deformable 3D computer meshes, and then doing crazy things to them – like squashing a bunny and ripping a cow in two
  • 3D visualization of scientific data – complete with 3D glasses! (watching the Sun’s erupting surface, or travelling through an Egyptian mummy)

For this blog post, however, I want to talk about the “computational cameras” session. These talks might give you a glimpse into the future of photography…

Invertible motion blur

Deblurring images of a moving vehicle

Deblurring images of a moving vehicle

I always thought it would be quite possible to deblur an image if you knew how your subject moved. For example of you photograped a car driving past while using a slow shutter speed. Turns out it’s not that easy, since you actually lose data you cannot recover. Unless you’re clever, and take more than one photograph with different amounts of blur, and combine them.

Dark flash photography

UV + IR Flash. "Smile"!

UV + IR Flash. "Smile"!

We all know (right?) that using a flash spoils the mood and the lighting, and temporarily blinds the poor people looking at the camera. But at poorly lit venues using natural light only gives horribly noisy images. Solution? Take one photograph with a barely visible ultraviolet+infra-red flash, and paint in the colours using a noisy natural-light photograph. The result looks almost natural!

4D lens for universal depth-of-field

A new type of optimized plenoptic camera lens

A new type of optimized plenoptic camera lens

These researchers used clever frequency-domain mathematics to physically design and build a new lens (by cutting up an expensive Canon L lens and shuffling around the pieces). This Frankenstein’s creation makes a blurry photo (duh), but with some clever processing you can get an in-focus image, at any (or all) focal depths of your choice! The guys (and girl) also showed that “traditional” designs which did the same thing were suboptimal, since you only need a 3D subspace of the 4D image “light field” data. This lens focuses all its energy into this 3D sub-domain, thereby getting better performance than what was previously done. Does this sound like abstract gobbledygook? It actually works!

Bokodes

Bokode: Visual Tags for Camera-based Interaction from a Distance

Bokode: Visual Tags for Camera-based Interaction from a Distance

Getting its name from “Bokeh” + “code”, this is a really brilliant concept, which made me think: “now why couldn’t I think of something as elegant as this!?” Well done, MIT!

The idea is to place a tiny special reflector / LED light in an object. When you now take an out of focus photograph of it, you’ll get a sharp (in-focus) image of a barcode or data matrix superimposed on blurry background scene! The tiny little object will appear large in your view-finder, even over moderate distances. Say hello to ultra-small bar codes!

The authors then went on to show some really cool applications for this technology – e.g. you can use bokodes to accurately track an object in 3D using a single camera, you could use them to replace RFID tags (simpler and cheaper), use bokodes in an interactive classroom to identify audience members, or to get electronic business cards as you drive by in your car.

In conclusion

Innovation in photography – even fundamental innovation – is far from dead. I can’t wait to see what they’ll dream up for next year!


Will Wright @ SIGGRAPH

6 August, 2009
Will Wright

Will Wright

I am writing this post from the floor of SIGGRAPH 2009, this year being held in New Orleans, USA.

Yesterday I had the privilege of attending a keynote talk by Will Wright, the designer behind several landmark games such as SimCity, The Sims, and Spore. He presented an entertaining and panoramic talk entitled “Playing with Perception” which covered topics involving human psychology, visual processing, game design and the interrelatedness of multimedia technology. During his hour behind the microphone he swept through a staggering 270 slides, thereby wildly breaking the golden rule of  “one slide per minute”, but coming from him, it worked. Respect.

I am guessing here, but from his talk I deduced that he is also a hobby photographer, and he continued to dedicate several of his many slides to this topic. Here are a few of the points which caught my attention:

  • New technology fails to have its intented impact where there’s a misunderstanding, or ignorance of, human psychology.
  • Several decades ago everyone expected 3D imaging and displays to be widespread by the turn of the century. However, 2D photographic effects (like shallow depth-of-field) are used to great effect in movies and photography, where it focusses our attention where the director intended. This lack in 3D makes the experience poorer rather than richer. (For a live concert broadcast experience, however, 3D works well, since here we demand to focus our attention freely in an immersive “I’m there” experience.)
  • Visual art (painting) gradually evolved from crude rock painting to photorealism, before in recent times further developing into subjective and emotion-driven abstraction. The same may hold for animation and photography, at least in some contexts.
  • Case in point tilt-shift photography is taking us further away from reality, and yet has immense emotional appeal. A possible reason is that the toy-like appearance of the photographed reality gives us the feeling of being able to manipulate and play with the scene (à la SimCity) – this pleases our senses.
  • Being behind a camera lens gives you the ability to see the world in new ways, and trains your perception. After a while you see creative angles in everyday objects, even when your camera is not with you.
  • With the addition of digital and CG tools, film makers have the ability to create (almost) any conceivable image on screen – the question is therefore no longer “what is possible to create?”, but purely “what should we create?”

Indeed, what should we create? It’s often amazing to see how extremely simple technologies can be enormously immersive. A clear favourite of mine is this music video, which doesn’t make use of any computer animation. Digital effects can greatly enrich our experience, but the art lies in knowing when, and how much, it is needed.

Looking at his professional success it is clear that Will deeply understands the aesthetics of interactive technology. And this is the exactly what I admire in SIGGRAPH  - it brings technical innovation and aesthetics together in a single venue – something rare indeed.

Here’s to innovation, beauty and perception!

PS: Since writing the above blog, I have seen the 3D CG session featuring mind expanding scientific visualisation from NASA Goddard’s space flight centre. Stereoscopic 3D showing the sun’s coronal mass ejections in the UV spectrum (photographed in stereo by the twin SOHO spacecraft), a visualization of deep-ocean currents, seasonal ice cap variations, and much more. This was followed by a biomedical visualization tour of CTs obtained from 3000 year old Egyptian mummies. And then there were the animated shorts of Pixar, followed by an excerpt from U2 – live in 3D.

3D is indeed amazing when used correctly – it is just taking long to mature. When it comes into its own it will blow your mind – wow!


The next killer must-have camera feature

11 July, 2009
The Nikon E2s, a 1.3 Megapixel camera selling for $20,000 in 1995. What's changed since then?

The Nikon E2s, a 1.3 Megapixel camera selling for $20,000 in 1995. What's changed since then?

My housemate just walked into my room asking whether he should get the new iPhone 3Gs. He added “and it has a better camera: 3 megapixel instead of the 2 of the old one”. And this is coming from a very intelligent and technically educated person, which just shows how effectively the marketing guys mess with our minds. Three is better than two if you have a good lens and sufficient sensor size – but not here where probably neither improved since the older model.

In my post called “The Megapixel Race to nowhere” I talked at length about how the camera manufacturers are deceiving us by putting those stickers with “12 Megapixels” on cameras which take pictures which look crappy even when rescaled to a measly 300×300 pixels. Yes, the photo shown in that link was taken with a 12 megapixel camera. And yes, it looks crappy.  Compare that with this photograph taken with a 3.1 megapixel camera using technology of 7 years earlier. There are several reasons why this photograph taken by a new 12 megapixel camera looks so  bad compared to the older one – I won’t go into those reasons here, but suffice to say that the reason for the latter’s superiority lies not in the number of megapixels.

Luckily, it seems that now that most cameras have hit the 10 megapixel mark there has been a bit of a sanity check. The tag lines used to convince us to part with our hard-earned money focus on other features – and some of them are even useful! A killer feature is a feature that changes the way you use a piece of technology, or substantially improves the results you get from it.

Here’s a rough summary of the features which shaped digital camera evolution since the turn of the century:

  • 2000-2005: More megapixels (Yes, with a big enough sensor and good enough lens you did gain from more megapixels, up to a point), and faster operation (very important)
  • 2005-2006: Although it existed for some time, image stabilisation became all the rage, and everyone had to have it. Now widely available across multiple manufacturers. Very useful, too!
  • 2007: Automatic face detection hit the market. This is actually a very impressive technical feat, and useful for happy snappy auto-mode cameras, but not a big deal for people who carefully compose their pictures.
  • 2008: High ISO. Every camera now suddenly bragged about ISO 3200. Without better sensors (as is often the case) you just get an incredibly noisy and useless image. No good.
  • 2008: Live view and video recording on digital SLR cameras. Very useful!
  • 2008-2009: Wide angle. Gone are the days of zoom ranges starting at 35mm equivalent. Everyone now wants 28mm or even wider at the wide end. While some lens manufacturers struggle to give good quality at these wide settings it is very useful and nice to have!
  • 2008-2009: HD Video. Nice.
  • 2009: Waterproof/rugged: There used to be some cameras like these in the old film days, but now several manufacturers offering models which are 5-10m waterproof and can take a few knocks. Useful for the adventurous.

I’m sure the following years will give us some more gimmicks and truly useful features? Here are a few of my guesses:

  • GPS for location-tagging (some cameras already have this, but it’s rare)
  • Higher dynamic range (capturing darker and brighter areas simultaneously) – will be very useful
  • Direct upload to flickr / facebook / youtube? (a bit gimmicky)
  • Large touch-screen operation (also a bit gimmicky – buttons are faster and more reliable)
  • Software-based in-camera correction of lens distortion (already being done by Panasonic and Nikon)
  • Super-high frame rates for super-slow-mo videos (think 1000fps)
  • Software-based image stabilisation
  • In-camera image stacking for lower noise
  • Exotic multispectral, infra-red and ultraviolet photography modes (this one’s for the geeks)
  • Real-time photo stitching to create panoramas (already seen in some models) and creation of 3D models (a bit further off?)

Which ones do you think will make it? What did I miss?

To summarise, it seems like the innovation engine of the camera industry is still cranking out new features, despite the economic turmoil. And this is good for us, the consumers. As a good friend pointed out in his blog: “Innovation is hardly ever about coming up with something completely new. That sometimes does happen, but is a fluke”.  Most of these “killer features” have existed for some time. Heck, electric cars were already driving around in 1830, yes the eighteen-30s. But it took new technology to make it practical.  Things are still changing, and I’m not getting bored yet!


The Megapixel Race to nowhere

17 November, 2008

megapixels

How many of you have seen adverts for digital cameras announcing “12 Megapixel sensor for unprecedented image quality” ? The number of megapixels is often the only thing people enquire about a new camera, and the higher the number, the more impressed they are. That, as well as how much it costs, how it looks, and how many times it can zoom.

While I also think that cost and looks are important, I think the “megapixel” (MP) race has reached ridiculous levels. A few years ago, when 2 MP cameras were the norm, more was indeed better. But that was then. In fact, that shiny new 12 megapixel compact camera you bought will almost certainly take worse pictures than your neighbour’s 6 megapixel digital SLR which he bought 4 years ago. Let me try to shed some light on the situation…

When looking at your photographs (on a computer monitor, high definition plasma TV, or prints), you won’t see the detail-advantage of a 12 MP photograph over a 6 MP photograph because:

  • Your computer monitor can probably display somewhere between 1 and 2 MP
    (1920×1200 is pretty high-res, but still only 2.3MP)
  • An A4-size print on photo paper has less than 9 MP resolution, and 13×18cm less than 4MP

There *are* differences between good and bad photographs, but it is not a function of megapixels:

  • Your shiny National Geographic or Cosmopolitan magazines have limited resolution on a “full page” pic (probably < 5 MP), and yet those photographs look awesome!
  • Pictures on facebook are only about 0.27 MP, and yet you can often see the difference between a good and a bad (e.g. cell phone) camera. This is mostly due to differences in dynamic range and the quality of the lens, not megapixel count!

Having more megapixels is actually often a disadvantage:

  • A camera can only take pixel-level sharp pictures if the lens is able to form a sharp image on the sensor. The higher the pixel density, the sharper the lens needs to be. Very few cameras have lenses which can consistently provide detail to a sensor with 12 or more MP
  • A sensor pixel can only hold a certain amount of charge before “overflowing”. The smaller the pixel, the less charge it can hold. This directly influences dynamic range (see images below)
  • Every pixel needs light. Adding more pixels means each pixel will receive less light. This leads to “noisy” (speckled) images, since the camera needs to amplify the image signal more.
  • The size of the sensor directly influences the price and size and weight of the camera, and therefore small cheap cameras invariably have small sensors. The only way to keep pixel density low is by keeping the number of megapixels reasonable.
Relative sizes of sensors used in most current digital cameras.

Relative sizes of sensors used in most current digital cameras. (click for detail)

DPReview recently started listing the pixel density (MP/square cm) of all the cameras in their database. Because cameras have different size sensors the megapixel count on its own doesn’t tell you how densely they’re packed. Notice the enormous range in numbers! (a high pixel count combined with a low pixel density is best).

Camera Model Price* Pixel Count Pixel Density
Kodak EasyShare C913 110 $ 9.2 MP 37 MP/cm²
Canon PowerShot G9 500 $ 12.1 MP 28 MP/cm²
Panasonic Lumix LX3 400 $ 10.1 MP 24 MP/cm²
Panasonic Lumix G1 800 $ 12.1 MP 5.0 MP/cm²
Canon EOS 50D 1400 $ 15.0 MP 4.5 MP/cm²
Nikon D80 800 $ 10.2 MP 2.7 MP/cm²
Sigma DP1 680 $ 4.6 MP 1.6 MP/cm²
Nikon D3 4500 $ 12.1 MP 1.4 MP/cm²

*prices (including a basic lens for DSLRs) based on Amazon.com quotes for November 2008

“Dynamic Range” is the ratio between the brightest and the darkest values a camera can record at the same time. Large pixels generally have more dynamic range than small pixels. And that is something you notice even on small pictures on facebook. Cell phone cameras are terrible in this regard.

Having an excessive pixel density can lead to images with “blown highlights”, where regions of the photo are overexposed (just white), or underexposed (just black).

well_exposed ldr1
A well exposed photograph taken with sufficient dynamic range
(click for detail)
Blown highlights and underexposed shadows, due to low dynamic range
(click for detail)

Also, since the signal from each pixel is weaker, it needs to be amplified more. Amplifying the signal also amplifies the noise, which leads to random speckles and reduced detail. The noise can be suppressed with “noise reduction”, but this blurs the image and removes even more detail!

Low Noise Image High Image Noise (no noise reduction) Noisy image after "noise reduction"
Low-Noise Image
(click for detail)
Noisy Image
(click for detail)
Blurring due to “noise reduction”
(click for detail)

One fact, however, is that an image file with more pixels will take up more of you hard drive space. So, with your new little camera you might just end up with huge files showing millions and millions of noisy, blurry pixels which can’t cope with the really dark or really bright pictures.

In fact, when comparing the brand new semi-professional Canon 50D (15MP) against its predecessor the 40D (which had 10MP), the DPreview team concluded: “in terms of per-pixel sharpness the 50D cannot quite keep up with the better 10 MP cameras on the market … It appears that Canon has reached the limit of what is sensible, in terms of megapixels … Considering the disadvantages that come with higher pixel densities such as diffraction issues, increased sensitivity towards camera shake, reduced dynamic range, reduced high ISO performance and the need to store, move and process larger amounts of data, one could be forgiven for coming to the conclusion that at this point the megapixel race should probably stop.

Today’s new cameras can take better videos than ever before, have nice large displays, funky colours, are small and light, cost very little and can even detect faces and smiles. And these are all really cool. But we don’t need more megapixels for this!

To conclude: Beyond a certain point more pixels don’t improve picture quality – in fact, often the opposite. Sadly, it’s unlikely that manufacturers will stop the so-called “megapixel race” anytime soon. Why? Because numbers sell.


Moby killed my camera!

9 July, 2008

At Rock Werchter, on Friday 4 July 2008, a thin beam of intense green energy caused my 1-month old camera’s CCD sensor to have a stroke. Yes, dear readers, it’s true! Moby killed my camera. The high-powered(?) stage laser went straight into the lens, and – zap!

Here is what happened:

I thought cameras were designed to survive bright lights, but actions speak louder than words, and lasers shine brighter than than the brightest star.

Now, every picture I take with the camera looks exactly like this:

My camera now only sees this...

My camera now only sees this...

And now with a whacked camera, blasted ears and a stunned mind, I have also realised that this little blog has an existential crisis. In my opinion, successful blogs have a guiding theme or topic of interest – something this one has lacked thus far. That, and content. But now that will change… Over the coming weeks I’ll try to add more of the latter, so that the former might start to emerge. And I even made it a bit more pretty, by adding some eye candy to the to top bar… But then again, it’s only words, words, words. :)

Update: 15/07/2008

Indeed it seems as if things sometimes do go wrong at professional laser shows, as illuminated by the article in New Scientist entitled Party laser ‘blinds’ Russian ravers.
Ravers at the Aquamarine Open Air Festival in Kirzhach, 80 kilometres northeast of Moscow, began seeking medical help days after the show, complaining of eye and vision problems.

“They all have retinal burns, scarring is visible on them. Loss of vision in individual cases is as high as 80%, and regaining it is already impossible,” Kommersant quoted a treating ophthalmologist as saying.

Not cool.