Image Retrieval of First-Person Vision for Pedestrian Navigation in Urban Area

Massive Sensing, Research Index, kameda-lab.org, 2010/01/31, 2010/08/30

Reference

Panasonic DMC-FX37, 28 mm focal length (35 mm film equivalent)
Duration = 12:13.50 [min:sec]

Query

| Video ID | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| Duration | 11:47 | 15:53 | 13:34.500 | 14:38.020 |
| Camera | DMC-FX37 | DMC-FX37 | DMC-FX37 | iPod nano |
| [A] original video: VGA | P1180644.MOV (982MB) | P1150934.MOV (1.3GB) | P1180758.MOV (1.1GB) | IMG_0005.mp4 (281MB) |
| [B] query images: QVGA | t020-query_643-644.avi | t020-query_643-934.avi | t020-query_643-758.avi | t020-query_643-I0005.avi |
| [C] result video (QVGA x 3 + figures) (= Fig-2) | t020-20_643-644.avi (172MB) | t020-20_643-934.avi (231MB) | t020-20_643-758.avi (197MB) | t020-20_643-I0005.avi (205MB) |
| Graph (Fig-1 + extra) / www | Result-1 (4.0MB) | Result-2 (4.1MB) | Result-3 (4.1MB) | Result-4 (3.9MB) |
| Graph (Fig-1 + extra) / pdf | Result-1 (830KB) | Result-2 (1.0MB) | Result-3 (1.0MB) | Result-4 (871KB) |

Result video

Graph

The pair counting method:
In the verification step, this method simply checks the number of pairs. If the number of pairs found is equal to or larger than the threshold, the candidate is accepted as the answer.
The threshold is set so that the answer ratio stays (almost) the same as that of the proposed method.
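
The pair counting baseline can be sketched in a few lines. The following is only an illustrative sketch, not the code used for the experiments: the function names (`accept`, `answer_ratio`, `pick_threshold`) and the per-query pair counts are hypothetical, and the threshold search simply picks the integer whose answer ratio is closest to a given target ratio.

```python
from typing import Sequence


def accept(num_pairs: int, threshold: int) -> bool:
    """Verification step: accept the candidate if enough pairs were found."""
    return num_pairs >= threshold


def answer_ratio(pair_counts: Sequence[int], threshold: int) -> float:
    """Fraction of queries whose top candidate is accepted at this threshold."""
    if not pair_counts:
        return 0.0
    accepted = sum(1 for n in pair_counts if accept(n, threshold))
    return accepted / len(pair_counts)


def pick_threshold(pair_counts: Sequence[int], target_ratio: float) -> int:
    """Pick the threshold whose answer ratio is closest to the target ratio.

    Assumption: the target ratio is the answer ratio measured for the
    proposed method on the same queries. Ties go to the smallest threshold.
    """
    if not pair_counts:
        raise ValueError("pair_counts must not be empty")
    candidates = range(0, max(pair_counts) + 2)
    return min(
        candidates,
        key=lambda t: abs(answer_ratio(pair_counts, t) - target_ratio),
    )


if __name__ == "__main__":
    # Hypothetical pair counts for eight query frames.
    counts = [3, 8, 12, 5, 20, 1, 9, 15]
    threshold = pick_threshold(counts, target_ratio=0.5)
    print(threshold, answer_ratio(counts, threshold))
```

Matching the answer ratio in this way lets the two methods be compared at a similar number of answered frames, so that the differences between them are not just an effect of answering more or fewer queries.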

  1. In the top-left figure, flat segments and vertical jumps appear because of "staying" periods (such as waiting at traffic signals) during the walks, in both the reference and the query. They disappear in the top-right figure because a staying period maps to a single position when plotted by path distance. (In practice, the camera sometimes moves slightly even while waiting at a signal.)
  2. The blue dots in the top-left and top-right figures show how many false-positive top candidates (blue dots far from the red dots) are returned by generic image retrieval alone.
  3. Some sections are unsuccessful, probably because of strong backlight from the sun and a darker sky (which increases blur and loses SURF keypoints). Even so, our approach yields fewer false negatives.
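
The effect described in item 1 can be illustrated with a small sketch. This page does not state how path distance is computed, so the code below assumes, purely for illustration, that a 2D position is available for every frame and takes path distance to be the cumulative length walked so far. The function name `cumulative_path_distance` is hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def cumulative_path_distance(
    positions: Sequence[Tuple[float, float]],
) -> List[float]:
    """Return the distance walked along the path up to each frame.

    Assumption: positions[i] is the (x, y) location of frame i.
    Frames recorded while standing still share the same path distance.
    """
    if not positions:
        return []
    distances = [0.0]
    for (x0, y0), (x1, y1) in zip(positions, positions[1:]):
        step = math.hypot(x1 - x0, y1 - y0)
        distances.append(distances[-1] + step)
    return distances


if __name__ == "__main__":
    # Hypothetical walk: move, wait for three frames, then move again.
    walk = [(0.0, 0.0), (3.0, 4.0), (3.0, 4.0), (3.0, 4.0), (6.0, 8.0)]
    print(cumulative_path_distance(walk))
```

On a time axis the three waiting frames occupy three separate positions, which produces a flat segment or a vertical jump in the plot. On a path-distance axis they collapse to one value, which is why such artifacts vanish in the top-right figure.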

kameda[at]iit.tsukuba.ac.jp, kameda.aa[at]gmail.com