{
  "$type": "site.standard.document",
  "description": "In various examples, a single camera is used to capture two images of a scene from different locations. A trained neural network, taking the two images as inputs, outputs a scene structure map that indicates a ratio of height and depth values for pixel locations associated with the images. This…",
  "path": "/patents/1304115",
  "publishedAt": "2021-11-11T00:00:00.000Z",
  "site": "at://did:plc:oql6ds5vnff4ugar6rruliwd/site.standard.publication/3mn3ohu7oxx5w",
  "tags": [
    "G06K9/00805",
    "NVIDIA Corporation"
  ],
  "textContent": "In various examples, a single camera is used to capture two images of a scene from different locations. A trained neural network, taking the two images as inputs, outputs a scene structure map that indicates a ratio of height and depth values for pixel locations associated with the images. This ratio may indicate the presence of an object above a surface (e.g., road surface) within the scene. Object detection then can be performed on non-zero values or regions within the scene structure map.",
  "title": "OBJECT DETECTION USING PLANAR HOMOGRAPHY AND SELF-SUPERVISED SCENE STRUCTURE UNDERSTANDING"
}