Einstein Platform Services

Object Detection Images and Labeling

In the Object Detection Quick Start, the .zip file with the images and the annotations file is provided for you. To create your own model, you first need to gather and label the training data. Here are some best practices when gathering your own data and labeling your images.

Collect Images

The first step to implementing Einstein Object Detection is deciding which objects you want to identify. After you decide that, it's time to gather training data (images) to create the dataset. Use images that are representative of the images that the model will receive in production.

Training Image Considerations

  • Objects in the images are visible and recognizable.
  • Images are forward-facing and not at an angle.
  • Images are neither too dark nor too bright.
  • Images contain 100-200 or more occurrences (across all images) for each object you want the model to identify. The more occurrences of an object you have, the better the model performs.

Label the Images

After you collect training images, you label objects in those images and specify a bounding box around each object. There are a few different options for image labeling.

CrowdFlower

Use Crowdflower's human-in-the-loop platform to create high-quality training datasets of annotated images. Their platform lets you select and manage the human labelers you need (including your own employees) to meet your quality and cost requirements. Email [email protected] to discuss your labeling project.

SharinPix

Use the SharinPix managed package available on the AppExchange to label your images. Their labeling tool offers team management functionality for self-labeling using your own team or assisted labeling with help from SharinPix labelers. Email Jean-Michel Mougeolle at [email protected] to discuss your labeling project.

Self-Labeling

You can do-it-yourself and self-label your images, as long as the annotations meet the required format.

Labeling and Zip File Format

No matter which method you use to label your images, the labeling content is stored in a comma-separated (csv) file named annotations.csv. The annotations file contains the image file name and the labels and coordinates (in JSON format) for each object in the image. See the Annotations.csv File Format section of Create a Dataset From a Zip File Asynchronously. Here are the first four lines from the annotations.csv file contained in alpine.zip.

image_url,box0,box1,box2,box3,box4,box5,box6,box7
20171030_133845.jpg,"{""height"": 1612, ""y"": 497, ""label"": ""Alpine - Oat Cereal"", ""width"": 1041, ""x"": 548}","{""height"": 1370, ""y"": 571, ""label"": ""Alpine - Oat Cereal"", ""width"": 904, ""x"": 1635}","{""height"": 1553, ""y"": 383, ""label"": ""Alpine - Corn Flakes"", ""width"": 1059, ""x"": 2580}",,,,,
20171030_133911.jpg,"{""height"": 1147, ""y"": 2299, ""label"": ""Alpine - Oat Cereal"", ""width"": 861, ""x"": 374}","{""height"": 1038, ""y"": 2226, ""label"": ""Alpine - Oat Cereal"", ""width"": 752, ""x"": 1263}","{""height"": 1464, ""y"": 709, ""label"": ""Alpine - Bran Cereal"", ""width"": 1056, ""x"": 179}","{""height"": 1470, ""y"": 746, ""label"": ""Alpine - Bran Cereal"", ""width"": 697, ""x"": 2327}","{""height"": 1434, ""y"": 752, ""label"": ""Alpine - Corn Flakes"", ""width"": 831, ""x"": 1312}","{""height"": 1080, ""y"": 2378, ""label"": ""Alpine - Corn Flakes"", ""width"": 965, ""x"": 2059}",,
20171030_133915.jpg,"{""height"": 496, ""y"": 1811, ""label"": ""Alpine - Oat Cereal"", ""width"": 344, ""x"": 922}","{""height"": 100, ""y"": 112.18126888217523, ""label"": ""Alpine - Oat Cereal"", ""width"": 100, ""x"": 100}","{""height"": 517, ""y"": 1753, ""label"": ""Alpine - Oat Cereal"", ""width"": 337, ""x"": 1282}","{""height"": 590, ""y"": 1157, ""label"": ""Alpine - Bran Cereal"", ""width"": 383, ""x"": 889}","{""height"": 587, ""y"": 1157, ""label"": ""Alpine - Bran Cereal"", ""width"": 368, ""x"": 1674}","{""height"": 575, ""y"": 1169, ""label"": ""Alpine - Corn Flakes"", ""width"": 350, ""x"": 1303}","{""height"": 532, ""y"": 1757, ""label"": ""Alpine - Corn Flakes"", ""width"": 365, ""x"": 1629}",

After you create the labels in the annotations.csv file, you package up that file along with the images in a .zip file. The API call to create an object detection dataset uses this .zip file to upload the images and labels. See the Object Detection Datasets section of Create a Dataset From a Zip File Asynchronously.

Updated 3 months ago

Object Detection Images and Labeling


In the Object Detection Quick Start, the .zip file with the images and the annotations file is provided for you. To create your own model, you first need to gather and label the training data. Here are some best practices when gathering your own data and labeling your images.

Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.