It Sucks When Your Robot Vac Records You On The Toilet And Facebook Sees It, Wait What?!
This photo is part of a set of fifteen images obtained by MIT Technology Review that were originally posted to Facebook groups, Discord servers, and other forums by Venezuelan contractors. These contractors worked for Scale AI, a San Francisco-based company that pays workers in poorer countries to label images for the purpose of training machine learning algorithms. Scale AI received over two million images from iRobot as part of a larger effort involving multiple data labeling companies to train algorithms for iRobot’s current and future devices.
According to iRobot, the images shared with the data labeling companies were taken by “special development robots with hardware and software modifications that are not and never were present on iRobot consumer products for purchase.” These development devices moved around the homes of iRobot employees and volunteers recruited by third-party data vendors. These individuals signed agreements allowing the Roombas to collect data, including video, while they were running. iRobot labeled each of these devices with a green “video recording in progress” sticker, but left it up to the employees and volunteers to “remove anything they deem sensitive from any space the robot operates in, including children.”
Nonetheless, enforcing such an agreement can be difficult, especially when relying on contractors located all over the world. A competitor of Scale AI, Hive, also works with contractors, and MIT Technology Review asked the company’s CEO, Kevin Guo, about data labelers sharing training images on social media. The CEO responded, “These are distributed workers … You have to assume that people … ask each other for help. The policy always says that you’re not supposed to, but it’s very hard to control … we don’t think we have the right controls in place given our workforce.”
This problem extends way beyond iRobot and robot vacuum cleaners in general. Data labeling is an entire industry unto itself, and the demand for this service will only grow as machine learning improves and becomes more common. The cameras attached to iRobot’s development devices point at an upward angle, enabling the company to collect training images that include a wide array of household objects beyond just the furniture around which Roombas must navigate. It’s clear that iRobot and many other companies are working to train machine learning algorithms that will power the next generation of “smart” devices with more expansive capabilities. All these companies ask is that you invite their devices into your home and agree to the terms of service.