October 14, 2019 News Magazine

Google is investigating the source of voice data leak, plans to update its privacy policies

Google has responded to a report this week from Belgian public broadcaster VRT NWS, which revealed that contractors were given access to Google Assistant voice recordings, including those which contained sensitive information — like addresses, conversations between parents and children, business calls, and others containing all sorts of private information. As a result of the report, Google says it’s now preparing to investigate and take action against the contractor who leaked this information to the news outlet.

The company, by way of a blog post, explained that it partners with language experts around the world who review and transcribe a “small set of queries” to help Google better understand various languages.

Only around 0.2 percent of all audio snippets are reviewed by language experts, and these snippets are not associated with Google accounts during the review process, the company says. Other background conversations or noises are not supposed to be transcribed.

The leaker had listened to over 1,000 recordings, and found 153 were accidental in nature — meaning, it was clear the user hadn’t intended to ask for Google’s help. In addition, the report found that determining a user’s identity was often possible because the recordings themselves would reveal personal details. Some of the recordings contained highly sensitive information, like “bedroom conversations,” medical inquiries, or people in what appeared to be domestic violence situations, to name a few.

Google defended the transcription process as being a necessary part of providing voice assistant technologies to its international users.

But instead of focusing on its lack of transparency with consumers over who’s really listening to their voice data, Google says it’s going after the leaker themselves.

“[Transcription] is a critical part of the process of building speech technology, and is necessary to creating products like the Google Assistant,” writes David Monsees, Product Manager for Search at Google, in the blog post. “We just learned that one of these language reviewers has violated our data security policies by leaking confidential Dutch audio data. Our Security and Privacy Response teams have been activated on this issue, are investigating, and we will take action. We are conducting a full review of our safeguards in this space to prevent misconduct like this from happening again,” he said.

As voice assistant devices are becoming a more common part of consumers’ everyday lives, there’s increased scrutiny on how tech companies are handline the voice recordings, who’s listening on the other end, what records are being stored, and for how long, among other things.

This is not an issue that only Google is facing.

Earlier this month, Amazon responded to a U.S. senator’s inquiry over how it was handling consumers’ voice records. The inquiry had followed a CNET investigation which discovered Alexa recordings were kept unless manually deleted by users, and that some voice transcripts were never deleted. In addition, a Bloomberg report recently found that Amazon workers and contractors during the review process had access to the recordings, as well as an account number, the user’s first name, and the device’s serial number.

Further, a coalition of consumer privacy groups recently lodged a complaint with the U.S. Federal Trade Commission which claims Amazon Alexa is violating the U.S. Children’s Online Privacy Protection Act (COPPA) by failing to obtain proper consent over the company’s use of the kids’ data.

Neither Amazon nor Google have gone out of their way to alert consumers as to how the voice recordings are being used.

As Wired notes, the Google Home privacy policy doesn’t disclose that Google is using contract labor to review or transcribe audio recordings. The policy also says that data only leaves the device when the wake word is detected. But these leaked recordings indicate that’s clearly not true — the devices accidentally record voice data at times.

The issues around the lack of disclosure and transparency could be yet another signal to U.S. regulators that tech companies aren’t able to make responsible decisions on their own when it comes to consumer data privacy.

The timing of the news isn’t great for Google. According to reports, the U.S. Department of Justice is preparing for a possible antitrust investigation of Google’s business practices, and is watching the company’s behavior closely. Given this increased scrutiny, one would think Google would be going over its privacy policies with a fine-toothed comb — especially in areas that are newly coming under fire, like policies around consumers’ voice data — to ensure that consumers understand how their data is being stored, shared, and used.

Google also notes today that people do have a way to opt-out of having their audio data stored. Users can either turn off audio data storage entirely, or choose to have the data auto-delete every 3 months or every 18 months.

The company also says it will work to better explain how this voice data is used going forward.

“We’re always working to improve how we explain our settings and privacy practices to people, and will be reviewing opportunities to further clarify how data is used to improve speech technology,” said Monsees.


Source: TechCrunch

Tags: in Uncategorized
Banner
Related Posts

Spotify’s app for artists gets a big revamp, adds real-time stats for any song

September 24, 2019

September 24, 2019

Spotify this morning is rolling out an updated version of its app for artists, across both iOS and Android. The...

Phantom Auto raises $13.5M to expand remote driving business to delivery bots and forklifts

April 18, 2019

April 18, 2019

Remote driving startup Phantom Auto has raised $13.5 million of financing in a Series A round led by Bessemer Venture...

Meet TezLab, the Fitbit for Tesla vehicles

June 15, 2019

June 15, 2019

Some of the best real-time insights into Tesla and its global fleet of electric vehicles — outside the confines of...

Microsoft launches a drag-and-drop machine learning tool

May 2, 2019

May 2, 2019

Microsoft today announced three new services that all aim to simplify the process of machine learning. These range from a...

Racial bias observed in hate speech detection algorithm from Google

August 14, 2019

August 14, 2019

Understanding what makes something offensive or hurtful is difficult enough that many people can’t figure it out, let alone AI...

Huawei Honor’s smartphone with a hole-punch display is real

January 22, 2019

January 22, 2019

Honor officially launched the Honor View 20 today in Paris. Honor is Huawei’s sub-brand for mid-range phones. And the View...

Facebook answers how Libra taxes and anti-fraud will work

July 12, 2019

July 12, 2019

Facebook provided TechCrunch with new information on how its cryptocurrency will stay legal amidst allegations from President Trump that Libra...

Photos on iPhone is about to look completely different

June 3, 2019

June 3, 2019

When you need to find a photo on your iPhone, what do you do? There are tons of ways, but...

Microsoft OneDrive Personal Vault rolls out worldwide, launches expandable storage

September 30, 2019

September 30, 2019

Earlier this summer, Microsoft introduced an extra layer of security to its Dropbox competitor, OneDrive. The security features, called OneDrive...

Microsoft’s $399 Azure Kinect AI camera is now shipping in the U.S. and China

July 11, 2019

July 11, 2019

Earlier this year, at MWC, Microsoft announced the return of its Kinect sensor in the form of an AI developer...

Google’s managed hybrid cloud platform is now in beta

February 20, 2019

February 20, 2019

Last July, at its Cloud Next conference, Google announced the Cloud Services Platform, its first real foray into bringing its...

Don’t miss this epic Twitter fight between the IAB’s CEO and actual publishers

August 2, 2019

August 2, 2019

Grab popcorn. As Internet fights go this one deserves your full attention — because the fight is over your attention....

‘This is Your Life in Silicon Valley’: The League founder and CEO Amanda Bradford on modern dating, and whether Bumble is a ‘real’ startup

August 20, 2019

August 20, 2019

Sunil Rajaraman Contributor Sunil Rajaraman is the co-founder of Scripted.com More posts by this contributor ‘This is Your Life in...

Mammoth Biosciences adds the final piece of the CRISPR diagnostics puzzle to its toolkit

March 14, 2019

March 14, 2019

With the announcement today that Mammoth Biosciences has received the exclusive license from the University of California, Berkeley to the...

How I Podcast: The Moth’s Dan Kennedy

April 5, 2019

April 5, 2019

The beauty of podcasting is that anyone can do it. It’s a rare medium that’s nearly as easy to make...

Comments
Leave a Reply

Your email address will not be published. Required fields are marked *