Automatic content recognition

Automatic content recognition (ACR) is an identification technology to recognize content played on a media device or present in a media file. Devices containing ACR support enable users quickly obtain additional information about the content they have just experienced without any user based input or search efforts. For example, developers of the application can then provide personalized complementary content to viewers.[1]

How it works

To start the recognition, a short audio clip is recorded by the device and sent to the identification service. The identification service uses a reference database that stores fingerprints of the works to be identified. The database also contains information about the content and associated information, including complementary media. If the fingerprint of the recorded audio sample is matched, the identification service returns the corresponding metadata to the client.[2]

Fingerprint & Watermarking

Audio based ACR is commonly used in the market. The two leading methodology are acoustic fingerprinting and watermarking. There are alternative approaches that involve a focus on Video fingerprinting but augment the accuracy and scalability with other content recognition solutions running in parallel and in series.

Acoustic fingerprinting generates unique fingerprints from the content itself. Fingerprinting techniques work regardless of content format, codec, bitrate and compression techniques. This makes it possible to use across networks and channels. Therefore, it is widely used for interactive TV, second screen application and content monitoring sectors.[3] Popular apps like Shazam, YouTube, Facebook,[4] Thetake, Wechat and Weibo are using audio fingerprinting methodology to recognize the contented played from a TV and trigger additional features like votes, lottery, topic or purchase.

In contrast to fingerprinting, digital watermarking requires inserting digital tags containing information about the content into the content itself, prior to distribution. For example, a broadcast encoder might insert a watermark every few seconds that could be used to identify to broadcast channel, program id, and time stamp. The watermark is normally inaudible or invisible to the users. Terminal devices like phones or tablets read the watermarks instead of actually recognizing the played content.[5] Watermarking technology is utilized in media protection field to trace where the illegal copies originate.

It is expected by Next/Market Insights that 2.5 billion devices will be integrated with ACR technology to provide synchronized live and on-demand video watching experience.[6]

History

ACR technology was applied in TV content by Shazam in 2011 which captured the attention from TV industries. Shazam was previously a music recognition service which recognizes music from a sound recording. By utilizing its own fingerprint technology to identify live channels and videos, Shazam extended their business for TV. In 2012 DIRECTV partnered with Viggle which is a TV loyalty vendor to provide interactive viewing experience on the second screen. In 2013 LG partnered with Cognitive Networks (later purchased by Vizio and renamed Inscape), an ACR vendor, to provide ACR driven interaction.[7] In 2015 ACR technology is spread widely to even more applications and smart TVs. Now, social applications and TV manufacturers like Facebook, Twitter, Google, Wechat, Weibo, LG, Samsung, Vizio TV have already used ACR technology either developed by themselves or integrated from third party ACR providers. In 2016 there are more applications and mobile OS embedded with automatic content recognition services on the market like Peach, Omusic and Mi OS to enhance the music discover experiences.[8][9][10]

Application

Content identification

ACR technology helps audiences easily retrieve information about the content they watched. For smart TVs and applications with ACR technology embedded the audience can check the name of the song which is played or descriptions of the movie they watched. In addition to that, the identified video and music content can be linked to internet content providers for on-demand viewing, third parties for additional background information, or complementary media.

Content enhancement

Because devices can be "aware" of content being watched or listened to, second screen devices can feed users complementary content beyond what is presented on the primary viewing screen. ACR technology can not only identify the content, but also it can identify the precise location within the content. Thus, additional information can be presented to the user. ACR can enable a variety of interactive features such as polls, coupons, lottery or purchase of goods based on timestamp.[11]

Audience measurement

Real-time audience measurement metrics are now achievable by applying ACR technology into smart TVs, set top boxes and mobile devices such as smart phones and tables. This measurement data is highly essential to quantify audience consumption to set advertising pricing policies.

Broadcast monitoring

For advertisers and content owners, it is vital to know when and where their content has been played. Traditionally agencies or advertisers have to manually audit the presentation. At scale it only can be checked through a statistical sampling method. ACR technology enables automatic monitoring of the content played in TV. Information like the time of play, duration, frequency can be achieved without any manual effort.[12]

The alternative approaches are video based automated content recognition technologies. These are a suite of technologies that revolve around the convergence of video and TV Everywhere[13] which will render the audio and digital watermarking methods incapable of handling the millions of unique streams going out and billions of hours of footage to be reviewed with metadata extracted or enriched in relation to the content in real-time. Where Acoustic fingerprint fails in its sensitivity and reliance on very limited to no distortion of the content[14] and where Digital watermarking relies on intrusive frame by frame production stage imprinting on every piece of content[15] it cannot logically or cost effectively scale to the amount of video being generated - both of which fade in ability to accurately detect presence of reference data.[16]

ACR technology providers

ACR service providers include ACRCloud, Audible Magic, Digimarc, Doreso, Gracenote, Kantar Media, Rovi, Shazam.

See also

References

  1. "Automatic Content Recognition (ACR)". Gartner. Retrieved 24 June 2015.
  2. "Automated content recognition creating content aware ecosystems" (PDF). Civolution. Civolution. Retrieved 24 June 2015.
  3. Brink, Kyle. "A Primer on Automated Content". Viggle. Viggle. Retrieved 22 June 2015.
  4. "Facebook Automatic Content Recognition". Starcom MediaVest Group. SMG. Retrieved 6 July 2015.
  5. Brink, Kyle. "SVP of Product Development". A Primer on Automated Content Recognition. Viggle. Retrieved 22 June 2015.
  6. "ACR Technology To Drive Social TV As It Reaches 2.5 Billion Devices by 2017". NEXT / MARKET INSIGHTS. Retrieved 24 June 2015.
  7. "LG partners with Cognitive Networks to make Smart TVs smarter and more interactive". engadget. Retrieved 23 August 2016.
  8. "ACRCloud Powers Song Recognition For Hottest New Social Network, Peach". Music Industry News Network. Music Industry News Network. Retrieved 3 March 2016.
  9. Victoria, Ho. "Xiaomi will help you name that song you can't stop humming". Mashable. Mashable. Retrieved 3 March 2016.
  10. "ACRCloud Powers The Launch Of Taiwan's First Music/Humming Recognition Service For Omusic". Music Industry News Network. Retrieved 3 March 2016.
  11. Wolf, Michael. "Three Ways Automatic Content Recognition Will Change TV". Forbes. Retrieved 20 June 2015.
  12. "Automated Content Recognition creating content aware ecosystems" (PDF). csimagazine. Civolution. Retrieved 24 June 2015.
  13. Ramachandran, Shalini; Vranica, Suzanne (2015-10-20). "Comcast Seeks to Harness Trove of TV Data". Wall Street Journal. ISSN 0099-9660. Retrieved 2016-05-16.
  14. "Facebook Announces Its ContentID Attempt... Using Audible Magic | Techdirt". Techdirt. Retrieved 2016-05-16.
  15. "The Disadvantages of a Watermark". smallbusiness.chron.com. Retrieved 2016-05-16.
  16. "Facebook Announces Its ContentID Attempt... Using Audible Magic | Techdirt". Techdirt. Retrieved 2016-05-16.

Evolution of Video Content Recognition

This article is issued from Wikipedia - version of the 11/26/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.