VeriSilicon had an excellent year 2011 on promoting WebM, a free, open media format, and WebRTC, a free, open real-time communications framework, through Hantro video semiconductor IP licensing. Most tier-1 semiconductor companies adopted VP8 video format for WebM by integrating Hantro G1 Multi-format Decoder and Hantro H1 Multi-format Encoder, the only commercially available encoding IP supporting both H.264 and VP8, into their chips and platforms.
This enables WebM video in a number of new, exciting devices such as media tablets, smart phones and smart TVs. Under an existing licensing agreement with Google, VeriSilicon has rights to commercially license Hantro G1 Multi-format Decoder and Hantro H1 Multi-format Encoder products to semiconductor companies worldwide and modify the Hantro video IPs to enhance core architecture and add new features.
MPEG announced today it is working on a royalty-free MPEG standard in two tracks, one track based on expired MPEG patents and other royalty free technologies and the other based on a proposal that patent holders grant a royalty-free license to a “constrained baseline profile” of the widely used AVC/h.264 standard.
According to the MPEG press release, “Depending on the progress of the two tracks, MPEG will decide, in 2012, whether to choose IVC or WebVC to become an International Standard.”
Given the heavy hitters that are lining up behind these proposals and the significance of the forum of ISO and the MPEG committee, it is worth looking deeper at the public information that has been released on these proposals.
Finally the recording of my presentation at MAX2011 (Encoding for performance on multiple devices) is available on Adobe TV.
You can also download the pdf version here. My using of FFmpeg for repurposing the streams of FMS has attracted quite a lot of interest and attention. I’m planning to extend the series of article dedicated to FFmpeg and also to transform it in a permanent knowledge-base on FFmpeg and related best-practices.
Rovi Corporation today announced the availability of an updated version of its MainConcept Codec SDK (Software Development Kit). Version 9 of the SDK will enable developers to quickly take advantage of many of the very latest codec and network streaming technologies, providing them with what they need to integrate video and audio compression functionality. Rovi, developer of the industry’s leading video and audio codec library, will demonstrate Codec SDK 9 in action at IBC 2011in Hall 5, Stand A31.
Codec SDK 9 demonstrates Rovi’s breadth of entertainment offerings, which include upstream production technology used to create and distribute entertainment. Rovi’s MainConcept technologies are used by top production and broadcast brands across a wide range of professional industries, from Hollywood authoring, IPTV and streaming media to security, medical and digital signage.
New features of MainConcept SDK include: • H.264/AVC Encoder Wrapper: Enables rapid H.264/AVC implementation by providing a single API wrapper for all H.264/AVC encoders: CUDA, OpenCL, Intel Quick Sync Video, and H.264/AVC Software Encoder • H.264/AVC Smart Rendering: Enables fast editing, trimming, and copying at any frame position without requiring a complete re-encode • Enhanced Network Streaming Support: Features enhanced support for network streaming for Flash RTMP, DLNA Media Server/Renderer/Control Point and SIP IP conferencing • VC-3 Encoding Support: VC-3 codec support including Avid DNxHD compliant MXF multiplexing • Fragmented MP4 Support: Enables segmented streaming of an MP4 file as well as the multiplexing of multiple streams with different bitrates into a single MP4 file, essential for streaming in conditions with fluctuating bandwidths
French encoding specialists ATEME claim that their new EAVC4 codec is capable of delivering 4K and even 8K video over IP using current MPEG-4 compression. While ATEME is also a core member of a consortium researching the High Efficiency Video Codec (HEVC)—the successor to MPEG-4 and designed expressly for higher resolutions—it suggests that Ultra-HDTV video transport can be achieved today.
ATEME claims that the H.264-compliant EAVC4 codec can deliver a 20% gain in efficiency over previous MPEG-4 H.264 AVC codecs.
Until recently, the only way to embed video in web pages was using third-party plugins like Silverlight, Flash and QuickTime. HTML5 has made it easier to include multimedia in your web pages without any plugins, by just using the <video> element. However support for video and audio file formats varies across browsers. HTML5 capable browsers support different video formats.
In video compression algorithms, the quantization parameter is typically adapted based on overall bit usage and relative complexity of the region in the picture. However, such complexity-based rate control algorithms do not emphasize the fact that certain objects, such as human faces, are more sensitive to the overall perceptual video quality. To improve overall perceptual quality, it is important to classify human faces as regions of interest (ROI) and preserve as much detail as possible. The challenge is to develop a reliable algorithm that works in real-time implementations. This article will explore a low-complexity system that can run on a single-core DSP as part of an encoder implementation.
The proposed system is a low-complexity, color-based skin tone detection, which classifies skin tone macroblocks (MB) as ROI MBs and non-skin tone macroblocks as non-ROI MBs (MB; 16 × 16 block of pixels). The classification is based on some empirical thresholds applied to the mean of the color components. The empirical threshold values were defined after an extensive training using material that covers all kinds of races. According to this classification and a modified rate control (RC) that permits smoothly assigning different levels of quality, we can increase visual quality in human faces.