论文学习:Overview of the High Efficiency Video Coding Standard

Souce

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012

Background (2012)

HEVC is currently being prepared as the newest video coding standard of the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group.

因此下文的语境,是2012年标准尚未确定时。

Abstract

The main goal of the HEVC standardization effort is to enable significantly improved compression performance relative to existing standards --in the range of 50% bit-rate redunction for equal perceptual video quality.

HEVC标准化的主要目的,在于提高压缩效率——和现有标准相比,在保持感知视频质量不变的情况下,减小最多50%的比特率。

Ⅰ. INTRODUCTION

HEVC的别名

In ISO/IEC, the HEVC standard will become MPEG-H Part 2 (ISO/IEC 23008-2) and in ITU-T it is likely to become ITU-T Recommendation H.265.

HEVC被ITU-T组织命名为H.265,被ISO/IEC组织命名为MPEG-H Part 2。

WIKIPEDIA: High Efficiency Video Coding (HEVC), also known as H.265 and MPEG-H Part 2, is a video compression standard, one of several potential successors to the widely used AVC (H.264 or MPEG-4 Part 10).

关于ITU-T和ISO这两大标准制定组织的补充:

Video coding standards have evolved primarily through the development of the well-known ITU-T and ISO/IEC standards.

The ITU-T produced H.261 [2] and H.263 [3], ISO/IEC produced MPEG-1 [4] and MPEG-4 Visual [5], and the two organizations jointly produced the H.262/MPEG-2 Video [6] and H.264/MPEG-4 Advanced Video Coding (AVC) [7] standards.

关于HEVC的前任:AVC

The major video coding standard directly preceding the HEVC project was H.264/MPEG-4 AVC, which was initially developed in the period between 1999 and 2003.

在HEVC之前的主要视频编码标准,是H.264/MPEG-4 AVC。AVC标准最早在1999到2003年发展起来。

H.264/MPEG-4 AVC has been an enabling technology for digital video in almost every area that was not previously covered by H.262/MPEG-2 Video and has substantially displaced the older standard within its existing application domains.

AVC是跨时代的。其应用在数字视频中,涵盖了几乎所有H.262/MPEG-2 没能覆盖的领域,并且本质上在其覆盖领域内代替了所有过去的标准。

AVC暴露的问题

However, an increasing diversity of services, the growing popularity of HD video, and the emergence of beyondHD formats (e.g., 4k×2k or 8k×4k resolution) are creating even stronger needs for coding efficiency superior to H.264/MPEG-4 AVC’s capabilities.

服务的日益多样化、高清视频的增多、超高清格式的出现,对编码效率的要求越来越高,并且高于H.264/MPEG-4 AVC的性能。

当stereo 或 multiview capture and display出现时,这种需求愈发明显。

并且,视频通信对当今networks提出了挑战。

HEVC的两大着重点

HEVC不仅要能处理现存的AVC应用的所有设备,而且特别着重于两大要点:

  1. increased video resolution
  2. increased use of parallel processing architectures

原文地址:https://www.cnblogs.com/RyanXing/p/9512948.html

时间: 2024-11-04 18:44:10

论文学习:Overview of the High Efficiency Video Coding Standard的相关文章

cvpr 2016 论文学习 Video object segmentation

Abstract— Video object segmentation, a binary labelling problem is vital in various applications including object tracking, action recognition, video summarization, video editing, object based encoding and video retrieval(检索). This paper presents an

手势跟踪论文学习:Realtime and Robust Hand Tracking from Depth

本文介绍的方法主要是用到了深度信息.提出了一种新的手指检测以及手型初始化的方法.具有很好的鲁棒性.在不使用GPU的情况下,速度就可以达到25FPS.准确率还相当的高.可以说是现在手势识别中最好的方法了. 当前的很多方法要不就是很慢,要不就是使用了GPU,再或者就是需要非常复杂的初始化.而本文提出的方法重新定义了手势的模型,结合了现在通用的两种方法的优势,并且加上一个约束方程,得到了很好的效果. 1.模型的重新定义 每一只手,定义了一个自由度(DOF)为26 的手的模型,其中的6个自由度代表全局的

【 论文学习记录】A flexible and scalable slam system with full 3d motion estimation 一种灵活可扩展带有3D运动估计的slam系统

A flexible and scalable slam system with full 3d motion estimation   论文学习记录 这篇论文系统框架,栅格多阈值,更新同步与伪数据,扫描匹配起始点,协方差交叉融合的思想还是值得借鉴的. 摘要 关注于搜救机器人建图定位与导航的框架性文章. 低计算资源的在线快速获取栅格地图: 结合鲁棒的激光扫描匹配方法和惯性传感器姿态估计系统. 快速地图梯度近似与多分辨率(类似图像金字塔)栅格地图,精确而不需要闭环检测. 介绍 2D slam 子系

论文学习:Deep residual learning for image recognition

目录 I. Overview II. Degradation III. Solution & Deep residual learning IV. Implementation & Shortcut connections Home page https://github.com/KaimingHe/deep-residual-networks TensorFlow实现: https://github.com/tensorpack/tensorpack/tree/master/exampl

mtcnn论文学习

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks 使用多任务级联卷积网络连接人脸检测和对齐 摘要-因为可能有着多种姿势.照明和遮挡(various poses, illuminations and occlusions),在非限制环境下的人脸检测和对齐是很有挑战性的.目前的研究显示了深度学习方法能够在这两个任务上获得优异的性能.在该论文中,我们提出了一个深度级联多任务框架,用来探

Word2vec Parameter Learning Explained 论文学习笔记

原始论文:http://www-personal.umich.edu/~ronxin/pdf/w2vexp.pdf 之前学习Word2vec时,脱离了神经网络,这周补充了下相关知识,打算之后再 看下源码,加深自己的理解.

GoogleNet:inceptionV3论文学习

Rethinking the Inception Architecture for Computer Vision 论文地址:https://arxiv.org/abs/1512.00567 Abstract 介绍了卷积网络在计算机视觉任务中state-of-the-art.分析现在现状,本文通过适当增加计算条件下,通过suitably factorized convolutions 和 aggressive regularization来扩大网络.并说明了取得的成果. 1. Introduct

2015.03.10,学习,论文学习笔记-“水轮机压力脉动的混频幅值置信度分析方法研究”

文献:胡江艺.水轮机压力脉动的混频幅值置信度分析方法研究.水利机械技术,2014(3).17-22 摘要:文章根据IEC试验规程要求,对常见的用于计算压力脉动混频幅值置信度方法进行了分析.结合模型试验结果对压力脉动随机变量的分布进行了讨论.分析了计算方法的实用性,提出了符合要求的计算方法. 关键词:压力脉动 混频幅值 置信度 正态分布 笔记: 国内压力脉动测量采用混频压力脉动相对幅值,即时域内压力脉动的峰峰值与净水头H的幅值,其中峰峰值采用了置信度方法计算.IEC规程定义的混频压力脉动幅值称为宽

Raft论文学习笔记

先附上论文链接  https://pdos.csail.mit.edu/6.824/papers/raft-extended.pdf 最近在自学MIT的6.824分布式课程,找到两个比较好的github:MIT课程<Distributed Systems >学习和翻译 和 https://github.com/chaozh/MIT-6.824-2017 6.824的Lab 2 就是实现Raft算法.Raft是一种分布式一致性算法,提供了和paxos相同的功能和性能,但比paxos要容易理解很多