SYSTEM AND METHOD OF IMPLEMENTING SYNCHRONIZED AUDIO AND VIDEO STREAMING

A server comprises a processing unit configured to interlace audio data packets with video data to form an interlaced audio/video data file having an approximately uniform audio time interval between consecutive audio data packets in the interlaced audio/video data file. The server also comprises an interrupt timer configured to provide periodic interrupt signals. The processing unit is configured to synchronize the start of transmission of each instance of the audio data packets and the video data packets with the periodic interrupt signals from the interrupt timer.
High Definition Video On-Demand (HD VOD) systems allow multiple users to simultaneously watch the same or different HD video content. Such systems can provide functionality such as pause, fast-forward, and fast-rewind. The users' experience is improved when proper lip synchronization is achieved. Lip synchronization refers to matching the visual scene (such as lip movements of a speaker) to the corresponding sound (such as the words spoken by the speaker). Without proper lip synchronization, audio may be heard ahead of or after its corresponding video frame.
To achieve proper lip synchronization, typical HD VOD systems require high-performance hardware on a “per stream” basis to support multiple simultaneous streams. In addition, typical HD VOD systems require accurate, high-precision timers to manage synchronization between audio and video content. Without such high-performance hardware and timers, packets may be dropped or lip synchronization may be lost progressively over time. Lip synchronization may also be lost on every forward and rewind operation. A forward or rewind operation involves jumping ahead or back in the audio and video content files simultaneously.
However, the “per stream” high-performance hardware pipeline, the accurate and precise timers, and the complex algorithms needed to achieve proper lip synchronization on a per-stream basis can increase the cost of HD VOD hardware significantly. Thus, there is a trade-off between the number of simultaneous streams supported and the hardware required to support the streams.
In one embodiment, a server is provided. The server comprises a processing unit configured to interlace audio data packets with video data to form an interlaced audio/video data file having an approximately uniform audio time interval between consecutive audio data packets in the interlaced audio/video data file. The server also comprises an interrupt timer configured to provide periodic interrupt signals. The processing unit is configured to synchronize the start of transmission of each instance of the audio data packets and the video data packets with the periodic interrupt signals from the interrupt timer.
Understanding that the drawings depict only exemplary embodiments and are not therefore to be considered limiting in scope, the exemplary embodiments will be described with additional specificity and detail through the use of the accompanying drawings, in which:
In accordance with common practice, the various described features are not drawn to scale but are drawn to emphasize specific features relevant to the exemplary embodiments.
In the following detailed description, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration specific illustrative embodiments. However, it is to be understood that other embodiments may be utilized and that logical, mechanical, and electrical changes may be made. Furthermore, the method presented in the drawing figures and the specification is not to be construed as limiting the order in which the individual steps may be performed. The following detailed description is, therefore, not to be taken in a limiting sense.
The server 102 includes a memory 110 co-located with the processing unit 112. In some embodiments, the memory 110 is implemented using solid state drives in a Redundant Array of Independent Disks (RAID) configuration. However, it is to be understood that other memory devices can be used in other embodiments. In some embodiments, the memory 110 stores video metadata 120 and corresponding audio metadata 118. Additionally, in some embodiments, video data and audio data are retrieved by the server 102 from an external network 122 coupled to a network port 121 in the server 102. The retrieved audio and video data are then stored on the memory 110 for delivery to the client devices 104-1 . . . 104-N when requested. An external network, as used herein, is a network of components and/or devices that are physically located apart from the device in which the processing unit is located. For example, the external network 122 can be implemented as a local area network, a wide area network, the internet, etc.
The processing unit 112 in server 102 is configured, in this embodiment, to interlace the audio data with the video data to form a single file containing both audio and video data. The single file is also referred to herein as an interlaced audio/video data file. In other embodiments, the audio and video data are preprocessed to form the interlaced audio/video data file, and the preprocessed interlaced audio/video data file is then delivered to the server 102. An approximately uniform audio time interval is used between any two consecutive audio data packets within a video stream. For example, in some embodiments, a minimum uniform time interval is used during interlacing. In some embodiments, where audio timestamps are not supplied, the appropriate audio time interval is configured and the audio location is calculated from the video data timestamp to maintain correct audio time intervals. In this embodiment, the interlaced audio/video data 128 is stored on memory 110. The interlaced audio/video data 128 is used to synchronize transmission of the audio and video data when requested by a client device 104, as described below.
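The interlacing step can be illustrated with a minimal Python sketch. It assumes that video packets carry timestamps in milliseconds, that audio packets carry none, and that audio packet i is therefore assigned the time i times a configured interval. The names `Packet`, `interlace`, and `audio_interval_ms` are hypothetical and do not come from the specification, which does not fix a file layout.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Packet:
    kind: str            # "audio" or "video"
    send_time_ms: float  # position on the shared stream timeline
    payload: bytes


def interlace(video: Sequence[Packet],
              audio_payloads: Sequence[bytes],
              audio_interval_ms: float) -> List[Packet]:
    """Merge video packets with audio packets spaced at a uniform interval.

    Assumption: audio has no timestamps of its own, so audio packet i is
    placed at i * audio_interval_ms and slotted in by video timestamp.
    """
    audio = [Packet("audio", i * audio_interval_ms, p)
             for i, p in enumerate(audio_payloads)]
    # Sort by time; on a tie the audio packet goes first (False < True).
    return sorted(list(video) + audio,
                  key=lambda p: (p.send_time_ms, p.kind != "audio"))


video = [Packet("video", t, b"v") for t in (0, 33, 66, 100)]
merged = interlace(video, [b"a0", b"a1", b"a2"], audio_interval_ms=40)
print([p.kind for p in merged])
```

In this sketch the audio packets land at 0, 40, and 80 ms regardless of how many video packets fall between them, and the merged sequence ends with a video packet.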
An exemplary depiction of interlaced transmission times is shown in
The server 102 also includes a single interrupt timer 116 configured to provide interrupt signals to the processing unit 112. In particular, the interrupt timer 116 provides a repeating interrupt signal at a periodic rate. The processing unit 112 outputs requested interlaced streams to the respective client device 104 that requested the streams based on the timing of the interrupt signals from the single timer 116 and the interlaced audio/video data file 128. The audio and video data are output as separate streams on different IP ports. However, the start of transmission of each instance of the audio data packets and corresponding video data packets is synchronized to the interrupts from the timer 116 based on the interlaced audio/video data file 128.
In particular, the audio and video data packets are pre-sorted in the interlaced audio/video data file 128 such that there is an approximately equal time interval between consecutive audio packets in the interlaced audio/video data file 128. The size of the video and audio data packets may vary. Thus, the intervals are approximately uniform as a function of time, not as a function of the number of video packets between each pair of audio packets. For example, the last packet in the interlaced audio/video data file 128 can be a video packet in some implementations. The approximately equal time interval is based on the time to send the audio packets.
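The distinction between uniformity in time and uniformity in packet count can be checked with a short sketch. The function below is illustrative only; `audio_spacing` and the sample timeline are assumptions introduced here, not part of the described embodiments.

```python
from typing import List, Sequence, Tuple


def audio_spacing(packets: Sequence[Tuple[str, float]]
                  ) -> Tuple[List[float], List[int]]:
    """For each pair of consecutive audio packets, report the time gap and
    the number of video packets that sit between them.

    packets: presorted (kind, send_time_ms) pairs, kind in {"audio", "video"}.
    """
    time_gaps: List[float] = []
    video_counts: List[int] = []
    last_audio_time = None
    videos_since_audio = 0
    for kind, t in packets:
        if kind == "audio":
            if last_audio_time is not None:
                time_gaps.append(t - last_audio_time)
                video_counts.append(videos_since_audio)
            last_audio_time = t
            videos_since_audio = 0
        else:
            videos_since_audio += 1
    return time_gaps, video_counts


# Hypothetical presorted file: video packet sizes (and thus counts) vary.
sample = [("audio", 0), ("video", 0), ("video", 10), ("video", 25),
          ("audio", 40), ("video", 40),
          ("audio", 80), ("video", 80), ("video", 100)]
gaps, counts = audio_spacing(sample)
print(gaps, counts)
```

Here the time gaps between audio packets are equal while the video packet counts between them differ, which is the property the paragraph above describes.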
When the data is to be sent, the presorted audio and video packets in the interlaced audio/video data file 128 are transmitted in order over their respective IP ports such that the audio data packets are sent at the proper time to reduce pauses and jumps in the audio. In particular, the interrupts from timer 116 are used to determine when to send the next packet in the presorted interlaced audio/video data file 128.
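A minimal sketch of interrupt-driven transmission follows, assuming each packet in the presorted file carries a send time and that the interrupt handler is told how much stream time has elapsed. The class `StreamSender`, the `send` callback, and the port numbers are hypothetical; the specification states only that audio and video go out on different IP ports.

```python
from typing import Callable, List, Tuple

Packet = Tuple[str, float, bytes]  # (kind, send_time_ms, payload)

# Hypothetical port numbers, chosen only for illustration.
PORTS = {"audio": 5004, "video": 5006}


class StreamSender:
    """Walks one presorted interlaced file, sending packets as ticks arrive."""

    def __init__(self, interlaced: List[Packet],
                 send: Callable[[int, bytes], None]) -> None:
        self.interlaced = interlaced
        self.send = send      # stand-in for a real socket send
        self.cursor = 0       # index of the next unsent packet

    def on_tick(self, elapsed_ms: float) -> int:
        """Send every unsent packet that is due; return how many were sent."""
        sent = 0
        while self.cursor < len(self.interlaced):
            kind, send_time_ms, payload = self.interlaced[self.cursor]
            if send_time_ms > elapsed_ms:
                break  # the file is presorted, so nothing later is due
            self.send(PORTS[kind], payload)
            self.cursor += 1
            sent += 1
        return sent


log: List[Tuple[int, bytes]] = []
sender = StreamSender(
    [("audio", 0, b"a0"), ("video", 0, b"v0"), ("video", 20, b"v1"),
     ("audio", 40, b"a1"), ("video", 40, b"v2")],
    send=lambda port, payload: log.append((port, payload)),
)
per_tick = [sender.on_tick(t) for t in (0, 20, 40)]  # simulated interrupts
print(per_tick, log)
```

Because a single cursor walks the one presorted file, the audio packet for a given time is always sent on the same tick as the video around it, without a second timer for audio.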
The single precise timer 116 provides interrupts at predetermined intervals (i.e. periodic rate), such as on the order of milliseconds. The periodic rate at which the interrupts are generated can be approximately the same as the interval at which video frames are transmitted (i.e. the video frame rate), in some embodiments. Since the audio data is interlaced with the video data, a separate interrupt timer is not required for delivery of the audio data. In the example shown in
The processing unit 112, in this embodiment, uses the interrupt interval from the single timer 116 to transmit each instance of a plurality of concurrent audio/video streams. Thus, the processing unit 112 synchronizes the frame rate of all the concurrent audio/video streams as shown in
The processing unit 112 includes or functions with software programs, firmware, or other computer readable instructions for carrying out various methods, process tasks, calculations, and control functions used in interlacing the audio/video data and processing requests for controlling interlaced streams. For example, the processing unit 112 can utilize data describing the frame rate of each interlaced audio/video file for simplified timer configuration; pointers to I-Frames within each file to support navigation commands from the clients; and IP address and port information for each client accepting streams.
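One way such per-file data might be organized is sketched below. It assumes the I-Frame pointers are stored as sorted packet indices into the interlaced file and that a navigation command resumes at the last I-Frame at or before the requested position; `FileMetadata`, `timer_period_ms`, and `seek` are hypothetical names, and the actual metadata layout is not given in the specification.

```python
import bisect
from dataclasses import dataclass
from typing import List


@dataclass
class FileMetadata:
    frame_rate_fps: float
    iframe_indices: List[int]  # sorted packet indices of I-Frames (assumed)

    def timer_period_ms(self) -> float:
        """Interrupt period that matches the video frame rate."""
        return 1000.0 / self.frame_rate_fps

    def seek(self, target_index: int) -> int:
        """Last I-Frame at or before target_index (first I-Frame if none)."""
        pos = bisect.bisect_right(self.iframe_indices, target_index) - 1
        return self.iframe_indices[max(pos, 0)]


meta = FileMetadata(frame_rate_fps=25.0, iframe_indices=[0, 50, 100, 150])
print(meta.timer_period_ms(), meta.seek(120))
```

Since audio and video share one interlaced file, moving a single cursor to the returned index repositions both together in this sketch.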
These instructions are typically stored on any appropriate computer readable medium used for storage of computer readable instructions or data structures. The computer readable medium can be implemented as any available media that can be accessed by a general purpose or special purpose computer or processor, or any programmable logic device. Suitable processor-readable media may include storage or memory media such as magnetic or optical media. For example, storage or memory media may include conventional hard disks, Compact Disk—Read Only Memory (CD-ROM), volatile or non-volatile media such as Random Access Memory (RAM) (including, but not limited to, Synchronous Dynamic Random Access Memory (SDRAM), Double Data Rate (DDR) RAM, RAMBUS Dynamic RAM (RDRAM), Static RAM (SRAM), etc.), Read Only Memory (ROM), Electrically Erasable Programmable ROM (EEPROM), and flash memory, etc. For example, in the embodiment shown in
By using a single timer 116 to synchronize the transmission of each stream and interlacing the audio and video data into a single file, the hardware requirements for server 102 to provide multiple video streams on demand are reduced. In particular, the number of timers and/or the performance requirements of the server components are reduced. Thus, system 100 is a cost-effective system for providing a plurality of simultaneous streams on demand. System 100 can be implemented in various environments, such as, but not limited to, internet protocol (IP) networks in public transportation systems, corporate office buildings, hotels, and in-flight video service on commercial airline flights.
At block 304, a repeating interrupt signal from a single interrupt timer is provided at a periodic rate, as described above. In some embodiments, the periodic rate is approximately equal to the video frame rate of the video data in the interlaced audio/video data file. At block 306, a start of transmission of each of a plurality of instances of the audio data packets and video data packets is synchronized to the repeating interrupt signal. In particular, each instance is streamed in response to a request from a respective client device, as described above. For example, each client device, such as a handheld device or laptop computer, can request a video/audio stream from a server. Thus, each instance of the audio data packets and each instance of the corresponding video data packets is synchronized to the interrupt signals from a single interrupt timer regardless of when the request is received.
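The effect of block 306 on requests that arrive at arbitrary times can be sketched as follows. The sketch assumes integer millisecond times and that a stream begins on the first interrupt at or after its request arrives; `start_tick` and `schedule` are hypothetical helpers, not functions named in the specification.

```python
from typing import Dict


def start_tick(request_ms: int, period_ms: int) -> int:
    """Time of the first timer interrupt at or after the request arrives."""
    ticks = -(-request_ms // period_ms)  # ceiling division on integers
    return ticks * period_ms


def schedule(requests: Dict[str, int], period_ms: int) -> Dict[str, int]:
    """Map each client to the interrupt on which its stream starts."""
    return {client: start_tick(t, period_ms)
            for client, t in requests.items()}


# Four hypothetical clients requesting streams at different moments,
# with a 40 ms interrupt period (a 25 frames-per-second video rate).
starts = schedule({"a": 0, "b": 7, "c": 40, "d": 95}, period_ms=40)
print(starts)
```

Every start time is a multiple of the interrupt period, so all concurrent streams advance on the same ticks of the one timer regardless of when each request was received.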
Although specific embodiments have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that any arrangement, which is calculated to achieve the same purpose, may be substituted for the specific embodiments shown. Therefore, it is manifestly intended that this invention be limited only by the claims and the equivalents thereof.