All 0 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
4:2:2, 4:4:4, 4:4:4:4
Aspect Ratio, Display Aspect Ratio, DAR
ATA, EIDE, IDE
AVC, H.264, H264
BluRay, Blu-Ray, BD
Cinema Craft Encoder
Demultiplex, Demultiplexing, Demux, Demuxing
Dolby Digital, AC3, AC-3
DVD Studio Pro
DVD+R DL, DVD+R9
ECC Constraint Length
Final Cut Pro
Garbage In Garbage Out, GIGO
GOP, Group Of Pictures
HD-DVD, HDDVD, HD DVD
Interlace, Interlaced, Interlacing, non-progressive
Inverse Telecine, IVTC
Kodak Picture CD
Linear PCM, LPCM
M2T, m2ts, mts
Main Concept Encoder
Matroska, MKA, MKV, MKS
Motion compensation and prediction
MP3 ID3 Tag, ID3
MPA, MP1, MP2
MPV, M1V, M2V
Multiplex, Multiplexing, Mux, Muxing
Overlay, Hardware Overlay, Video Overlay
Pan & Scan
Physical Sector Number
Pixel Aspect Ratio, PAR
Portable DVD Player
Progressive scan, Progressive, Noninterlaced, Non-Interlaced
QAM, QAM tuner
Region Coding Enhancement, RCE
Time Base Corrector, TBC
VCD Header trick
1080i is the shorthand name for a category of video modes. The number 1080 stands for 1080 lines of vertical resolution, while the letter i stands for interlaced or non-progressive scan. 1080i is considered to be an HDTV video mode. The term usually assumes a widescreen aspect ratio of 16:9, implying a horizontal resolution of 1920 pixels and a frame resolution of 1920 × 1080 or about 2.07 million pixels, and a field resolution of 1920 × 1080 / 2 (because it's interlaced) or about 1.04 million pixels. The field rate (not the frame rate) in hertz can be either implied by the context or specified after the letter i. The two field rates in common use are 50 and 60 Hz, with the former (1080i50) generally being used in traditional PAL and SECAM countries (Europe, Australia, much of Asia, Africa), the latter (1080i60) in traditional NTSC countries (e.g. United States, Canada and Japan). Both variants can be transported by both major digital television formats, ATSC and DVB.
1080p is the shorthand name for a category of video modes. The number 1080 represents 1,080 lines of vertical resolution, while the letter p stands for progressive scan or non-interlaced. 1080p is considered an HDTV video mode. The term usually assumes a widescreen aspect ratio of 16:9, implying a horizontal (display) resolution of 1920 dots across and a frame resolution of 1920 × 1080 or over two million pixels. The frame rate in hertz can be either implied by the context or specified after the letter p (such as 1080p30, meaning 30 frames per second).
Aspect ratio most commonly known as widescreen or letterbox. It is wider than the standard 4:3 aspect ratio. 16:9 supporters state that the wider picture corresponds much better to the human visual field than the almost square 4:3.
The process of transferring 24-frame-per-second film to video by repeating each film frame as two video fields. When 24-fps film is converted via 2:2 pulldown to 25-fps 625/50 PAL video, the film runs 4 percent faster than normal.
The process of converting 24-frame-per-second film to video by repeating one film frame as three fields, then the next film frame as two fields
An uncommon variation of 2-3 pulldown, where the first film frame is repeated for 3 fields instead of two. Most people mean 2:3 pulldown when they say 3:2 pulldown.
The mpeg4 based video format used in mobile terminals, like cell phones.
3ivx is an MPEG-4 toolkit that supports MPEG-4 Video, MPEG-4 Audio and the MP4 File Format.
A Mac program that goes directly from DVD to various video formats including VCD, SVCD, and Divx.
480p is the shorthand name for a video mode. The p stands for progressive scan, i.e. non-interlaced, while the 480 denotes a vertical resolution of 480 lines, usually with a horizontal resolution of 854 pixels and a 16:9 aspect ratio on high-definition television (HDTV), or 640 pixels and 4:3 aspect ratio on standard-definition television (SDTV).
A ratio used to describe the sampling frequency of a digitized signal. The ratio describes luminance as being sampled 4 times at 3.37 MHz, while color is sampled 1 time at 3.37 MHz in each of it's separate parts. DV, DVCAM and DVCPRO25 use 4:1:1 color sampling. Formulated as: Y (luminance) is sampled at 13.5 MHz (or 3.37 x 4), R-Y (color) is sampled at 3.37 MHz (or 3.37 x 1), B-Y (color) is sampled at 3.37 MHz (or 3.37 x 1) equals 4:1:1.
4:2:2, 4:4:4, 4:4:4:4
Put simply 4:4:4 and 4:2:2 terms are descriptions of the sample formats used in digital video. In the early 80's tests were done to determine the sample formats and rates for digital video. The eventual sample structure used for SDI video ended up being 4 times the base sample rate chosen.
The first 4 in the 4:2:2 term is for luminance or the black and white information, and this is where most of the picture detail is. Early tests in television human vision discovered a greater sensitivity to black and white information, while the color is filled in with less detailed areas of the human eye. This means you can reduce the color information and your eye cannot really tell. This is what the 2:2 part of 4:2:2 is for. It means the red and blue channels of the video signal are half the bandwidth of the luminance information. Green is not sent, as you can calculate green from red, blue and luminance information.
This color bandwidth reduction has been used for years in broadcast color television, and in fact the color bandwidth of 4:2:2 is much higher than composite video. This all adds up to 4:2:2 being compatible with black and white or composite television, as the color and luminance information is sent separately, while only 2/3 of the data rate is required for about the same visual quality video.
4:4:4 video is similar, but this time all the color information is sent. RGB computer graphics are really 4:4:4. The 4:4:4:4 format adds a key channel.
Traditional nearly square aspect ratio used for most current analog television screens and IMAX movie theater screens. This aspect ratio will slowly be phased out in favor of the wider, more panoramic and movie-like 16:9 ratio. Video displays using a 4-by-3 ratio display images 4 units wide (horizontal measure) by 3 units tall (vertical measure).
The 4:3 ratio performs fine for television programming, which was designed for it, but it creates problems with movie material originally designed for theater release. The movies are created with a wider, more rectangular aspect ratio (16:9 or wider) in order to create a larger viewing surface and bring the viewer more into the film. On a traditional 4-by-3 aspect ratio display, these movies must be letterboxed or cut down in size (pan & scan).
In contrast to the Stereo sound system and conventional Surround Systems, this sound system offers five separate full band audio signals: Left, middle, right, rear left, rear right. An additional subwoofer (LFE) channel is also provided.
The scanning system of 525 lines per frame and 60 interlaced fields (30 frames) per second. Used by the NTSC television standard.
The scanning system of 625 lines per frame and 50 interlaced fields (25 frames) per second. Used by PAL and SECAM television standards.
720p is the shorthand name for a category of HDTV video modes. The number 720 stands for 720 lines of vertical display resolution, while the letter p stands for progressive scan or non-interlaced.
Advanced audio coder. An audio-encoding standard for MPEG-2 that is not backward-compatible with MPEG-1 audio.
A mac video conversion program. It can easily demux files and has various other functions available.
Macintosh AIFF Resource ( .aif, .aifc, .aiff) Audio Interchange File Format (AIFF) is an audio file format that was developed by Apple Computer. This format may be used to store high-quality sampled audio and musical instrument information.
A distortion (artifact) in the reproduction of digital audio or video that results when the signal frequency is more than twice the sampling frequency. The resolution is insufficient to distinguish between alternate reconstructions of the waveform, thus admitting additional noise that was not present in the original signal.
Process where a “wide” video image (typically in a 16:9 widescreen format) is compressed or squeezed horizontally to fit a more narrow video display standard but expands to full size when played over a wide video display.
Letterboxing an image enables the viewer to see the entire widescreen presentation of a movie as it was intended and as it was shown in the theater. However, in order to fit a “wide” image in a “narrow” television, the wide image must be centered in the screen with black bars above and below it (in order to fit a wide image in a narrow screen, the width must match the width of the narrow display so the height of the image is necessarily less than that of the more narrow and square video display). While this method allows the user to see the entire image as it was meant to be seen (narrow 4:3 aspect ratio television sets normally show pan & scan movies), the image loses some horizontal resolution to the black bars.
While not much can be done about this on standard 4:3 televisions, there are wider 16:9 displays, which can show an entire movie image with no bars thus allowing the picture to fill the screen. To take advantage of this, a movie can be distributed in a squeezed anamorphic format without black bars. On a more square 4:3 television this results in an image which seems tall and pinched with actors looking too narrow and objects distorted. However, when played on a widescreen display, the picture is stretched out to its proper width resulting in a widescreen image with no bars and the maximum possible resolution. This technique is being used primarily with DVDs to provide superior quality video to users of widescreen televisions. DVDs featuring this anamorphic version allow a user to watch the image in a letterboxed or pan & scan format on their traditional more square 4:3 televisions when this is selected while allowing users with widescreen televisions to enjoy the full benefit of their displays. AudioVideo101 - Ultimate Guide
Smoothing or reducing disturbing picture effects. By means of calculation of intermediate values along the sharp edges of types and graphics, these edges can be smoothed out, thus generating a smoother picture. The pixel structure along tilted or bent edges is mixed with the surrounding colors. When creating DVD Menu text, antialiasing must not be used.
Advanced Regional Copy Control Operating Solution. Sony's copy protection scheme for DVD-Video, designed to prevent 1:1 digital copying (ripping).
An unnatural effect not present in the original video or audio, produced by an external agent or action. Artifacts can be caused by many factors, including digital compression, film-to-video transfer, transmission errors, data readout errors, electrical interference, analog signal noise, and analog signal crosstalk. Most artifacts attributed to the digital compression of DVD are in fact from other sources. Digital compression artifacts will always occur in the same place and in the same way. Possible MPEG artifacts are mosquitoes, blocking, and video noise.
Advanced Streaming Format (ASF): This file format stores audio and video information, and it is specially designed to run on networks like the Internet. This file format is a highly flexible and compressed format that contains streaming audio, video, slide shows, and synchronized events. When you use .asf files, content is delivered to you as a continuous flow of data. You no longer have to wait for your audio and video files to fully download before you start to view them. When an Audio Video Interleave (.avi) file is compressed and converted to an .asf file, the file begins playing after only a few seconds. The file can be unlimited in length and can run over Internet bandwidths. Microsoft ASF information - MS Sample
Aspect Ratio, Display Aspect Ratio, DAR
A 4:3 aspect ratio means the horizontal size is a third again wider than the vertical size. Standard television ratio is 4:3 (or 1.33:1). Widescreen DVD and HTDV aspect ratio is 16:9 (or 1.78:1). Common film aspect ratios are 1.85:1 and 2.35:1. Aspect ratios normalized to a height of 1 are often abbreviated by leaving off the :1.
ASPI stands for Advanced SCSI Programming Interface. Originally developed by Adaptec. It is a software layer that enables programs to communicate with SCSI and ATAPI devices(CD and DVD Drives and other storage peripherals). Bart's page about ASPI.
(Audio Still Video) A still picture on a DVD-Audio disc.
Advanced Stream Redirector (ASX): When you use .asx files, you are directed to streaming media content, usually on multimedia Web sites. The .asx files are simple text files that contain server and media information. They are metafiles (a file that provides information about Windows Media files and their presentation) that are similar to Windows Media Redirector (.wvx) files.
If you want to find out more about the streaming content you can open an ASX file in a text editor and often find the filenames of the actual streaming content (often with an .asf extension denoting an ASF file).
ATA, EIDE, IDE
Advanced Technology Attachment or called Parallel ATA is a disk drive implementation that integrates the controller on the disk drive itself. There are several versions of ATA:
ATA, also called IDE.
ATA-2, also called EIDE.
Ultra-ATA, also called Ultra-DMA, ATA-33, and DMA-33.
Advanced Technology Attachment Packet Interface is a interface to support CD Drives and DVD Drives using the computers current ATA(IDE/EIDE) connections. ATA was originally designed for hard drives only, but with help of ATAPI it is possible to connect other devices to the ATA(IDE/EIDE) connections.
The Advanced Television Systems Committee, Inc., is an international, non-profit organization developing voluntary standards for digital television. Specifically, ATSC is working to coordinate television standards among different communications media focusing on digital television, interactive systems, and broadband multimedia communications. ATSC Digital TV Standards include digital high definition television (HDTV), standard definition television (SDTV), data broadcasting, multichannel surround-sound audio, and satellite direct-to-home broadcasting.
UDF file name used for the DVD-Audio directory on a DVD disc volume. DVD-Audio is a separate format from DVD-Video so on a standard DVD-Video is the AUDIO_TS folder empty.
To format video into a form ready to burn onto a recordable disc or to stream onto the Internet. VCD, SVCD and DVD Author is to format video into its standard file structure and also add optional menus, chapters, audio tracks, subtitles, slideshows and much more.
VCD File structure
SVCD File structure
DVD File structure
VCD Authoring guides
SVCD Authoring guides
DVD Authoring guides
AVC, H.264, H264
H.264, MPEG-4 Part 10, or AVC, for Advanced Video Coding, is a digital video codec standard which is noted for achieving very high data compression. It was written by the ITU-T Video Coding Experts Group (VCEG) together with the ISO/IEC Moving Picture Experts Group (MPEG) as the product of a collective partnership effort known as the Joint Video Team (JVT). The ITU-T H.264 standard and the ISO/IEC MPEG-4 Part 10 standard (formally, ISO/IEC 14496-10) are technically identical. The final drafting work on the first version of the standard was completed in May of 2003.
AVCHD (Advanced Video Codec High Definition) is a new high definition recording format introduced by Sony and Panasonic. It can use various storage media, including 8 cm (3") recordable DVD discs, as well as hard disk, and SD and Memory Stick Pro memory cards, and is being positioned to compete with handheld video camera recording formats like HDV and MiniDV. As its name implies, AVCHD uses the MPEG-4 AVC (H.264) video codec. From wikipedia.
Audio Video Interleaved - A multimedia file format for storing sound and moving pictures in RIFF format developed by Microsoft. An AVI file can use different codecs and formats so there is no set format for an AVI file unlike for example standard VCD video which sets a standard for resolution, bitrates, and codecs used. Microsoft AVI information - MS Sample File
One of three picture types used in MPEG video. B pictures are bidirectionally predicted, based on both previous and following pictures. B pictures usually use the least number of bits. B pictures do not propagate coding errors since they are not used as a reference by other pictures.
Belt worn over the right shoulder to support a sword or bugle by the left hip.
A form of compression in which the codec uses information not only from frames that have already been decompressed, but also from frames yet to come. The codec looks in two directions: ahead as well as back. This helps avoid large spikes in data rate caused by scene changes or fast movement, improving image quality.
The .BIN / .CUE CD image format was made popular by the CDRWin software. Afterwards many programs have started supporting or partially supporting it, including: Nero, BlindWrite, CloneCD, FireBurner, vcdimager and cdrdao. The .CUE file contains VCD or SVCD or other data track layout information, while the .BIN file holds the actual data. VCDhelp.com bin/cue pages - afterdawn.com
Bitrate or Bit Rate is the average number of bits that one second of video or audio data will consume. Higher bitrate means bigger file size and generally better video or audio quality while lower bitrate means lower file size but worse video or audio quality. Some bitrate examples in common video and audio files:
MP3 about 128 kbps (kilobits per second)
VCD about 1374 kbps
DVD about 4500 kbps
DV about 25 Mbps (megabits per second).
For a DVD player or drive to identify what kind of disc is loaded, it queries the so called "Book Type Field" found in the lead-in section of each DVD disc. These few bits, commonly referred to as "compatibility bitsettings" tell the drive which low-level format specification does the media conform to, such as DVD-ROM, DVD+R or DVD+RW.
Most DVD players will read a DVD+RW or DVD+R disc without any problems, however a small minority of them report a disc error when a disc is loaded that is not marked as a "DVD-ROM" disc in the compatibility bits. Ususally, these players are physically able to read the disc (since DVD+RW reflectivity is identical to that of a dual layered DVD-Video disc, which all players must be capable of reading), but their compatibility problems are due to different interpretations of these bits in the various firmware versions. In most cases, the problem can be solved by updating the firmware. www.dvdplusrw.org/Article.asp?mid=0&sid=2&aid=42
The block is a matrix of 8x8 elements. They can be 8x8 adjacent luminance or chrominance samples, or the corresponding DCT coefficients.
At the block layer is performed the Discrete Cosine Transform.
Video decoding process at the block layer
Variable length decoding.
The bitstream codewords of the block are decoded to form a vector of quantised DCT coefficient.
The vector elements are put into a two-dimensional array, which is the block, following one of two possible patterns. The pattern is defined by the flag alternate_scan which is set at the picture layer. The scanning purpose is to optimize the entropy coding.
DCT coefficients are converted to their original range of values. See also dct quantisation.
Eventually the IDCT is performed. Now the block elements represent either the image sample (Intra block) or the prediction-error (Non-Intra block).
BluRay, Blu-Ray, BD
A Blu-ray Disc (BD) is a next-generation optical disc format designed for high-density storage of high-definition video and data.
Produce, distribute, or sell without permission or illegally. Dictionary.com.
Bits per second. A unit of data rate
A device that enables a broadband connection to the Internet by using cable television infrastructure. Access speeds vary greatly, with a maximum throughput of 10 megabits per second (Mbps).
This type of video was recorded by someone in a cinema with a camcorder and the audience can sometimes be heard or seen! The picture quality is usually OK but the sound is usually very bad and it is hard to make out speech.
A textual representation of the audio information in a video program. Captions are usually intended for the hearing impaired, and therefore include additional text to identify the person speaking, offscreen sounds, and so on.
Also called Cap or Capping - To capture video or TV/Sattelite signals to disk. This can include firewire capture from DV cameras. VCDhelp Capture Section
Constant Angular Velocity, the disc(CD/DVD) is read/written at a constantly increasing speed.
Constant Bit Rate - the bitrate is the same at any part of a single video or audio stream. VCD standard MPEG video and audio are constant bit rate as are most MP3 standalone audio files. Also see VBR (variable bit rate).
see Cinema Craft Encoder
Compact disc plus graphics. A variation of CD which embeds graphical data in with the audio data, allowing video pictures to be displayed periodically as music is played. Primarily used for karaoke.
Compact disc digital audio. The original music CD format, storing audio information as digital PCM data. Defined by the Red Book standard.
Compact disc interactive. An extension of the CD format designed around a set-top computer that connects to a TV to provide interactive home entertainment, including digital audio and video, video games, and software applications. Defined by the Green Book standard. CD-i Assn.
A type of Enhanced CD format using stamped multisession technology.
CD-ROM extended architecture. A mode 2,multi-session disk where data is on one session and audio/video on another(CD-Extra,Mixed-Mode).
CD Audio Track - audio files that are on CD media. You can play .cda files only from a CD-ROM. Often the CDA tracks are ripped to WAV or MP3 files.
A program that records / burns audio, video, and data files to CD-Rs/CD-RWs. Cdrdao records CD-Rs in disk-at-once (DAO) mode based on a textual description of the CD contents. The program runs on a variety of operating systems including Linux and Windows (in command line mode or via a GUI). cdrdao homepage
A 'Cell' is a small segment of a chapter (or part). It is the smallest resolution at which DVD navigation commands can act (e.g. 'Jump to Cell 3 of Part 4 of Title 2'). Typically one chapter contains one Cell but on complex DVDs it may be useful to have multiple Cells per chapter. dvd.sourceforge.net
A DVD 'Chapter' (somewhat confusingly referred to as a 'Part' in the parlence of DVD authors) is generally a logical segment of a Title such as a scene in a film or one interview in a set of cast interviews. There can be up to 999 Chapters in one DVD Title. dvd.sourceforge.net
The basic "Chroma Bug" manifests itself as streaky or spiky horizontal lines running through the chroma channel, most notably on diagonal edges. As mentioned above, this problem has been around for a long time. It's only just now being noticed largely because one needs a good high-resolution display, such as a front projector and a six foot projection screen, to really see the problem clearly. In addition, the increasingly common use of large progressive displays has really allowed people to get up close to the screen and see every artifact magnified in great detail. Problems that might have gone unnoticed on a 20 inch interlaced TV suddenly hit you in the face. With the advent of relatively high resolution media like DVD, people are starting to compare the video image to the original film image, not to other forms of TV. And suddenly strange problems that people accepted in a TV picture, but would never be allowed on film, look out of place. The Chroma Bug is one of the most visible artifacts around, but because it's specific to MPEG and 4:2:0 encoding, there was nothing written about it until very recently. http://www.hometheaterhifi.com/volume_8_2/dvd-benchmark-special-r ... -2001.html
The Chroma Key process is based on the Luminance key. In a luminance key, everything in the image over (or under) a set brightness level is "keyed"out and replaced by either another image, or a color from a color generator. Also known as Blue Screen Compositing, the
Chroma Key Process was made famous by films such as star wars where spacecraft miniatures were composited onto starfield backgrounds.
Chroma noise affects areas of colour in the image. Instead of being clean, even areas of colour, chroma noise makes colours look grainy due to random noise being inserted into the colour signal. Chroma noise seems to particularly affect blue, although it can potentially be seen in any large expanse of a single colour. Chroma noise is pretty much exclusively an artefact of analogue video processing, and it is very rare to see it in modern, all-digital transfers. Increased MPEG macro-blocking artefacts are a potential side-effect of chroma noise, as the MPEG encoder attempts to encode the extra spurious random noise, leaving less bits for actual picture information.
Cinema Craft Encoder
A very high quality software encoder often refered as CCE .
Though there are different versions of CCE, Cinema Craft SP is the most known and used type http://www.cinemacraft.com/eng/product.html#sp .
Current price on this version is $1995 and can be purchased here :
When encoding MPEG video, a Closed GOP is one that uses no referenced pictures from the previous GOP at the current GOP boundary. For example the GOP is closed when it starts with an I Frame and subsequent B Frames do not rely on I or P frames from the previous GOP. Also see Open GOP.
Constant Linear Velocity, the disc(CD/DVD) is read/written at a constant speed.
An authored disc that won't play, either due to improper authoring, poor media quality, or write error. The name is derived from the disc's uselessness as a DVD/VCD, may as well be used to set drinks on.
An acronym for "compression/deccompression", a codec is an algorithm or specialized computer program that encodes or reduces the number of bytes consumed by large files and programs. Files encoded with a specific codec require the same codec for decoding. Some codecs you may encounter in computer video production are Divx, MPEG-1, MPEG-2, Xivd, DV type 1 and type 2 for video and MP3 for audio.
A DVD-ROM drive capable of reading and writing CD-R and CD-RW media. May also refer to a DVD-R or DVD-RW or DVD+RW drive with the same capability.
A video system containing three separate color component signals, either red/green/blue (RGB) or chroma/color difference (YCbCr, YPbPr, YUV), in analog or digital form. The MPEG-2 encoding system used by DVD is based on color-difference component digital video. Very few televisions have component video inputs.
An analog video signal in which the luma and chroma components are combined (by frequency multiplexing), along with sync and burst. Also called CVBS. Most televisions and VCRs have composite video connectors, which are usually colored yellow.
The process of removing redundancies in digital data to reduce the amount that must be stored or transmitted. Lossless compression removes only enough redundancy so that the original data can be recreated exactly as it was. Lossy compression sacrifices additional data to achieve greater compression.
To change from one form into another. In video obviously it is to change one form of video into another. For example, many people like to convert divx to MPEG, quicktime to AVI, etc. Conversions to a final format is called encoding - an example is AVI to VCD MPEG-1. VCDhelp Convert Section
For noise reduction:
Coring is used to remove fine detail information that does not contribute significantly to the detail of the picture but which adds noise to the image. Imagine the detail information viewed on a scope. About the baseline you'd see primarily the noise information, with the detail extending beyond that. Now imagine that you sliced (or cored) this signal so that only the information above the noise on the baseline came through. You would be left with most of the detail information intact but with much of the noise information removed. The coring adjustment determines how far from the baseline the detail information removed. You want to use just enough coring to reduce the noise in the picture but not so much that the fine detail in the image is affected.
For black level proccessing:
Coring is the process where pixels are evaluated against a threshold value and set to pure black if below. This is a "poor man's" noise reduction in the black areas of an image and is implemented in many digital video cameras and capture chips (like the BT848 family).
To cut away pieces of a video stream without rendering; similiar to cutting a picture with scissors.
Content Scrambling System. In DVD-Video, an encryption scheme designed to protect copyrighted material that resides on a disc by periodically scrambling the data using encryption keys.
China Video Disk - a precursor to SVCD marketed since 1998. Resolutions are 352x480 NTSC, 352x576 PAL, 44.1khz audio (unlike 1/2 D1 DVD that is the same resolution at 48khz audio). Not all players will play CVD (compatible players). CVD Guide
DVHS is a digital recording and playback format for High Definition material. It's based on the existing 1/2" VHS-sized cassettes.
http://www.homecinemachoice.com/articles/frame.html?http://www.ho ... lVHS.shtml
A video resolution standard. In the NTSC system, "Full D1" means 720x480 pixels, and in the PAL and SECAM systems full D1 is 720x576. You also see "cropped D1", which is 704xNN, which is useful because the 8 pixels on either edge of the video frame aren't supposed to contain useful information. Therefore, some programs will prefer the cropped D1 resolution to save bandwidth. Other popular resolutions are often described in terms of D1: the SVCD resolution is 2/3 D1 (480xNN) and 352xNN is 1/2 D1. Occasionally you see SIF somewhat inaccurately described as 1/4 D1. videographica
The Discrete Cosine Transform is performed at the Block layer.
For MPEG-2 (ISO/IEC 13818-2), it's specified in the Annex A to the Recommendation.
The input of the DCT and the output of the inverse transform (IDCT) are 8x8 matrices of 9 bits/element, while the DCT coefficients are represented in 12 bits (-2048:+2047).
DCT is used to remove the spatial correlation existing among adjacent pixels in order to allow a more efficient entropy coding.
As DCT is performed to the 8x8 block, only the correlation inside the block can be removed. Besides, the subdivision of the image in 8x8 blocks for the DCT process is the cause of the typical block artifacts of all the DCT based compression algorithms.
If DCT and IDCT processes were performed with infinite precisions, they would be lossless; unfortunatly they are performed with limited integer precision so that, even without quantisation, they can produce some impairment.
The DCT coefficients have a relationship with spatial frequencies and, given that the different components have different subjective importance, DCT gives an important tool to remove also the subjective redundancy.
The first coefficient (0,0) is related to DC component, the coefficients of the first line are related to purely horizontal spatial frequencies, while those of the first column to purely vertical ones, the other coefficients are related to diagonal components.
A representation of DCT coefficients
The image on the left side is the original one, it has been divided in blocks of 8x8 elements for the DCT; the image on the right consists of 8x8 images, each of them represents a coefficient of the DCT and consists of the values that such coefficient takes in all the blocks the DCT has been performed to.
The relevance of the DC values and of the low frequencies values can be noted.
As MPEG-1 and MPEG-2 adopt hybrid DCT compression algorithm, DCT may be applied not only to sample-values (Intra blocks), but also to prediction error values (Non-Intra blocks). For the latter the DC coeffients are no more so important as for the former, but their relevance is quite similar to the other low frequencies coefficients'.
The process of creating a single frame from the 2 interlaced fields of a video frame. Deinterlacing is used to remove the interlacing artifacts if a still frame is required, or if the video is being used at a different rate than it was created. Extensive Info
Demultiplex, Demultiplexing, Demux, Demuxing
Splitting the video and audio to separate files. Also called "Demux".
Camcorder format which allows you to record digital-quality video onto standard Hi8 or 8mm tape. Most Digital8 camcorders also play back analog Hi8 and 8mm recordings, although they do not record in Hi8 or 8mm. A 120-minute Hi8/8mm tape yields one hour of recording when used with a Digital8 camcorder, giving you essentially the same stunning picture quality as you get with Mini DV (500 lines of horizontal resolution).
DiVA is a powerful MPEG-1/MPEG-2 video converter for Mac OS X 10.2 or later. It uses QuickTime, MPEG, MOV, SMP, AltiVec, YUV, Cocoa, Quartz, XML and other amazingly great acronyms and buzzwords. It's also fast, high quality, and integrates extremely well with 3ivx D4 4.5, allowing it to perform automated 2-pass encoding with 3ivx
DivX™ is a new format for digital video, much like MP3 is a format for digital music. DivX™ is the brand name of a patent-pending video compression technology created by DivXNetworks, Inc., (also known as Project Mayo). The DivX™ codec is based on the MPEG-4 compression standard. This codec is so advanced that it can reduce an MPEG-2 video (the same format used for DVD or Pay-Per-View) to ten percent of its original size. DivX.com.
DivX High Definition brings the Hollywood stars to your living room with just the click of a mouse. Supporting resolutions of up to 720p at bit rates as low as 4Mbps, DivX HD delivers astonishing video at one fifth the bit rate of broadcast HD. Download one of the samples from our HD showcase to experience the stunning video and dazzling audio quality of DivX HD today. http://www.divx.com/hd/
Dolby Digital, AC3, AC-3
Dolby Digital, or AC-3, is the common version containing up to 6 total channels of sound, with 5 channels for normal-range speakers (Right front, Center, Left Front, Right Rear and Left Rear) and one channel for the LFE, or subwoofer. The Dolby Digital format supports Mono and Stereo usages as well.
Colour video was slowly introduced into broadcast. It was therefore necessary to make it compatible with black and white receivers and to design colour receivers or televisions to be able to receive black and white programming as well. In order to accommodate the extra information needed for colour the b&w’s 30 frame/second rate was slowed to 29.97 f/s for colour. Although usually not an issue for non broadcast applications, in broadcast, the small difference between real time (or the wall clock) and the time registered on the video can be problematic. Over a period of 1 hour (SMPTE) the video will be 3.6 seconds or 108 extra frames longer in relation to the wall clock. To overcome this discrepancy drop frame is used.
Drop frame: Every frame :00 & :01 are dropped for each minute change (60 X 2 = 120) except for minutes with 0’s (00:, 10:, 20:, 30:, 40: & 50:) (6 X 2 = 12, 120 - 12 = 108)
Double Sided Dual Layer DVD. See DVD-18.
Data Search Information. Information for Fast Forward/Fast Backward and seamless playback. This is real time control data spread throughout the DVD-Video data stream. Along with PGCI, these packets are part of the 1.00 mbit/sec overhead in video applications (Book B). These packets contain navigation information which makes it possible to search and maintain seamless playback of the Video Object Unit (VOBU). The most important field in this packet is the sector address where the first reference frame of the video object begins. Advanced angle change and presentation timing are included to assist seamless playback.
Digital Subscriber Line - a telephone communication line that uses modulation technology to maximize the amount of data that can be sent over copper wires. DSL is used for broadband and voice connections from telephone switching stations to a subscriber with bitrates similar or slightly less than Cable Modem and greater speeds than ISDN.
Double Sided Single Layer DVD. See DVD-10.
Digital Theater Systems Digital Sound. A product of DTS, Inc., DTS is a multichannel audio compression format similar to Dolby Digital used in DVD-video discs, DVD-audio, 5.1 channel audio CDs, and some movie theaters. DTS differs from Dolby Digital in that it generally uses higher data rates and many have the opinion that DTS is better quality. DTS can only be on a DVD-video disc if accompanied by a Dolby Digital or LPCM track (for North America) or mpeg audio and LPCM (European Community) to ensure compatibility, because DVD players are only required to decode those standards in those regions. DTS Website
Digital TV, the standard for broadcasting picture and sound using digital signals, DTV allows for improvements in both picture and sound quality versus conventional Analog TV.
In USA can DTV be delivered in two basic formats: Standard analog Definition (SDTV) or High Definition (HDTV).
In Europe is DTV deliverd in DVB-formats.
USA DTV information.
Digital Video - video captured to a PC from a digital camcorder, often through Firewire. There are two methods of storing DV video data, referred to in this article as type-1 and type-2. Both are stored usually in AVI files. You should be aware of two salient points regarding these respective types to keep in mind when designing multimedia devices and their respective software drivers and utilities:
Any DV stored as type-1 cannot be used with VfW-based editors.
Microsoft provides DV encoder and decoder filters for DirectShow only, and will not provide support for encoding or decoding DV video data for VfW.
It is important to understand the format used to store video and audio in an AVI file for VfW.
Although an AVI file can have n number of streams, the most common case is one video stream and one audio stream. The stream format headers define the format (including compression) of each stream. The existence of one video stream, one audio stream, or both in an AVI file is a de facto standard for VfW.
A native DV stream, on the other hand, interleaves the video and audio data into a single stream. As stated in the introduction, Microsoft is defining two methods (type-1 and type-2) that developers can use for storing DV data in AVI files. The method chosen by a developer will impact the ease with which the data can then be used with current and future video editing applications.
The native DV interleaved stream that is produced and consumed in I/O with a DV device contains DV compressed video and pulse code modulated (PCM) audio data. This single interleaved stream can be stored in an AVI file as "ivas" stream (for interleaved video/audio stream). Microsoft refers to this format as a type-1 DV AVI file.
Because the type-1 format stores data as a single AVI stream, type-1 DV AVI files are not compatible with VfW. DirectShow, however, easily handles type-1 data streams by routing the streams to a DV Splitter filter that produces a DV-encoded video stream and one or more PCM audio streams for playback or subsequent processing.
Interleaved DV data can also be split into a single video stream and one to four audio streams within an AVI file. Microsoft refers to this format of storing DV data as type-2. This format has the advantage of being backward compatible with VfW, because it contains a standard video stream and at least one standard audio stream.
The type-2 file format requires a small amount of additional processing to split and multiplex the DV stream during the functions of capture and transmit to IEEE 1394 DV devices. MS Info
Free Type 1 to Type 2 Converter from Ulead
A device that can capture analog video like VHS, S-VHS, Hi8 and 8mm and convert it to DV. View our Capture Card list for DV Converters.
Also known as DV Time, a DV or MiniDV camcorder starts recording at 00:00:00. The timecode is drop frame for NTSC (minute differences in timing are made to get the film from 30 fps to 29.97 fps). DV Time is carried on the FireWire cable with the video, audio and Device Control. The biggest problem that arises with DV Time is that it resets to zero if the camera operator does not 'hook' to the end of the previously shot footage (there is an unrecorded gap between recordings).
If dealing with a miniDV or DVCAM tape with 'broken' timecode (that is in many parts), either do a clone copy to another DV tape so that the timecode is created continuously for the entire tape, or name each timecode section as a different tape.
DVB is an acronym for "Digital Video Broadcasting". DVB was set up by the EBU (European Broadcast Union) to set the standards for digital video transmission. They have published these via ETSI (European Telecommunications Standards Institute) who also set standards for devices such as GSM telephones. In fact there are several DVB standards for different transmission media.Some of these are:
DVB-SI Specification for Service Information
DVB-CI Common Interface for conditional access
DVD once stood for digital video disc or digital versatile disc, but now it just stands for DVD -- the next generation of optical disc storage technology. DVD is essentially a bigger, faster CD that can hold cinema-like video, better-than-CD audio, and computer data. DVD Demystified FAQ.
A DVD Player that can store 2 or more DVDs (DVD-Video, DVD-Audio) or CDs (CD Audio, VCD, SVCD...) and play them after each other.
DVD Studio Pro
DVD Studio Pro is a software application from Apple that makes it affordable and simple for non-specialists to encode, author, and write professional-quality DVD-Video discs on their Power Mac G4 desktops. If you're shooting and editing using digital video, you can now retain digital quality and precision all the way to the final product. DVD Studio Pro is in a class by itself because it is the only full-featured DVD authoring tool that is both affordable and easy to use.
DVD+Recordable defines a standard for recordable DVD drives and media defined by the DVDRW Alliance. Often called "plus R", the format is write once (compared to DVD+RW wich can be erased and rewritten). The single sided discs can hold 4,700,000,000 bytes (4.38 Gigabytes at 1024 bytes to the kilobyte) with double sided discs holding twice as much. There are no dual layer single sided recordable discs. This format competes with the DVD Forum DVD-R specification. DVDRhelp DVDR information
DVD+R DL, DVD+R9
DVD+R DL or called DVD+R9 is a Dual Layer writeable DVD+R. The dual layered discs can hold 7.95 GB or around 8 540 000 000 bytes (called DVD-9) and a double sided dual layered disc 15.9 GB or around 17 080 000 000 bytes (called DVD-18).
DVD+RW is a ReWriteable media format of the DVD+R standard.
DVD-10 is a double sided single layer DVD which can fit up to 9.4 GB or 8.7 computer GB. Video DVD, DVD-R/W and DVD+R/W supports this format.
DVD-18 is a double sided dual layer DVD which can fit up to 17 GB or 15.9 computer GB which some commercial video DVDs are using today (a DVD-18 is basicly four pressed plastic DVD-5s pressed together, they are not burned). Video DVD supports this format but DVD-R/W and DVD+R/W does not support this format.
DVD-5 is a single sided single layer DVD that stores up to about 4.7 GB = 4 700 000 000 bytes and that is 4.38 computer GigaBytes where 1 kilobyte is 1024 bytes(4 700 000 000B/1024 = about 4 589 843KB/1024 = about 4485MB/1024 = about 4.38GB) . Video DVD, DVD-R/W and DVD+R/W supports this format. Often referred to as "single sided, single layer". DVDRhelp DVD information
DVD-9 is a single sided dual layer DVD which can fit up to 8.5 GB or 7.95 computer GB which many commercial video DVDs are using today (a DVD-9 is basicly two pressed plastic DVD-5s pressed together, they are not burned). Video DVD supports this format but DVD-R/W and DVD+R/W does not support this format.
DVD-Audio or sometimes called DVD-A is a separate format from DVD-Video. It is a format specifically designed to provide the highest possible audio fidelity capable on DVD. DVD-Audio provides for audio in stereo and in multi-channel surround in a wide range of specifications. In addition to audio, a DVD-Audio disk can contain a limited amount of video, which can be used to display text, such as lyrics or notes. DVD-Audio can only be played on DVD Players with DVD-Audio support (most DVD Players do not support this format). DVD-Audio is currently competing with SACD as the new audio defacto standard. DigitalAudioGuide DVD Audio FAQ
This type of disc is created when MP3 audio files are burned on a DVDR/W disc. Very few MP3 capable standalone DVD players supports DVD-MP3 because most players verify DVDR/W as DVD-Video only (compatability list).
DVD-Recordable defines a standard for recordable DVD drives and media defined by the DVD Forum. Often called "minus R", the format is write once (compared to DVD-RW wich can be erased and rewritten). The single sided discs can hold 4,700,000,000 bytes (4.38 Gigabytes at 1024 bytes to the kilobyte) with double sided discs holding twice as much. This format competes with the DVD+R format. DVDRhelp DVDR information
DVD-R DL or called DVD-R9 is a Dual Layer writeable DVD-R. The dual layered discs can hold 7.95 GB or around 8 540 000 000 bytes (called DVD-9) and a double sided dual layered disc 15.9 GB or around 17 080 000 000 bytes (called DVD-18).
A recordable format supported by the DVD Forum. It has superior recording features but it is not compatible with most DVD-ROM drives or DVD Video players. It works well when set up like a removable hard disk.
DVD-RW is a ReWriteable media format of the DVD-R standard.
This is SVCD authored video on a DVDR/W. The DVD standard does not support the SVCD resolution but it may work anyway if the audio has been resampled to 48 khz like the DVD-VCD. Read more here about DVD-SVCD discs.
A DVD Player and a TV in same unit.
Basically this is VCD content authored on a DVDR/W. DVD supports the VCD resolution but the audio has to be resampled to 48 khz. Read more here how to make such a disc.
A DVD Player and VHS Video Recorder in same unit. DVD-VHS Combo DVD Players.
DVD-Video is the video element of the DVD format. DVD Demystified DVD-Video Features.
The format is known as "DVD dash VR" or DVD-VR. Its actually known from the DVD specification as DVD-RTRW for real-time read/write. One thing that makes it different from the "DVD Video" standard which is used by professionally repliacted DVD's is that the indexing for DVD-VR is forward only. This allows the "writing" of the disc image immediately without having to come up with some cludge to try to write a fully index'd DVD Video file. Burnworld DVD-VR Info
A term used to cover both the DVD-R and DVD+R standards in one word.
DVI stands for (D)igital (V)ideo (I)nterface. DVI is a new form of video interface technology made to maximize the quality of flat panel LCD monitors and high- end video graphics cards. It is a replacement for the P&D Plug & Display standard, and a step up from the digital-only DFP format for older flat panels. DVI is becoming increasingly popular with video card manufacturers, and most cards purchased include both a VGA and a DVI output port.
In addition to being used as the new computer interface, DVI is also coming out as the digital transfer method of choice for HDTV, EDTV, Plasma Display, and other ultra-high-end video displays for TV, movies, and DVDs. Likewise, even a few of the top-end DVD players are now featuring DVI outputs in addition to the high-quality analog Component Video. Don't expect to throw away all your old video cables just yet, but keep an eye out for DVI availability in the future.
DVR-MS (Microsoft Digital Video Recording) is a proprietary video and audio file format, developed by Microsoft. Video is encoded using the MPEG-2 standard and audio using MPEG-1 Layer II or Dolby Digital AC-3 (ATSC A/52). The format extends these standards by including metadata about the content and digital rights management.
ECC Constraint Length
The number of sectors that are interleaved to combat bursty error characteristics of discs. 16 sectors are interleaved in DVD. Interleaving takes advantage of typical disc defects such as scratch marks by spreading the error over a larger data area, thereby increasing the chance that the error correction codes can conceal the error.
When films are transferred to video in preparation for DVD encoding, they are commonly run through digital processes that attempt to clean up the picture. These processes include noise reduction (DVNR) and image enhancement. Enhancement increases contrast (similar to the effect of the "sharpen" or "unsharp mask" filters in PhotoShop), but can tend to overdo areas of transition between light and dark or different colors, causing a "chiseled" look or a ringing effect like the haloes you see around street lights when driving in the rain. Video noise reduction is a good thing, when done well, since it can remove scratches, spots, and other defects from the original film. Enhancement, which is rarely done well, is a bad thing. The video may look sharper and clearer to the casual observer, but fine tonal details of the original picture are altered and lost.
Enhanced data services. Additional information in NTSC line such as a time signal.
The output of an MPEG video encoder is a video elementary stream and the output of an audio encoder is an audio elementary stream. Before being multiplexed video and audio elementary streams are packetized to form the Video PES and the Audio PES.
PES Packet structure
The packet length is variable:
packet start-code prefix (3bytes)
stream identifier (1 byte)
PES packet length (2 bytes)
optional PES HEADER (variable length)
stuffing bytes (FF) (variable length)
PES packet data bytes
The PES packets are the input of Program Stream and Transport Stream.
Encoding is the process of changing data from one form into another according to a set of rules specifiec by a codec. The data is usually a file containing audio, video or still image. Often the encoding is done to make a file compatible with specific hardware (such as a DVD Player) or to compress or reduce the space the data occupies.
Common video encoding methods are DivX, MPEG-1, MPEG-2 and MPEG-4. A common audio encoding method is MP3 although many others exist including MPEG1 audio, DTS, and Dolby Digital.
In entropy coding, also called variable length coding or Huffmann coding, the more likely values are associated to shorter codewords, while less likely values are associated to longer codewords.
Consequently, known the statistics of the event to code, provided that such statistics are representative enough, it is possible to code such event with an average number of bits lower than in fixed length coding.
So a variable number of bits is produced in the time unity, but many applications (transmission usually) need Constant Bit Rate. In this case the decoder needs a buffer, where it stores the received bits at constant bit rate and from which it gets the bits to decode at variable bit rate. The encoder must have a similar buffer for transmission and must take into account the decoder buffer size and buffer fillness so that neither overflow nor underflow occur at the decoder buffer. This is the purpose of Annex C to Specification ISO/IEC 13818-2 entitled Video buffering verifier.
The tool available to control the bit-rate is the changing of the quantiser_scale, but the rate control algorithm isn't specified and it's a responsability of the encoder.
Error Correction on optical media
ECC = Reed-Solomon Error correction Code , ECC corrects errors on the fly by rewriting the blocks within the same track.
Errors are inevitable but by means of robust error correction systems, CD and DVD can have uncorrectable error rates as low as that specified for computers, i.e., 10-12 (one uncorrectable error in one trillion). Audio applications do not require this degree of accuracy.
Sources of error: Include dropouts from the media (oxide wear, fingerprints, scratches), signal degradation (reflection, intersymbol interference, impedance mismatches, RF interference).
Measures of error: The burst length is the maximum number of adjacent erroneous bits that can be fully corrected. The bit-error rate BER is the number of error bits per total bits. Optical disk systems can handle BERs of 1:100000 to 1:10000. The block error rate BLER is the rate of block or frames per second having at least one incorrect bit. The burst error length BEL is the number of consecutive blocks in error.
Methods of correction: Goal is to introduce redundancy to permit validity checking and error detection, error correction code ECC to replace errors with calculated valid data, and error concealment to substitute approximate data for uncorrectable invalid data. Redundancy includes repeating the data, adding single-bit parity bits (to check if odd or even), checksums (e.g., weighted checksums computed modulo 11), and cyclic redundancy check code CRCC.
CRCC uses a parity check word obtained by dividing a k-bit data block by a fixed number (generation polynomial g) and appended to the data block to creat the transmission polynomial v. When the data u is received, it is divided by the same g, and the result subtracted from the original checksum to yield the syndrome c: a zero sydrome indicates no error. Error correction can be accomplished using mathematical manipulation and modulo arithmetic ... Polynomial notation is the standard terminology in the field: e.g., the fixed number 1001011 (MSB leading) is represented as 1x26 + 0x25 + 0x24 + 1x23 + 0x22 + 1x21 + 1x20 or 26 + 23+ 21 + 20. CRCC is typically used as an error pointer and other methods are used for correction.
Error correction techniques employ block codes having row and column parity (CRCC are a subclass of linear block codes), convolutional or recurrent codes (which introduce a delay), and interleaving including cross-interleaving.
Reed-Solomon R-S codes (Irving Reed and Gustave Solomon 1960) employ polynomials derived from Galois fields to encode and decode block data. They are a subclass of q-ary BCH codes which are a subclass of Hamming codes . They are especially effective in correcting burst errors and are widely used in audio, CD, DAT, DVD, direct broadcast satellite, and other apps. Cross-Interleave Reed-Solomon Code CIRC is used in CDs. It includes the use of C2 then C1 encoders (C1 then C2 on decoding). The C1 level of CIRC is meant to correct small, random errors. The C2 level corrects larger errors and burst errors. Interleaving is used between the C2 (28,24) and C1 (32,2 encoders and deinterleaving is needed on decoding. (28, 24) means 28*8 bits are output for the original 24*8 bit input and the final output is 32 8-bit words of which 8 are for "parity" and 24 are actual data. The cross-interleaving stores one C2 word in 28 different blocks spanning a distance of 109 blocks using delay lines etc., crossing the data array in two directions (thus "cross"). With audio CDs, CIRC can correct burst errors up to 3874 consecutive erroneous bits or symbols (2.5 mm track length) and can well conceal 13,282 error bits (8.7 mm) and marginally conceal 15,500 bits. The CD standard requires a block error rate BLER [the number of data blocks that have any bad symbols at the initial C1 error correction stage] of less than 220 per second averaged over 10 seconds (50 would be typical). There are 7350 blocks/sec on a CD (a block or frame, derived from 24x8=192 bits input data, is 32x8=256 bits output to modulator). The resulting CD data rate = 1.4112 Mbps (input data rate, not including parity bits added by CIRC and EFM).so the maximum Redbook BLER of 220/sec (averaged over 10 sec) allows 3% of the blocks to be erroneous. E12 is the rate of single symbol errors at the C2 encoder, which are correctable. E22 expresses the rate of double symbol errors at the C2 encoder--these are the worst but still correctable errors [the first number is always the number of errors and the second number is always the decoder level]. E32 errors are triple bit errors at C2 and are uncorrectable and require interpolation--they should not appear in a new CD and are unacceptable in a CD-ROM. Other measures of error are the E11, E21, E31. The burst error count BST combines E21 & E22 and expresses the number of consecutive C2 block errors that occur in excess of a threshold value such as 7. A new CD might typically have a raw bit error rate of 1E-5 to 1E-6, BLER = 5, E11 = 5, E22 = 0 and E31 = 0 and should never have E32 uncorrectable errors. Digital audio data can be copied with high reliability.
Error concealment includes interpolation (may be low or high order, zero order simply holds the last good value) and muting.
Credits: "Principles of Digital Audio", by Ken C. Pohlmann
A video clip, often short, used at the end of a disc. Some common extros are "change Disc" clips for multidisk volumes or credits. On a VCD you should ensure the length of the clip is at least 4 second for compliance with the specification.
Short for Frequently Asked Questions - a good place to search for information before posting on the forums. VCDhelp FAQ
A swiss amy knife of MAC video conversion. It has numerous presets, plus the ability to customize your settings to create video to your exact specifications. A few examples of the presets include .mov -> VCD, .mov -> SVCD, .mov ->divx.
In an interlaced video a field is 1/2 of a complete picture (Frame) consisting of the even or odd scanlines in the frame. Usually each field is labeled ie. Field A and Field B. When working with interlaced video it is important to note the Field Order (if Field A comes before Field B in the video stream) especially when encoding. If you notice flicker after encoding you will want to change the field order in the encoding template and reencode.
Filter: To manipulate a video stream to achieve a desired effect. This can include, but is not limited to, re-sizing, noise reduction, de-interlacing, softening, sharpening, and noise reduction. Video filters are available in many commercial editing packages and some free editors including Virtualdub and AVIsynth.
Final Cut Pro
Final Cut Pro 3 is a comprehensive and innovative editing solution with unprecedented flexibility featuring award-winning editing capabilities, precise color correction tools, and built-in compositing and effects. Revolutionary out-of-the-box G4 real-time effects deliver real-time playback without rendering or additional PCI hardware1. The new OfflineRT format yields over 40 minutes of footage per gigabyte, providing a highly efficient editing workflow. Designed for ease of use, with tools that can expertly handle virtually any video format, Final Cut Pro 3 is the complete solution for professional video editing. With a comprehensive set of award winning features that rival expensive proprietary editing systems, it’s no wonder why Final Cut Pro is quickly becoming the digital editing choice for professional editors and filmmakers worldwide.
FireWire is a fast peripheral interconnect standard capable of transfer speeds up to 400 Mbs. It works well for multimedia peripherals such as DV (Digital Video) cameras and other high-speed devices like the latest hard disk drives, CD/DVD burners and printers. Apple FireWire information.
The firmware of a device is the program code that is permanently stored in the device's memory. It contains all the necessary software routines to make the device fully functional. New updated firmware is sometimes distributed for dvd players, cd/dvd writers and many other computer devices to add features or fix bugs.
The first video track that a DVD player will play when a DVD is inserted. Usually a special short video clip, such as Dolby Digital logos, FBI warnings, and company logos. Often accompanied by a PUO.
Flash Video is the name of a file format used to deliver video over the Internet using Adobe Flash Player. Flash Video files contain video bit streams which are a variant of the H.263 video standard, Flash Player 8 and newer revisions support the playback of On2 TrueMotion VP6 video bit streams, Flash Player 9 Update 3 includes support for H.264 video standard which is even more computationally demanding, but offers significantly better quality/bitrate ratio.
A four character code that uniquely identifies a video data stream format. A video player will look up the FourCC code then look for the codec associated to the code in order to play the associated video stream. This idea was used in the IFF multimedia format developed by Electronic Arts for the Amiga in the early 1980s. This file format was copied by Apple (who called it AIFF) and Microsoft (RIFF).
Almost Definitive FourCC List
Frames per second. A measure of the rate at which pictures are shown for a motion video image. In NTSC and PAL video, each frame is made up of two interlaced fields.
A set of scanlines in video to make a complete picture. If the video is interlaced the frame consists of both of the interlaced fields (half frames). If the video is progressive the the frame is made up of one continuous scan from top to bottom. The number of scanlines vary in a frame depending on the TV system used. PAL50 uses 625 scan lines, NTSC60 (US) 525.
A frame is one picture but depending on the encoding scheme it may not be a complete picture (I-Frame) but dependent on frames before or after the current frame (P-Frame, B-Frame).
The process of creating a direct video "link" from one application to another. For example a video editor application to standalone mpeg encoder so you don't need a plugin or create a temporary video file.
Garbage In Garbage Out, GIGO
Garbage In, Garbage Out or sometimes called Crap In, Crap Out is a computer term describing the fact that the output data is only as good as the input data. It means basicly the same as a video term, the output video and audio quality can only be as good as the source video and audio quality.
GOP, Group Of Pictures
A Group Of Pictures (GOP) consists of all the pictures that follow a GOP header before another GOP header.
The GOP layer allows random access because the first picture after the GOP header is an Intra picture that means that it doesn't need any reference to any other picture.
The GOP layer is optional, i.e. it's not mandatory to put any GOP header in the bitstream.
In the header there is also the timecode of the first picture of the GOP to be displayed.
The decoding process, as the GOP header is immediately followed by an Intra picture, can begin at that point of the bitstream. Anyway it's possible that some B pictures, following such I_picture in the bitstream, have references coming from the previous GOP and can't be correctly decoded.
In this case the GOP is called an Open GOP because some references from the previous GOP exist; if a random access to such a GOP is performed, some B_pictures shouldn't be displayed .
A GOP is called a Closed GOP when either there are no B_pictures immediately following the first I_picture or such B_pictures haven't any references coming from the previous GOP (in this case a GOP header flag must be set).
In the "coding people" language the GOP length is the period (often expressed in frames) by which an Intra frame occurs. It must be noticed that such a value cannot be found in the bitstream and it is unnecessary to the decoding process. Furthermore it isn't specified any fixed period for the Intra frame. As the presence of the Intra frames is quite important for many applications, it is the encoder that has to provide them, while the decoder has only to work with all the valid bitstreams.
A GUI (usually pronounced GOO-ee) is a Graphical User Interface to a computer. As you read this, you are looking at the GUI or graphical user interface of your particular Web browser. The term came into existence because the first interactive user interfaces to computers were not graphical; they were text-and-keyboard oriented and usually consisted of commands you had to remember and computer responses that were infamously brief. The command interface of the DOS operating system (which you can still get to from your Windows operating system) is an example of the typical user-computer interface before GUIs arrived. An intermediate step in user interfaces between the command line interface and the GUI was the non-graphical menu-based interface, which let you interact by using a mouse rather than by having to type in keyboard commands. http://searchwebservices.techtarget.com/sDefinition/0,,sid26_gci213989,00.html
An MPEG-2 video encoding mode in which half the horizontal resolution is sampled (352x480 for NTSC, 352x576 for PAL). See also D1
HD-DVD, HDDVD, HD DVD
HD DVD (High Density DVD or High Definition DVD) is a next-generation optical disc format designed for high-density storage of high-definition video and data.
High-bandwidth Digital Content Protection (HDCP) is a form of digital copy protection developed by Intel Corporation to prevent copying of digital audio and video content as it travels across DVI or HDMI, even if such copying would be permitted by fair use laws.
HDMI (High-Definition Multimedia Interface) is the first industry-supported, uncompressed, all-digital audio/video interface. HDMI provides an interface between any audio/video source, such as a set-top box, DVD player, and A/V receiver and an audio and/or video monitor, such as a digital television (DTV).
HDMI supports standard, enhanced, or high-definition video, plus multi-channel digital audio on a single cable. It transmits all ATSC HDTV standards and supports 8-channel digital audio, with bandwidth to spare to accommodate future enhancements and requirements.
High Definition TV is high-resolution digital television combined with Dolby Digital surround sound (AC-3). HDTV is the highest DTV resolution in the new set of standards. This combination creates a stunning image with stunning sound. HDTV requires new production and transmission equipment at the HDTV stations, as well as new television equipment for reception by the consumer. The higher resolution picture is the main selling point for HDTV. Imagine 720 or 1080 lines of resolution compared to the 525 lines people are used to in the United States (or the 625 lines in Europe) -- it's a huge difference!
Of the 18 DTV formats, six are HDTV formats, five of which are based on progressive scanning and one on interlaced scanning. Of the remaining formats, eight are SDTV (four wide-screen formats with 16:9 aspect ratios, and four conventional formats with 4:3 aspect ratios), and the remaining four are video graphics array (VGA) formats. Stations are free to choose which formats to broadcast.
The formats used in HDTV are:
720p - 1280x720 pixels progressive
1080i - 1920x1080 pixels interlaced
1080p - 1920x1080 pixels progressive
High Definition Video (HDV) is a video format designed to record compressed HDTV video on standard DV media (DV or MiniDV cassette tape).
Analog camcorder format which allows you to record video with 400 lines of resolution onto Hi8 tape, or 240 lines of resolution onto standard 8mm tape. Hi8 tapes can get up to 2hours in SP and 4hours in LP modes. Most Hi8 tapes will work in Digital8 camcorders but typically only can record 1 hour of Digital video.
High-quality Video Compact Disc. Developed by the Video CD Consortium (Philips, Sony, Matsushita and JVC) as a successor to VCD. Evolved into SVCD.
Home Theater Personal Computer, a computer designed to be used as a media center for digital home entertainment such as Movies, Music, Television, Games. http://www.htpcnews.com/
A fast, lossless Win32 video codec developed by Ben Rudiak-Gould. "Lossless" means that the output from the decompressor is bit-for-bit identical with the original input to the compressor. "Fast" means a compression throughput of up to 38 megabytes per second on a 416 MHz Celeron. Huffyuv is intended to replace uncompressed YUV as a video capture format. It is fast enough to compress full-resolution CCIR 601 video (720 x 480 x 30fps) in real time as it is captured. Huffyuv also supports lossless compression of RGB data, so it can be used for the output of programs like VirtualDub. Creator's Site
HVD is an Asian standard of advanced high-definition technology originally developed in China by AMLogic Inc., for high-definition video. The format resolutions support 720p, 1080i, or 1080p on version 1 discs. Version 2 of the format added high-resolution beyond the standard fare of HD for use on non-TV monitors that support higher resolutions, up to 1880p.
An I frame is encoded as a single image, with no reference to any past or future frames. Often video editing programs can only cut MPEG-1 or MPEG-2 encoded video on an I frame since B frames and P frames depend on other frames for encoding information.
Intraframe MPEG. An unofficial variation of MPEG video encoding that uses only intraframe compression. I-MPEG is used by DV equipment.
The Sony term for IEEE1394 or Firewire
Apple's easy DVD creator. iDVD allows users an easy interface to create and burn their video, photos, or data to the dvd format. It allows users to create motion menus and chapter points simply by clicking the mouse. iDVD is like the little brother of DVD Studio Pro.
The standard name for Firewire
IMG is an image/iso of a DVD, CD, Floppy. Burn it to a DVD or CD with DVD Decrypter or extract the content with Isobuster or mount it as a virtual DVD/CD unit with Daemon Tools.
Interlace, Interlaced, Interlacing, non-progressive
Each frame of a video picture is scanned twice. Firstly, all the odd lines are broadcast, then all the even lines are broadcast. Each set of odd/even lines is known as a field. Two fields therefore make up a frame. The point of doing this is to reduce flicker, and not increase bandwidth.
A video clip, often short, used at the start of a disc. Some common intros are THX, DTS, Dolby Digital sound bites with graphics or a countdown like the old movies. On a VCD you should ensure the length of the clip is at least 4 second for compliance with the specification.
Inverse Telecine, IVTC
Inverse telecine (IVTC) is when a codec takes a 29.97 frames per second interlaced NTSC video that has gone through the telecine process and reconstructs the original 24 frames per second progressive FILM video.
Besides the standards organization, this is a CD/DVD image format somewhat similar to a BIN/CUE image fileset, but the one single .ISO file contains both: the data and the CD/DVD layout information. These types of images can be burned with several CD /DVDburning programs.
An ISO 9660 file system is a standard CD-ROM file system that allows you to read the same CD-ROM whether you're on a PC, Mac, or other major computer platform. The standard, issued in 1988, was written by an industry group named High Sierra. Almost all computers with CD-ROM drives can read files from an ISO 9660 file system.
There are several specification levels. In Level 1, file names must be in the 8.3 format (no more than eight characters in the name, no more than three characters in the suffix) and in capital letters. Directory names can be no longer than eight characters. There can be no more than eight nested directory levels. Level 2 and 3 specifications allow file names up to 32 characters long.
Joliet, an extension to ISO 9660 from Microsoft, allows the use of Unicode characters in file names (needed for international users) and file names up to 64 characters in length. whatis ISO
An extension to ISO 9660, the specification for the file system (including file names) for the content on a compact disc (CD); it allows file names up to 64 characters in length (including spaces) and the use of Unicode characters in file names (sometimes needed for internationalization). Written by Microsoft, Joliet is fully supported in Windows 95 and later Windows operating systems (except Windows NT prior to its version 4). In operating systems (such as Windows 3.1) that support only eight-character file names, a longer file name on the CD is truncated into an eight-character name using a tilde (~) followed by a unique number as the last characters in the name. whatis microsoft
Kodak Picture CD
Kodak Picture CD is a CD that contains your pictures in JPEG format(.jpg) along with software that lets you view, enhance, share, and print your pictures from your computer. Some standalone DVD Players supports this format also, but then only for viewing. This format will also work on DVD Players that supports "JPEG file viewing" but you may lose some Kodak Picture CD specific features. Kodak Picture CD.
KVCD is a modification to the standard MPEG-1 and MPEG-2 GOP structure and Quantization Matrix. It allows you to put more than 120 minutes of video on a single 80 minute CD-R/CD-RW. The KVCDx3 template creates 528x480 (NTSC) and 528x576 (PAL) MPEG-1 variable bit rate video, from 64Kbps to 3,000Kbps. One of the other templates uses 352x240 (NTSC) or 352x288 (PAL), allowing up to ~360 minutes on a single 80 minute CD-R. You must burn the KVCD MPEG files as non-standard VCD or non-standard SVCD (depends on your player) with Nero or VCDEasy. The KDVD version of KVCD allows up to 6 hours Full D-1 720x480 on one DVD, or about 10 hours at Half D-1 352x480. Because KVCD and KDVD are not recognized "formats", the MPEG files created may or may not playback in your standalone DVD player. More info at kvcd.org
The process or form of video where black horizontal mattes are added to the top and bottom of the display area in order to create a frame in which to display video using an aspect ratio different than that of the display. The letterbox method preserves the entire video picture, as opposed to pan & scan. DVD-Video players can automatically letterbox a widescreen picture for display on a standard 4:3 TV.
Linear PCM, LPCM
Linear PCM (LPCM) is an uncompressed audio format that is similar to CD audio, but with higher sampling frequencies and quantisations. LPCM offers up to 8 channels of 48kHz or 96kHz sampling frequency and 16, 20 or 24 bits per sample but not all at the same time. These values compare with 44.1kHz and 16 bits as used for CD audio. The maximum bit rate is 6.144 Mb/s, which is much higher than Dolby Digital or MPEG-2 coding. LPCM offers high quality (similar to DVD-Audio) but its high data rate leaves little bandwidth for video on a DVD video disc. Disctronics DVD-Video audio
Compression techniques that allow the original data to be recreated without loss. Contrast with lossy compression.
In the DVD+RW Video format, video can be encoded with a variable bit-rate (VBR). Because the writing process takes place at a constant bit rate, the writing process needs to be suspended and continued frequently. Normally, this would result in a linking loss, making the disc incompatible with read-only devices like DVD Video players and DVD-ROM drives. With DVD+RW it is possible to perform lossless linking, i.e. to suspend and continue the writing process without linking loss. This feature makes the format very efficient and suitable for random write in data as well as video applications.
Compression techniques that achieve very high compression ratios by permanently removing data while preserving as much significant information as possible. Lossy compression includes perceptual coding techniques that attempt to limit the data loss to that which is least likely to be noticed by human perception.
M2T, m2ts, mts
The .mts, .m2t and .m2ts are a container file format for multiplexing audio, video and other streams. It is based on the MPEG transport stream container and is also known as BDAV MPEG-2 transport stream. This format is commonly used for high definition video on Blu-ray Disc and AVCHD and HD Camcorders. It usually contain AVC/H264 or MPEG2 video.
An .m3u file is a special type of metafile playlist that is used with MP3 files that have an .mp3 file extension. The .m3u file includes information about the location of the .m3u file on the computer and the properties of the file. An .m3u file is similar to the ASX playlist files.
If an error message occurs on play then the MP3 files may have been moved or deleted.
A MAC program used to convert .ac3 files to .aiff or .mp3 files.
A macroblock is a portion of image that consists of 16x16 picture elements (pixels or pels).
At the macroblock layer motion compensation and prediction are performed and it's possible to change the quantisation step.
It must be noticed that, if the picture is an interlaced frame picture, the odd lines of the macroblock belong to the first field and the even lines to the second field.
Video decoding process at the macroblock layer
Decode the macroblock mode and the possible quantiser_scale_code.
If it's an Intra macroblock:
Decode the blocks which the macroblock consists of.
If it's a Non-Intra macroblock:
Decode the prediction mode and the motion vectors.
Produce the suitable prediction for the macroblock.
Decode the blocks which the macroblock consists of, obtaining the prediction errors values.
Add the prediction errors values to the prediction.
An analog video copy protection scheme that alters the unseen part of a video signal such that a VCR or other macrovision enabled device may not record the video signal properly. There are several types:
- Automatic Gain Control
- 2-line color stripe
- 4-line color stripe
Symptoms of this include picture fading in and out or color banding of the signal.
A VCD player available for the Mac OS X platform
Main Concept Encoder
A very good mid range (in cost) mpeg encoder. Often found as the encoding engine in other products (Vegas Video and Adobe Premiere are some such applications that Main Concept).
Main Concept also makes other video applications such as a compositing program and soon to be released full featured video editing program and low end video editor.
Main Concept also has a very high quality DV codec
Matroska, MKA, MKV, MKS
A new video and audio container format similiar to AVI but with several new features like support for OGG audio, Variable Framerate Video.
Matroska File Formats:
.mkv : Generally video files, as well those containing audio ( movies ) or video only
.mka : audio only files, can contain any supported audio compresion format, such as MP2, MP3, Vorbis, AAC, AC3, DTS, PCM and soon MPC ( musepack )
.mks : a so called 'elementary' matroska stream containing any subtitles stream
For more info, see http://www.matroska.org
The MICROMV cassette is the smallest type of camcorder tape to date — nearly 70% smaller than already tiny Mini DV tapes — so it's no surprise that all MICROMV camcorders feature an incredibly compact form factor. MICROMV cassettes feature a built-in memory chip for conveniences like custom title storage and index thumbnails for easy access to specific scenes.
Mini DV is a video cassette designed for use in MiniDV digital camcorders. The picture quality of digital video (DV) recorded on a Mini DV cassette is basically identical or better to the quality of DV recorded on a Hi8 or 8mm cassette by a Digital8 camcorder. Mini DV can have up to 530 lines of video resolution for some camcorder models. However, Mini DV tapes are smaller which allows for smaller camcorders. Mini DV tapes are available in lengths of 30 and 60 minutes (plus, recording in LP mode lets you extend total recording time with a 60-minute tape to 90 minutes).
miniDVD is a DVD video written onto a CD-R(W) instead of a DVD disc. miniDVD is also sometimes called cDVD. A miniDVD only fits about 15 minutes of DVD quality video on a 650 MB CD-R(W). Not many DVD players will play miniDVD - see the DVD Players miniDVD list for compatible players. DVDRhelp.com miniDVD page
Moving JPEG. A moving image which is made by storing each frame of a moving picture sequence in JPEG compression, then decompressing and displaying each frame at rapid speed to show the moving picture.M-JPEG does not use interframe coding as MPEG does. Sometimes called Motion JPEG.
Meridian Lossless Packing. A lossless compression technique (used by DVD-Audio) that removes redundancy from PCM audio signals to achieve a compression ratio of about 2:1 while allowing the signal to be perfectly recreated by the MLP decoder.
Motion compensation and prediction
Motion compensation and prediction are performed at the macroblock layer.
The goal of motion compensation is to provide a good prediction for the macroblock. Actually, in the macroblocks where prediction is applied, the DCT is performed to the prediction errors instead of to the image samples and more the prediction errors are low and more the entropy coding is effective. Therefore, with good predictions it's possible to have low bit rate and good quality.
In nearly still pictures it's quite easy to have very good predictions using the pixels just in the same position of those to predict, but in motion pictures it's necessary to take movements into account.
Both the coder and the decoder have two frame-memories where they store the decoded pictures used as references.
Where do the predictions come from ?
It depends on the kind of the picture.
Predictions for a P_picture
If it's a frame picture they may come from the previous I or P frame.
If it's a field picture they may come from the two I or P fields coded most recently.
Predictions for a B_picture
They may come from the previous (in display order) I or P frame as from the next (in display order) I or P frame and there may be an interpolation between predictions coming from both directions.
See also the sequence layer.
What's a motion vector ?
A motion vector is a bi-dimensional pointer that tell the decoder how much left/right and up/down, from the position of the macroblock, is located the prediction macroblock in the reference frame or field.
Motion vectors have an half-pel resolution, that means that an interpolation process is necessary to get the prediction (for MPEG-1 is also possible the simpler pel resolution selected at the picture layer). It must be noticed that the same motion vector is applied both to luminance and, after being scaled, to chrominance.
What's motion estimation ?
Motion estimation is the process, perfomed by the coder, that should find the motion vector pointing to the best prediction macroblock in a reference frame or field.
The goodness of a prediction macroblock is in general evaluated minimizing a cost function that may be the absolute error or the mean squared error. The cost function is applied to the macroblock (or a part of it), such technique is called block matching and is the most used in video coding. In general every possible prediction in a given range is evaluated, so we speak about full search. Unfortunatly the computation's complexity is proportional to the search area and can be quite heavy, and on the other hand the search area has to be wide enough to include every movement.
It must be noticed that the capability to perform a good motion estimation is a key point for the quality of a coder.
The prediction menu for frame pictures.
A single motion vector for the whole macroblock. It's used when movements between the two fields are unsignificant. It's the only possible choice for progressive images.
Two different motion vectors. One for the samples belonging to the first field and one for those belonging to the second field. It's used when movements between the two fields are important.
The prediction menu for field pictures.
A single motion vector for the whole macroblock. It's used when the whole area has a single movement.
Two different motion vectors. One for the top half and one for the bottom half of the macroblock. It's used when the macroblock includes different objects with different movements and the macroblock area is to large to have a good prediction.
In video encoding, the process of analyzing previous or future frames to identify blocks that have not changed or have only changed location. Motion vectors are then stored in place of the blocks. This is very computation-intensive and can cause visual artifacts when subject to errors.
Mount Rainier is a standard that provides background formatting and defect management for storage on CD-RW and DVD+RW. This makes rewritable discs far easier to use and allows the replacement of the floppy. In the near future, native Operating System support for Mount Rainier is available. http://www.licensing.philips.com/information/mtr/gi/
QuickTime Content (.mov, .qt) - a file format developed by Apple Computer to create, edit, publish, and view multimedia files. QuickTime supports video, animation, graphics, 3D and virtual reality (VR). Sample MOV file
MP3 is an acronym for MPEG-1 (or MPEG-2) Layer 3 audio encoding (it is not an acronym for MPEG3). MP3 is a popular compression format used for audio files on computers and portable devices.
The compression in MP3 works on the basis of a "psychoacoustic model" which means that parts of the audio that human ears cannot detect are discarded by the encoder. Although this is a LOSSY process, it can yield very high quality audio files are relatively high compression rates.
A typical MP3 file encoded at 128 kbit/s (12:1 compression) is near CD quality.
MP3 audio is increasingly being used in video production coupled with various MPEG-4 video codecs like divx. The audio may be encoded with a constant or variable bitrate.
MP3 ID3 Tag, ID3
An MP3 ID3 Tag is information stored at the end of an MP3 file. The tag can contain information about the Title/Songname, Artist, Album, Year, Comment, and Genre in version 1 and also Track in version 1.1. A proposed Version 2 is out which would be extendable to include more information and picture(s). ID3.org
MP4 is a new container format, a container format allows you to combine different multimedia streams into one single file. Multimedia containers are for example the well known AVI, MPEG , Matroska, OGM.
MP4 is the global file extension for the official container format defined in the MPEG-4 standard. MP4 is streamable and supports all kinds of multimedia content, multiple audio-, video-, subtitlestreams, pictures, variable-framerates, -bitrates, -samplerates...) and advanced content like 2D and 3D animated graphics, user interactivity, DVD-like menus.
doom9 MP4 FAQ
MPA, MP1, MP2
Shorthand for MPEG Audio elemantary stream(no video). Also called MP2 for MPEG Audio Layer2 but MP2 could also be MPEG2 Audio.
MPEG Audio is a family of generic standards for low bit-rate coding
low complexity, good for consumer recording
high efficiency with medium complexity, good for professional recording and for broadcast
high complexity and high efficiency, suitable for very low bit-rates application
Input and output signals
sampling frequencies: 32, 44.1 and 48 kHz
sample representation: PCM max. 24 bits
from 32 Kbps to:
448 kbps for Layer I
384 kbps for Layer II
320 kbps for Layer III
masking threshold computation (based on the perceptive model) on a 1152 samples window (384 for Layer I)
signal subdivision in 32 subbands
subband bit allocation according to the masking threshold
MPEG-1: high quality digital audio coding
MPEG1 audio provides:
variable compression factor
different complessity layers
compact disc quality at compression factor ~6
possible use in many applications (broadcasting, telecom, recording, multi-media, etc..)
Compact Disc Quality
The reproduced signal quality must be subjectively indistinguishable from the quality obtained with a 16 bits PCM system sampled at 44.1 kHz (compact disc) in the majority of normal use programs.
MPEG-2: the multi-channel extension
MPEG2 audio extension allows:
compact disc quality
environment sonority reconstruction
sonorous sources location
additional freq. 16000Hz, 22050Hz and 24000Hz
MPEG-Multichannel does not only support 5.1 audio, but also 7.1 audio
additional frequencies added: 8000Hz, 11025Hz and 12000Hz
Layer II multi-channel characteristics
MPEG-Multichannel does not only support 5.1 audio, but also 7.1 audio
sounds that don't contribute to the source location can be transmitted by any channel
an adaptive predictor is used to reduce the inter-channel redundancy
the multi-channel extension channels are predicted from the two base channels
the prediction coefficients can be computed and transmitted at each frame
dynamic channel switching
the perceptive coding
The studies about the human ear sensitivity to the different frequencies and the different masking effects yielded a perceptive model on which the audio perceptive coding is based.
The static masking
The minimum perceived sonorous pressure, called threshold pressure, depends on several factors, as the source direction, the presence of other sounds, the kind of sound and especially the frequency. To exploit the static masking the threshold pressure must be known for each frequency. It must be noticed that the threshold is subjective and has a statistical value.
The dynamic masking
The dynamic masking is the effect of the threshold pressure increase because of the presence of another sound.
When the masking sound and the masked one occur at the same time. The masking effect depends on the intensity and frequency of both sounds.
Non simultaneous masking
It's possible that the masking sound occurs before (sometimes after) the masked one. Actually the masking effect can last more than the sound by which it's caused.
The noise allocation
Because of all the masking effects the human ear is able to perceive only a part of the sound spectrum. The perceptive model allows the computation of the masking thresold for a defined samples set.
In the perceptive coding the bit-allocation is performed looking at the Signal to Mask Ratio (SMR) obtainable. In such a way bits aren't wasted representing sounds that wouln't be perceived.
The help of Giorgio Dimino was essential for providing the information.
An ISO/IEC (International Organization for Standardization/ International Electrotechnical Commission) standard for medium quality and medium bitrate video and audio compression. It allows video to be compressed by the ratios in the range of 50:1 to 100:1, depending on image sequence type and desired quality. The encoded data rate is targeted at 1.5Mb/s - this was a reasonable transfer rate of a double-speed CD-ROM player (including audio and video). VHS-quality playback is expected from this level of compression. The Motion Picture Expert Group (MPEG) also established the MPEG-2 standard for high-quality video playback at a higher data rates. MPEG-1 is used in encoding video for VCD. MPEG FAQ
An encoding standard designed as an extension of the MPEG-1 international standard for digital compression of audio and video signals. MPEG-1 was designed to code progressively scanned video at bit rates up to about 1.5 Mbit/s for applications such as CD-i. MPEG-2 is directed at broadcast formats at higher data rates; it provides increased support for efficiently coding interlaced video, supports a wide range of bit rates and provides for multichannel surround sound coding such as PCM, Dolby Digital, DTS and MPEG audio.
A proposed variant of the MPEG video and audio compression algorithm and file format. MPEG-3 was intended as an extension of MPEG-2 to cater for HDTV but was eventually merged into MPEG-2.
MPEG-3 should not be confused with MP3 which is MPEG-1 layer 3 popularly used for audio encoding.
An ISO/IEC standard 14496 developed by the Moving Picture Experts Group (MPEG), the committee that also developed MPEG-1 and MPEG-2. These standards made interactive video on CD-ROM, DVD and Digital Television possible. MPEG-4 is the result of another international effort involving hundreds of researchers and engineers from all over the world. MPEG-4 was finalized in October 1998 and became an International Standard in 1999. The fully backward compatible extensions under the title of MPEG-4 Version 2 were frozen at the end of 1999, to acquire the formal International Standard Status early in 2000. Several extensions were added since and work on some specific work-items is still in progress.
MPEG-4 builds on the proven success of three fields:
Interactive graphics applications (synthetic content)
Interactive multimedia (World Wide Web, distribution of and access to content)
More information about MPEG-4 can be found at MPEG’s home page
MPEG-7 is an ISO/IEC standard developed by the Moving Picture Experts Group. MPEG-7, formally named “Multimedia Content Description Interface”, is a standard for describing the multimedia content data that supports some degree of interpretation of the information’s meaning, Unlike previous MPEG standards aimed at encoding, MPEG-7 is not aimed at any one application in particular; rather, the elements that MPEG-7 standardizes support as broad a range of applications as possible. MPEG-7 Alliance
A Mac tool used to give the user various helpful pieces of information about a mpeg file.
A compact program for Windows PCs that will give you all the parameters for a MPEG file. medialab
One of the more popular VCD/SVCD players for the Mac OS X platform.
MPV, M1V, M2V
MPEG-1 or MPEG-2 video elemantary stream(no audio). Also called M1V for MPEG-1 video and M2V for MPEG-2 video.
A scene recorded from different viewpoints. Each angle is equal in time length and an Angle Block may contain up to nine (9) angles.
Multiplex, Multiplexing, Mux, Muxing
Joining video and audio to one file. Also called "Mux".
Describes a video component that can handle 2 or more types of broadcast video standards. Multisystem televisions, videocassette recorders, and DVD players are not found in all stores but are manufactired by many of the large electronics companies. Why buy multisystem? People who move between countries can play/watch video in both countries. Or if you have a relative in India and want to watch the latest from Bollywood in the US then multisystem works also. Multisystem components often cost more than the single system equivalents. Also some DVD players that are billed as single system VCD capable (NTSC / PAL) can play both.
In DVD-Video there are five types of navigation data: Video Manager Information (VMGI), Video Title Set Information (VTSI), Program Chain Information (PGCI), Presentation Control Information (PCI) and Data Search Information (DSI).
A popular PC program for recording data and video CDs and DVDs. Ahead Nero Site
A person new to a subject, a beginner, novice. Also noob, n00b, etc. Newbie Help
Irrelevant, meaningless, or erroneous information added to a signal by the recording or transmission medium or by an encoding/decoding process. An advantage of digital formats over analog formats is that noise can be completely eliminated (although new noise may be introduced by compression).
Abbreviation of National Television Standards Committee. The NTSC is responsible for setting television and video standards in the United States (in Europe and other parts of the world, the dominant television standards are PAL and SECAM). The NTSC standard for television defines a composite video signal with a refresh rate of 60 fields (half-frames interlaced) per second. Each frame contains 525 lines and can contain 16 million different colors. The resolution of an NTSC VCD is 352x240 pixels, an NTSC SVCD is 480x480, and an NTSC full D1 DVD is 704 or 720 x 480.
Original Equipment Manufacturer. OEMs buy computers or hardware in bulk and customize them for a particular application. They then sell the customized computer under their own name. http://www.webopedia.com/TERM/O/OEM.html
Now also used to describe a DVD burner or other item
sold at a cheaper price, either without or with less included
software than the more expensive "retail' version.
Theora is the video compression codec part of the ogg multimedia project. It is based on On2's VP3 codec, but On2 has released it now under a BSD type liscence and renounced its patents, so it is now free and open source. A final release is due out in early summer 2003. http://www.theora.org
Ogg is the name of an open souce multimedia project maintained by the xiph.org foundation. Vorbis refers to the lossy general purpose audio compression format that surpasses mp3 in quality and rivals new formats such as AAC and TwinVQ (a.k.a. VQF). http://www.xiph.org http://www.vorbis.com
OGM, Ogg Media file/stream/container is a video and audio container similiar to avi, matroska. http://en.wikipedia.org/wiki/OGM
When encoding MPEG video, a GOP which uses referenced pictures from the previous GOP at the current GOP boundary. For example the GOP is open when B Frames at the start of a GOP rely on I or P frames from the immediately previous GOP. Also see Closed GOP.
According to the official standards, a cd (and a cd-r disc as well) should have a capacity of 650MB or 700MB of data, or an equivalent of 74 minutes or 80 minutes of audio.
As the laser beam in a cd recorder writes on cd-r media, it travels from the center of the disc towards its edge. Before the physical edge of the cd, there is an already set limit to prevent the laser beam writing beyond that, so that the physical edge of the cd's writable surface will never be met.
This means that there exists a security zone at the edge of the cd-r media. If we could write into that security zone, we could gain in capacity, since we would be able to write more data on the cd. This is called overburning.
Overlay, Hardware Overlay, Video Overlay
Hardware overlay, also known as video overlay, is a method of rendering an image to a display screen with a dedicated memory buffer inside computer video hardware, to display a fast-moving video image such as a computer game, a DVD, or the signal from a TV card. Hardware overlay is supported by most video cards (since about 1998) and media players. http://en.wikipedia.org/wiki/Hardware_overlay
The area at the edges of a television tube that is covered to hide possible video distortion. Overscan typically covers about 4 or 5 percent of the picture.
A P-frame is a video frame encoded relative to the past reference frame. A reference frame is a P- or I-frame. The past reference frame is the closest preceding reference frame.
Each macroblock in a P-frame can be encoded either as an I-macroblock or as a P-macroblock. An I-macroblock is encoded just like a macroblock in an I-frame. A P-macroblock is encoded as a 16x16 area of the past reference frame, plus an error term. To specify the 16x16 area of the reference frame, a motion vector is included. A motion vector (0, 0) means that the 16x16 area is in the same position as the macroblock we are encoding. Other motion vectors are relative to that position. Motion vectors may include half-pixel values, in which case pixels are averaged. The error term is encoded using the DCT, quantization, and run-length encoding. A macroblock may also be skipped which is equivalent to a (0, 0) vector and an all-zero error term. The search for good motion vector (the one that gives small error term and good compression) is the heart of any MPEG-1 video encoder and it is the primary reason why encoders are slow. MPEG FAQ
Partial-Constant Angular Velocity, the disc(CD/DVD) is being read/written at an increasing speed until a certain point (speed). After this point the speed will not increase anymore and remain at this speed.
Short for Phase Alternating Line, the dominant television standard in Europe. The United States uses a different standard, NTSC. PAL delivers 625 lines at 50 fields (half-frames interlaced) per second. The resolution of a PAL VCD is 352x288 pixels, a PAL SVCD is 480x576, and a PAL full D1 DVD is 704 or 720 x 576.
Pan & Scan
The technique of reframing a picture to conform to a different aspect ratio by cropping parts of the picture. DVD-Video players can automatically create a 4:3 pan & scan version from widescreen video by using a horizontal offset encoded with the video, which allows the focus of attention to always be visible.
Playback control, PBC, is available for Video CD (VCD) 2.0 and Super Video CD (SVCD) 1.0 disc formats. PBC allows control of the playback of play items and the possibility of interaction with the user through the remote control or some other input device available. VCDimager details
Pulse Code Modulation. An uncompressed, digitally coded representation of an analog signal. The waveform is sampled at regular intervals and a series of pulses in coded form (usually quantized) are generated to represent the amplitude.
Short for Personal Computer Memory Card International Association, and pronounced as separate letters, PCMCIA is an organization consisting of some 500 companies that has developed a standard for small, credit card-sized devices, called PC Cards. Originally designed for adding memory to portable computers, the PCMCIA standard has been expanded several times and is now suitable for many types of devices. There are in fact three types of PCMCIA cards. Webopedia.
Lossy compression techniques based on the study of human perception. Perceptual coding systems identify and remove information that is least likely to be missed by the average human observer.
Not a standard by itself, this term is used when a group of image files are placed on a VCD and displayed in a format like a slideshow. There are two ways of doing this with VCD 2.0 technology:
1) Import the pictures into a video editing program, adding optional music and transition effects as desired. The time between pictures is decided by the filmmaker, not the viewer. The result is output as a VCD or even SVCD motion video file and authored like any motion video to VCD or SVCD. The resolution is limited to VCD 352x240 NTSC, 352x288 PAL or SVCD 480x480 NTSC, 480x576 PAL which may appear grainy or low resolution compaired to the original pictures.
2) Author a photo VCD 2.0 disc encoding each picture as a still MPEG at 704x480 NTSC, 704x576 PAL. This allows better resolution but (probably) lacks transition effects. The resulting collection of still MPEG images can be authored with difficulty in VCDimager/VCDeasy or you can use a program that automates the whole process from images to disc like Ulead DVD Pictureshow 2 (which also does VCD well). VCDhelp PhotoVCD Guides
Physical Sector Number
Serial number assigned to physical sectors on a DVD disc. Serial incremented numbers are assigned to sectors from the head sector in the Data Area as 30000h from the start of the Lead In Area to the end of the Lead Out Area.
Picture in picture. A feature of some televisions that shows another channel or video source in a small window superimposed in a corner of the screen.
A microscopic depression in the recording layer of a disc. Pits are usually 1/4 of the laser wavelength so as to cause cancellation of the beam by diffraction.
The smallest picture element of an image (one sample of each color component). A single dot of the array of dots that makes up a picture. Sometimes abbreviated to pel. The resolution of a digital display is typically specified in terms of pixels (width by height) and color depth (the number of bits required to represent each pixel).
Pixel Aspect Ratio, PAR
The ratio of width to height of a single pixel. Often means sample pitch aspect ratio (when referring to sampled digital video). Pixel aspect ratio for a given raster can be calculated as y/x multiplied by w/h (where x and y are the raster horizontal pixel count and vertical pixel count, and w and h are the display aspect ratio width and height). Pixel aspect ratios are also confusingly calculated as x/y multiplied by w/h, giving a height-to-width ratio.
Picture outside picture. A feature of some widescreen displays that uses the unused area around a 4:3 picture to show additional pictures.
Portable DVD Player
Versatile portables play anywhere and also connect to your home TV.
A software DVD player for the PC. Cyberlink product info
The process of preparing data in the final format to create a DVD disc image for mastering. Includes creating DVD control and navigation data, multiplexing data streams together, generating error-correction codes, and performing channel modulation. Often includes the process of encoding video, audio, and subpictures.
An encoding technology developed by Dolby Laboratories, Pro-Logic and Pro-Logic II are methods of encoding 4 channels for Pro-Logic (left, right, center, surround) and 5 channels for Pro-Logic II (left, right, center, left surround, right surround), into a stereo (left, right) channel format to be decoded into 4 or 5 channels with a proper decoder. It is actually just a small amount of inaudible data slipped into a stereo audio stream that can be reconized by a decoder so that the decoder can separate the stereo signal into 4 or 5 channels. A Pro-Logic signal can be encoded into any form of stereo audio stream including digital files such as mpeg-1/2 layer 2, MP3, uncompressed PCM, audio CD's, and it is also used to get multichannel in VHS tapes and other analog sources. You can listen to Pro-Logic encoded audio streams with any normal equitment but you will only have 2 channels. DVD players are required to have a capable Dolby Digital decoder to downmix a 5.1 Dolby Digital signal on DVDs to a Pro-Logic stereo signal to be output with analog RCA cables. Pro-Logic II can be decoded with Pro-Logic decoders but you will only get 4 channels instead of 5. Pro-Logic is also called Dolby Surround. Dolby
The Input of the Program Stream Multiplexer and the Output of the Program Stream Demultiplexer are the Video and Audio Packetized Elementary Streams (PES)
MPEG-2 Program Stream:
may contain a single program with a common time-base for the different PES; is suitable to error-free transmision environment;
has a variable length packet structure (where packets may be relatively large).
Progressive scan, Progressive, Noninterlaced, Non-Interlaced
Progressive or non-interlaced scanning is any method for displaying, storing or transmitting moving images in which the lines of each frame are drawn in sequence. This is in contrast to the interlacing used in traditional television systems.
In DVD-Video, a menu used to access specific Part of Title (PTT) in a Video Title Set (VTS). Usually referred to as a Chapter Menu.
Film is generally shot and projected at 24 frames per second (fps), so when film frames are converted to NTSC video, the rate must be modified to play at 29.97 fps. During the telecine process, twelve (12) fields are added to each 24 frames of film (12 fields = 6 frames) so the same images that made up 24 frames of film then comprise 30 frames of video.Video plays at a speed of 29.97 fps so the film actually runs at 23.976 fps when transferred to video.
AVID does a 2-3 pulldown (which is used in a lot of film editing)
But most other editors use 3-2 pulldown.
2-3 Pulldown vs. 3-2
It is commonly referred to as 3-2 pulldown; while modern telecine machines can go either way. Therefore, AA BB BC CD DD. If the telecine is set for 3-2, you'll get BB BC CD DD AA.
Another slightly confusing consideration: When the pulldown process occurs, it turns out that the video version of the film runs slightly SLOWER than the original film did. This occurs because the film is running at 24 frames per second, but in order to create the right pattern of A-B-C-D on the video tape, which runs at 29.97 frames per second, the film was actually played at 23.976 fps during the telecine (film->tape process).
Prohibited User Operation. Optional button instructions written into the authored DVD. Often used to disallow buttons from functioning. For example, during the FBI warning on commercial discs, a PUO usually prevents it being skipped by pressing menu or FF buttons.
QAM, QAM tuner
In North American digital video, a QAM tuner is a device present in some digital televisions and similar devices which enables direct reception of digital cable channels without the use of a set-top box. QAM-based HD programming of local stations is sometimes available to analog cable subscribers, without paying the additional fees for a digital cable box. The availability of QAM HD programming is rarely described or publicized in cable company product literature. If cable providers provide rebroadcasts of locally aired programming, they must also carry rebroadcasts of high-definition digital locally aired programming, in an unencrypted form, that does not require the customer to use leased equipment, per FCC Sec. 76.630 and CFR Title 47, §76.901(a). These usually include the local affiliates for CBS, NBC, ABC, PBS, and FOX, and the cable providers comply by rebroadcasting them over QAM channels. The law does not require the cable provider to advertise their availability, and the cable customer service representatives are known to unequivocally (and incorrectly) insist to customers that a converter box is mandatory to view any HD channels.
A Mac tool used to sync Quicktime movies by altering the frame rate of the video. Has now been replaced with programs that create in sync video files.
The quantisation process refers to the DCT coefficients and is performed in order to both remove the subjective redundancy and control the compression factor.
The setting of the quantisation parameters is a key point for the quality of the coder.
To remove the subjective redundancy some quantisation matrices are used.
A couple of default matrices are given, for Intra and Non-Intra blocks, but other matrices can be transmitted at the sequence layer.
Each element of a quantisation matrix refers to a coefficient of the Discrete Cosine Transform and it should be proportional to the subjective redundancy of that coefficient.
In general lower spatial frequencies coefficients are better quantised than higher frequencies ones.
With 4:2:2 or 4:4:4 data four matrices can be used, distinguishing between Luminance and Chrominance.
In order to perform the bit-rate control or, more in general, to achieve an high enough compression factor , it is often necessary to quantise the DCT coefficients more heavily.
So a quantiser_scale_code is transmitted at each slice and may be transmitted also at each macroblock.
The quantiser_scale_code is a pointer to the value of the quantiser_scale, which is applied to any DCT coefficient, except the DC coefficient of Intra blocks.
Two different tables can be chosen at the picture layer setting the q_scale_type flag.
The Intra DC coefficients are quantised in a different manner to all other coefficients.
The number of bits of precision (from 8 to 11) is set by the intra_dc_precision field at the picture_layer
A term used in all mpeg video encoders, both software and hardware, which refers to the two 8x8 blocks of numbers appearing under the Quantize Matrix tab in the settings on some encoders or hard coded. These blocks of numbers represent the mathematical functions that the encoder will perform in order to best optimize the video for the appropriate format. Settings vary between a video being encoded as computer animation intended for a computer monitor or full motion video intended for a television. Different Matrix are used for progressive frame and interlaced and also the matrix varies (should) dependent on bit rate.
See also BLOCK and DCT
A digital video software standard developed by Apple Computer for Macintosh (Mac OS) and Windows operating systems.
Quicktime Player can be used to view numerous types of video and audio files.
Redundant Array of Independent Disks - a method used to standardize and categorize fault-tolerant disk systems. RAID levels provide various mixes of performance, reliability, and cost. Three of the the most implemented RAID levels are Level 0 (striping), Level 1 (mirroring), and Level 5 (RAID-5). RAID disk systems may offer advantages during video capture.
A DVD-RAM drive capable of reading and writing CD-R and CD-RW media.
An error-correction encoding system that cycles data multiple times through a mathematical transformation in order to increase the effectiveness of the error correction, especially for burst errors (errors concentrated closely together, as from a scratch or physical defect). DVD uses rows and columns of Reed-Solomon encoding in a two-dimensional lattice, called Reed-Solomon product code (RS-PC).
Region coding is how Hollywood studios stagger DVD movie releases across the planet. These codes ensure that one country doesn't get a DVD movie before the same movie is out in that country's theatres. In their corporate omniscience, movie studios have carved the planet into regions with each region having a specific code.
All DVD players and discs have region codes. A DVD player and disc must be of the same region or the disc will not play.
If you want to watch movies from other countries, you need a multiregion DVD player. This will allow you to play any disc from any region. However, because TV standards differ, you might need a specialized NTSC/SECAM/PAL TV or a DVD player that can output any signal to the standard your TV accepts. TechTV
Region Coding Enhancement, RCE
Abbreviated RCE, it is a digital enhancement added to some studios DVDs to stop region 1 (R1) DVDs from playing on Region-free DVD players.
Extensive discussion & movie list is found on DVDtalk.com
A database repository for information about a computer's configuration. The registry contains information that Windows continually references during operation, such as:
- Profiles for each user.
- The programs installed on the computer and the types of documents each can create.
- Property settings for folders and program icons.
- What hardware exists on the system.
- Which ports are being used.
The registry is organized hierarchically as a tree and is made up of keys and their subkeys, hives, and entries.
If programs do not uninstall properly or store configuration information not available via the program interface you might have to manually edit the registry. You can access the registry by running the program RegEdit. WARNING - if you incorrectly modify the registry you can disable programs, file associations or the whole operating system. Making a backup copy of the registry is highly recommended.
The process of converting between different spatial resolutions or different temporal resolutions. This may be based on simple sampling of the source information at higher or lower resolution or may include interpolation to correct for differences in pixel aspect ratios or to adjust for differences in display rates.
1) A measurement of relative detail of a digital display, typically given in pixels of width and height;
2) the ability of an imaging system to make clearly distinguishable or resolvable the details of an image. This includes spatial resolution (the clarity of a single image), temporal resolution (the clarity of a moving image or moving object), and perceived resolution (the apparent resolution of a display from the observer's point of view). Analog video is often measured as a number of lines of horizontal resolution over the number of scan lines. Digital video is typically measured as a number of horizontal pixels by vertical pixels. Film is typically measured as a number of line pairs per millimeter;
3) the relative detail of any signal, such as an audio or video signal. Also see lines of horizontal resolution.
Video information in the form of red, green, and blue tristimulus values. The combination of three values representing the intensity of each of the three colors can represent the entire range of visible light.
To take off the audio or video from a CD or DVD. Often CD Audio is "ripped" to MP3 files or DVD video ripped to VOB files. VCDhelp ripping
Run-length coding. Lossless compression method that exploits contiguous samples with same value.
Reed-Solomon Product Code. An error-correction encoding system used by DVD employing rows and columns of Reed-Solomon encoding to increase error-correction effectiveness.
Super Audio CD is the next generation of audio disc, offering full-range, uncompressed digital multi-channel surround sound. SACD can also be backward compatible using so called hybrid discs with an extra layer that allows them to be played on conventional CD players but then only with ordinary CD quality. SACD can be played on SACD Players, DVD Players with SACD support and if using hybrid discs also CD Players. SACD is currently competing with DVD-Audio as the new audio defacto standard. Philips SACD information.
The number of times a digital sample is taken, measured in samples per second, or Hertz. The more often samples are taken, the better a digital signal can represent the original analog signal. Sampling theory states that the sampling frequency must be more than twice the signal frequency in order to reproduce the signal without aliasing. DVD PCM audio allows sampling rates of 48 and 96 kHz.
Converting analog information into a digital representation by measuring the value of the analog signal at regular intervals, called samples, and encoding these numerical values in digital form. Sampling is often based on specified quantization levels. Sampling may also be used to adjust for differences between different digital systems.
Serial ATA is an evolution of the Parallel ATA storage interface. Serial ATA is a serial link, a single cable with a minimum of four wires creates a point-to-point connection between devices. Transfer rates for Serial ATA begin at 150MBps.
Scalability offers a set of tools by which video can be coded at different Resolutions (different scales) in one total bitstream.
On the decoder side, video can be decoded at the suitable resolution (scale) extracting a portion of the total bitstream.
It adds compatibility and error concealment.
Types of Scalability
The SNR scalability allows the enhancement of the video quality by means of an enhancement layer bitstream. The DCT coefficients encoded by an another layer are refined between the inverse quantisation and the inverse DCT processes. See also the block layer.
So two different goals are obtained: compatibility, as the base layer bitstream can be decoded by a simpler decoder, and graceful degradation, as the base layer can be better protected in transmission with errors.
The spatial scalability allows the increasing of the picture size. The enhancement layer bitstream refers to the lower layer bitstream in order to get a possible spatial prediction for the macroblock. The spatial prediction is made from the lower layer decoded picture referenced by the lower layer temporal reference, that picture needs to be upsampled to the enhancement layer picture size. In the enhancement layer, prediction can be: only temporal, only spatial or a weighted combination of both. Spatial prediction may be used also in Intra pictures.
The base layer bitstream can be decoded by simpler decoders, while the more complex ones may have larger displays. Error concealment is possible protecting better the lower bitstream during transmission and displaying the upconverted images.
The temporal scalability allows an ehancement of the picture rate. It hasn't been included yet as a tool of a defined Profile.
Frequency (Data Partitioning)
Frequency scalabilty provides an ehancement in terms of "bands" of DCT coefficients. The upper layer would contain those DCT coefficients that have been set to zero in the lower layer stream. It hasn't been included yet as a tool of a defined Profile
A video usually recorded form a promotional video tape or DVD which is sent to censors and film critics etc. The quality is usually as good as a commercial video or DVD. Sometimes a copyright message appears on the screen.
Small Computer System Interface is a standard electronic interface between your computer and its peripherals(hard drives, CD and DVD Readers and Writers and other peripherals). SCSIFAQ.
Standard Definition Television or SDTV refers to DIGITAL transmissions with 480-line resolution, either interlaced or progressive scanned formats. SDTV offers significant improvement over today's conventional NTSC picture resolution, similar to comparing DVD quality to VHS, primarily because the digital transmission eliminates snow and ghosts, common with the current NTSC analog format. However, SDTV does not come close to HDTV in both visual and audio quality.
Séquential Couleur Avec Mémoire/Sequential Color with Memory. A composite color standard similar to PAL (image format 4:3, 625 lines, 50 Hz and 6 Mhz video bandwidth), but currently used only as a transmission standard in France and a few other countries. Video is produced using the 625/50 PAL standard and is then transcoded to SECAM by the player or transmitter.
A sequence consists of all the pictures that follow a sequence header till a sequence_end_code.
Encoding and displaying parameters are transmitted with the sequence header.
The sequence header can be repeated in order to allow random access, but all the data elements of the repeated sequence header, except those concerning quantisation matrices, must have the same values as in the first sequence header.
The repeated sequence header must precede in the bitstream either an I-picture or a P-picture. In the case that random access is performed to a P-picture, it's possible that the decoded pictures may not be correct.
As the GOP layer is optional the management of the pictures is performed at the sequence layer.
An important item is the fact that the frames are not coded in the order in which they are displayed.
In particular the B_frames, that use references "from the future", are always coded after the P_frame (or the I_frame) used for backward predictions.
The frame reordering causes a delay in the coding and decoding processes. On the coder side the delay is given by the number of B_frames, that must wait for the following P_frame (or I_frame), while the decoder should wait for having full the two frame-memories before starting the display.
For special application is possible to code the sequence without any B_frame with the mode low_delay, set at the sequence layer. In this case the decoder needs only one frame-memory.
In an MPEG file, a sequence header is placed before one or more groups of pictures (GOPs) and contains encoding and displaying parameters. The sequence header can be repeated in order to allow random access, but all the data elements of the repeated sequence header, except those concerning quantisation matrices, must have the same values as in the first sequence header. The repeated sequence header must precede in the bitstream either an I Frame or a P Frame.
To allow for better access and editing, many people place a sequence header after every GOP. In final output, some place fewer sequence headers in (every 5 GOPs, etc.) to save bitrate or make the file smaller.
If you need to add sequence headers to a file you can use the tool MPEG SequenceMaker in the Tools Section.
Source Interchange Format is a video resolution standard defined as 352x240 for NTSC and 352x288 for PAL and SECAM.
A slice is a portion of image of 16 lines x ( n x16) pels.
Each slice is coded indipendently from the other slices of the picture.
Therefore the slice layer allows error confinement because, when errors in the bitstream are detected, the decoder can try to continue the decoding process looking for the next slice header.
Video decoding process at the slice layer
Decode all the macroblocks that compose the slice.
Uses a 1.0 pixel aspect ratio. Use this setting if your video has a 640x480 or 648x486 frame size.
Single Sided Dual Layer DVD. See DVD-9.
Single Sided Single Layer DVD. See DVD-5.
SVCD stands for 'Super VideoCD'. A SVCD is very similiar to a VCD, it has the capacity to hold about 35-60 minutes on 74/80 min CDs of very good quality full-motion MPEG-2 video along with up to 2 stereo audio tracks and also 4 selectable subtitles. A SVCD can be played on many standalone DVD Players and of course on all computers with a DVD-ROM or CD-ROM drive with the help of a software based decoder / player. SVCDHelp.com.
Cinematic film movies are shot at 24 progressive frames per second speed. A Frame is the smallest unit of a 24 fps FILM format. NTSC video is a "field-based" format of 59.94 fields per second. A Field is the smallest unit in interlaced video format. 2 fields make up 1 frame. So, this 59.94 fields per second equals 29.97 frames per second. 1 second in FILM (24 frames) is NOT equal to 1 second in NTSC Video (29.97 frames).
To be able to match the speed of an NTSC Video, conversion from a FILM format to an NTSC Video format undergoes a process called 2:3 pulldown or TELECINE. This process, in simplest terms, means "to add 6 frames so that a 24 fps becomes 30fps which is close to 29.97 fps (another trick is used to get to 29.97). Picture of the process - doom9.org
A video recorded in a cinema but usually on an expensive camera and a seperate audio source or direct audio connection (so the audience cannot be heard). The result is a video generally of very good quality.
Short for The Great Pat-o. He is the coolest guy ever.
Time Base Corrector, TBC
A hardware device used to remove or mask variations in the video synchronizing signals, which can result in "skewing" and other distortions in the video image generated by unavoidable mechanical inaccuracies in helical scan recorders. This is accomplished by automatically delaying the video signal so that each line starts at the proper time. In capturing a TBC is used to 'clean up' analog tape problems and may assist in the capture of old VHS tapes when used betwee a VCR and capture card. Hardware TBCs can be expensive.
Information recorded with audio or video to indicate a position in time. Usually consists of values for hours, minutes, seconds, and frames. Also called SMPTE time code. Some DVD-Video material includes information to allow the player to search to a specific time code position.
A DVD 'Title' is generally a logically distinct section of a DVD-Video. For example the main feature film on a DVD might be Title 1, a behind-the-scenes documentary might be Title 2 and a selection of cast interviews might be Title 3. There can be up to 99 Titles on any DVD. dvd.sourceforge.net
A popular, low cost MPEG-1 and MPEG-2 encoder commonly used to encode VCD, SVCD, and DVD. Free trial is available. TMPGenc.net
1) A distinct element of audiovisual information, such as the picture, a sound track for a specific language, or the like. DVD-Video allows one track of video (with multiple angles), up to 8 tracks of audio, and up to 32 tracks of subpicture; 2) one revolution of the continuous spiral channel of information recorded on a disc.
A relatively short video that is either used to preview a longer feature film, a television program, food, a sound system or other advertizing. Trailers often come before a main feature movie as an intro.
On this site generally another name for encoding.
A more technical term would be "The reformatting of content, without changing the source, to another type of content - most often of a different format than the original (but does not have to be)"
The Input of the Transport Stream Multiplexer and the Output of the Transport Stream Demultiplexer are the Video and Audio Packetized Elementary Streams (PES)
MPEG-2 Transport Stream:
may contain one or multiple programs (even with independent time-base); is suitable to no error-free transmision; has a fixed length packet structure.
Transport Packet (188 bytes)
Header: 4 bytes
Payload: 184 bytes
Transport-Stream Packet and ATM transmission
The packet length is the output of a compromise: on one hand longer the packet, lower the overhead; on the other hand shorter the packet, easier the streams syncronization.
The 188 bytes transport stream packet can be easily divided into 4 ATM cells. Each ATM cell is composed by 5 bytes of header + 1 byte for the Adaptation Layer and 47 bytes for the payload (=188/4). So a transport stream can be transmitted through ATM with the minimum overhead.
Possible operations on the Transport-Streams
Program selection and demultiplexing of its elementary streams.
Recomposition of one or more Transport-Streams into one or more different Transport-Streams.
Extraction of a given Program from a Transport-Stream and production of a Program-Stream.
Transformation of a Program-Stream into a Transport-Stream.
A GUI program frontend to VCDimager created by TTool. The program was the first to provide a full range of VCD and SVCD authoring but development stopped in 2002. Similar in functionality to VCDeasy. TSCV Site
One of the major achievements of DVD is that it has brought all the conceivable uses of CD for data, video, audio, or a mix of all three, within a single physical file structure called UDF, the Universal Disc Format. Promoted by the Optical Storage Technology Association (OSTA), the UDF file structure ensures that any file can be accessed by any drive, computer or consumer video. It also allows sensible interfacing with standard operating systems as it includes CD standard ISO 9660 compatibility. UDF overcomes the incompatibility problems from which CD suffered, when the standard had to be constantly rewritten each time a new application like multimedia, interactivity, or video emerged.
The version of UDF chosen for DVD-Video to suit both read-only and writable versions - is a subset of the UDF Revision 1.02 specification known as MicroUDF (M-UDF).
Because UDF wasn't supported by Windows until Microsoft shipped Windows 98, DVD providers were forced to use an interim format called UDF/ISO (UDF Bridge).
UDF has been revised and now appears in revisions 1.02, 1.5, and 2.0.
Burnworld - Disctronics
Also called UDF bridge, UDF/ISO is a hybrid filesystem utilizing UDF and ISO 9660. Developed because UDF wasn't supported by Windows until Microsoft shipped Windows 98, DVD providers were forced to use an interim format. Windows 95 OSR2 supports UDF Bridge, but earlier versions do not. As a result, to be compatible with Windows 95 versions previous to OSR2, DVD vendors had to provide UDF Bridge support along with their hardware.
DVD-ROM discs use the UDF Bridge format. (Note: Windows 95 was not designed to read UDF but can read ISO 9660). The UDF Bridge specification does not explicitly include the Joliet extensions for ISO 9660, which are needed for long filenames. Windows 98 and later MS operating systems read UDF so these systems have no problem with either UDF or long filenames. Burnworld
User Operation Prohibitions. Settings that prevent the viewer from skipping or fast-forwarding certain parts of DVDs.
Universal Plug and Play (UPnP) is a set of computer network protocols. The goals of UPnP are to allow devices to connect seamlessly and to simplify the implementation of networks in the home (data sharing, communications, and entertainment) and corporate environments. UPnP AV MediaServers store and share digital media, such as photographs, movies, or music. These media servers use the Universal Plug and Play (UPnP) protocols to communicate with other devices. UPnP MediaServers software Tversity, XBMC, MythTV, Windows Media Connect.
Universal Serial Bus is the solution for all PC users who want an instant, no-hassle way to connect new hardware like digital joysticks, scanners, digital speakers, digital cameras or a PC telephone to their computer. USB makes adding peripheral devices extremely easy. With USB-compliant PCs and peripherals, you just plug them in and turn them on. USB 2.0 (USB2) is the latest USB technology. This technology is approximately forty times faster than the previous USB 1.1 technology, increasing the speed of the device to PC connection from 12Mbps on USB 1.1 to up to 480Mbps on USB 2.0. usb-ware.com.
Vertical Blanking Interval - the part of a TV transmission that is blanked, or left clear of viewable content, to allow time for the TV’s electron gun to move from the bottom to the top of the screen as it scans images. This blank area is now being used to broadcast closed captioned and text formatted information.
Variable Bit Rate - the bitrate can vary at any part of a single video or audio stream. VBR can is used to increase bitrate during high motion scenes in a video or to reduce overall file size. DVD MPEG-2 video is often variable bit rate. Also see CBR (constant bit rate).
VC-1 is a video codec standard. Its most popular implementation is Windows Media Video 9. It is an evolution of the conventional DCT-based video codec design also found in H.261, H.263, MPEG-1, MPEG-2, and MPEG-4. It is widely characterized as an alternative to the latest ITU-T and MPEG video codec standard known as H.264/MPEG-4 AVC. VC-1 contains coding tools for interlaced video sequences as well as progressive encoding. The main goal of VC-1 development and standardization is to support the compression of interlaced content without first converting it to progressive, making it more attractive to broadcast and video industry professionals.
VCD stands for 'Video Compact Disc' and basically it is a CD that contains moving pictures and sound. If you're familiar with regular audio/music CDs, then you will know what a VCD looks like. A VCD has the capacity to hold up to 74/80 minutes on 650MB/700MB CDs respectively of full-motion video along with quality stereo sound. VCDs use an encoding standard called MPEG-1 to store the video and audio. A VCD can be played on almost all standalone DVD Players and of course on all computers with a DVD-ROM or CD-ROM drive with the help of a software based decoder / player. VCDHelp.com.
VCD Header trick
Some DVD players just can't play SVCD... but in this case and if you are lucky your DVD player may be able to play VCDs that use the "VCD-Header Trick". By changing the "Header" of a SVCD MPEG-2 video file into the one of a VCD2.0 MPEG-1 file and by using it as standard VCD2.0 file, you may be able to play MPEG-2 files with your home DVD player. www.vcdeasy.org/modules.php?name=_Guides&id=VcdTrick
A freeware VCD / SVCD authoring program for the PC. The program works with the freeware program VCDimager to automate production of video CDs with complex features like menus, chapters and the like. VCDeasy.org
GNU VCDImager is a full-featured mastering suite for authoring, disassembling and analyzing Video CD's and Super Video CD's.
The core functionality consists of directly making Video CD BIN/CUE-style CD images from mpeg files, which (after being written to CDR(W) media) can be played on standalone VCD players or DVD players and on computers running GNU/Linux, MacOS, Windows or any other OS capable of accessing VCD's. BIN/CUE images can be burned with the freeware program cdrdao (please use a recent version, since older ones do not support BIN/CUE-style cuesheets). VCDimager.org
Shorthand for Virtualdub
Variable Frame Rate(VFR) is a term in video compression for a feature supported by some container formats like mkv, mp4, flv which allows for the frame rate to change actively during video playback, or to drop the idea of frame rate completely and set individual timecode for each frame.
Video for Windows (VfW) The first video capture and display system developed by Microsoft for the Windows operating system. The design of VfW video capture was optimized for capturing movies to disk. Features important to video conferencing, TV viewing, capture of video fields, and ancillary data streams are missing from the VfW architecture. To circumvent these limitations, vendors augmented VfW by implementing proprietary extensions. However, without standardized interfaces, applications that use these features must include hardware-dependent code.
With the integration of DVD, MPEG decoders, video decoders and tuners, video port extensions (VPE), and audio codecs on single adapters, a unified driver model that supports all these devices and handles resource contention simplifies development efforts.
The stream class driver provides a framework for addressing these issues. It supports a uniform streaming model for standard and custom data types. Similarly, property sets for most standard devices are defined and can easily be extended if needed. Because the stream class follows WDM streaming conventions, it supports data transfer between kernel drivers without requiring a thread to transition to user mode. Thus there is no decrease in system performance associated with thread context switches between user-mode and kernel-mode.
Due to the large installed base of VfW applications, it is anticipated that VfW drivers will continue to flourish for devices that are primarily used for capturing movies. Capture devices that are used primarily for TV viewing and video conferencing are expected to migrate more quickly from the VfW model to the WDM streaming model.
To bridge the VfW and WDM worlds, a mapper is provided as part of the operating system. This component, called the VfW-to-WDM mapper, makes WDM drivers appear as VfW drivers for legacy applications.
Virtualdub uses only VfW drivers, not the newer WDM interface. MS Info
VHS an analog format capable of delivering 240 lines of video resolution, along with stereo sound that's nearly as good as CD (in dynamic range and frequency response). Blank tapes usually feature either 120 minutes or 160 minutes of recording time at the highest recording speed (6 hours or 8 hours at the slowest speed). VHS and VCR's are slowly being phased out in favor of DVD players and other digital tape media.
The process for changing a video from one format to another by altering the resolution and/or the bitrate. Normally the result of this process is a movie with a different compression. For a proper encoding you need a piece of software and/or hardware, which is called codec.
The UDF file name used for DVD-Video directory on a DVD disc volume. Files under this directory name contain pointers to the sectors on the disc which hold the program streams. Read more about all files in the VIDEO_TS folder here.
A video capture/processing utility written by Avery Lee for 32-bit Windows platforms (98/NT/2000/XP), licensed under the GNU General Public License (GPL). While not a full featured video editor like Adobe Premiere, it is streamlined for fast linear operations over video. It has batch-processing capabilities for handling large numbers of files and can be extended with third-party video filters. VirtualDub is mainly geared toward processing AVI files, although it can read (not write) MPEG-1 and also handle sets of BMP images. Various offshoot versions are on the web from other developers to handle MP3 and MPEG-2/VOB/AC3. Creator's Site
All DVD movies are stored in on a DVD video disc in so-called VOB files. VOB files usually contain multiplexed Dolby Digital audio and MPEG-2 video. VOB files on a DVD are numbered as follows: vts_XX_y.vob where XX represents the title and Y the part of the title. There can be 99 titles and 10 parts, although vts_XX_0.vob does not contain any video, usually just menu or navigational information. You can find them on a DVD video disc in a subdirectory labelled VIDEO_TS (all upper case).
All VOB files are essentially MPEG2 Program streams with audio, video, sub-picture and navigation data multiplexed. A VOB file is organized as a set of cells; a cell is a basic unit of play data. Each cell consists of a sequence of units called VOBUs. Each VOBU is a sequence of
packs. The first pack in a VOBU is a navigation pack and contains Program Control Information (PCI) packet and Data Search Information (DSI) packet. The remaining packs contain audio,
video and sub-picture data multiplexed together. Each pack has a fixed size of 2048 bytes. A pack typically contains only one data packet and may be stuffed with dummy bytes or a packet
called ‘padding’ bytes/packet to make it a fixed size.
DVD allows easy navigation in its audio and video data. Information for navigation across different VTS is contained in the VMGM. Within a title, the play order of different cells (from one
or more VOBs in the title) is described in a Program Chain (PGC). A PGC is a logical unit to present a part of or the entire Title or Menu. A PGC is further divided into programs. Each program contains integral number of cells. A Title may have one or more PGCs. However, a Title
that has parental guidance levels, will have more than one PGC. Depending on the parental level selected by the DVD disk viewer, the PGCs are selected for being played. PGC contains PGCI
which gives the order of presentation of cells within that PGC.
The information for presentation of a cell, such as the angle information for seamless and nonseamless play and highlight information is contained in the Navigation packs occurring within the
When playing non-seamlessly, the cells within a logical block are placed contiguously. Therefore, during cell presentation, intermittent blocks may have to be skipped depending on the angle information selected by the DVD disk viewer.
An angle block is a logical block containing cells for different angle presentations. During presentation, not all the cells within the block are played. The different angle cells are of almost
the same play time and since they are placed adjacent to each other, the DVD disk viewer can seamlessly change from one angle to another.
When playing a parental level seamlessly, cells from different VOBs may be interleaved in a logical block. Such a block is called an interleaved block. Each unit of VOB that lies in an interleaved block is called an ILVU of that VOB. This means that the cells in a VOB may not be placed contiguously over the physical address space and may be interleaved with ILVUs from other VOBs.
Intellectual property (IP) that is distributed illegally. Examples are software, movies or music being given away or resold without the permission of the author, serial numbers or ways to crack software that is sold (serialz or crackz).
WAV files are probably the simplest of the common formats for storing audio samples. Unlike MPEG audio and other compressed formats, WAVs store samples "in the raw" where no pre-processing is required other that formatting of the data.
The WAV file itself consists of three "chunks" of information: The RIFF chunk which identifies the file as a WAV file, The FORMAT chunk which identifies parameters such as sample rate and the DATA chunk which contains the actual data (samples). MS File Formats
A video image wider than the standard 1.33 (4:3) aspect ratio. When referring to DVD or HDTV, widescreen usually indicates a 1.78 (16:9) aspect ratio.
A software DVD player for the PC. intervideo.com
Windows Media Audio.
Windows Media Format files are audio/video files encoded with the Windows Media Encoder, providing high quality and media security for streaming and download-and-play applications on PCs, set-top boxes, and portable devices. Windows Media Format comprises Windows Media Audio and Video codecs, an optional integrated digital rights management (DRM) system, and a file container. Microsoft WMF Information
Windows Media Player - a multimedia audio and video player bundled with the Windows Operating System. The player can play many different formats natively including WAV, ASF, WMF, MPEG-1 and can play many types of AVI files if the codec is installed including Divx. WMP can also play MPEG-2 with a third party codec installed (like the ones installed by software DVD players such as WinDVD and PowerDVD). MS Info
Windows Media file with Audio and/or Video (WMV): You can use a .wmv file either to download and play files or to stream content. The .wmv file format is similar to the .asf file format. MS Sample File
WMV HD stands for Windows Media Video High-Definition. Just what makes high definition video different? It comes down to video resolution. The video most of us are used to seeing on our TVs today, called "standard definition," has at most 480/576 visible lines of detail, whereas "high definition" video has as many as 1,080. High definition video looks sharper and clearer than regular video, especially on big-screen displays. It actually comes in two different resolutions, called 1080p and 720p. http://www.microsoft.com/windows/windowsmedia/content_provider/fi ... video.aspx
eXtended SVCD - XSVCD has same features as SVCD but it is possible to use higher bitrates and higher resolution to get higher video quality. XSVCD is basically everything that uses MPEG-2 video, is not within the SVCD standard or close to DVD, and burned in "SVCD" Mode on a CD-R or CD-R(W). XSVCD can be played on some hardware DVD players and many computers with appropriate software like a software DVD player or a media player with a MPEG-2 codec. VCDHelp X(S)VCD Info
eXtended VCD - XVCD has same features as VCD but it is possible to use higher bitrates and higher resolution to get higher video quality. XVCD is basically everything that uses MPEG-1 video, is not within the VCD standard, and burned in "VCD" Mode on a CD-R or CD-R(W). XVCD can be played on some hardware VCD or DVD players and many computers with appropriate software. VCDHelp XVCD Info
XviD is an ISO MPEG-4 compliant video codec. It's not a product but an open source project which is developed and maintained by people around the world. XviD.org
The analog luminance and color-difference components of a color image (in digitized form, as Y, Cr, Cb, in JPEG) or video (NTSC and PAL). If you take the typical Red, Green, Blue colorspace (RGB), you can get YUV from:
Y (or Luma)= 30% Red + 59% Green + 11% Blue Analog Luminance
U (or Cb)=R-Y the red signal component minus the luminance
V (or Cr)=B-Y the blue signal component minus the luminance
The large percentage of Green and the small percentage of Blue (along with Green being sent twice) help to explain why chroma-keying for video is done against greenscreens and not bluescreens like film.
YUV was originally developed for backward compatibility with black-and-white television.
Originally, TV stations only transmitted the black and white signal. When color TV was to be deployed, most people still only owned a black-and-white set. It became clear that transmitting an RGB signal separately from a black-and-white signal would be highly impractical. A system was needed in which a TV station could transmit a signal that could be seen as monochrome on the older but still common black-and-white TV sets, but the same signal should be in color on the new color TV sets. YUV encoding allows the signal had to continue to have the black-and-white image, and the color information is added to it transparently.
Many video filters handle video streams in either RGB or YUV colorspace. Some can handle both.
The human eye is less sensitive to colour variations than to intensity variations. YUV allows the encoding of luminance (Y) information at full bandwidth and chrominance (UV) information at reduced bandwidth.
Often called component video, there are different standards for pro video and consumer video. YUV is used on such video types as Beta SP, a very common pro video editing format.
Zone-Constant Linear Velocity, the disc(CD/DVD) is divided into zones. After each zone the write speed is increased