Reply
 
Thread Tools Display Modes
  #1   Report Post  
wylbur37
 
Posts: n/a
Default Question about WAV file parameters

Each WAV file is saved with certain parameters.
The following is a typical example of such parameters ...

PCM, 44.1kHz, 16bit, stereo, 172 kbps

I know what the "44.1kHz" (sampling rate) means,
and what the "172kbps" (bitrate) means.

But what does the "16bit" refer to?

Is there a FAQ that explains these items?
  #2   Report Post  
Richard Crowley
 
Posts: n/a
Default

"wylbur37" wrote ...
Each WAV file is saved with certain parameters.
The following is a typical example of such parameters ...

PCM, 44.1kHz, 16bit, stereo, 172 kbps

I know what the "44.1kHz" (sampling rate) means,
and what the "172kbps" (bitrate) means.


"172kbps" sounds like a highly compressed format such as
would be used for DVD, etc. If you do the math, a 44.1k x
16 bit WAV calculates to 5.292mbps (30 times bigger than
"172kbps")

But what does the "16bit" refer to?


The "width" of the sample. 16-bit is the standard for CDs
and most music recording. Some high-end systems use 24-
bits.

Is there a FAQ that explains these items?


There are many. Google is your friend.

For example, here is an explanation of the WAV file header and
file format...

http://ccrma.stanford.edu/courses/42...ts/WaveFormat/


  #3   Report Post  
 
Posts: n/a
Default

RC [Fri, 24 Sep 2004 05:53:11 -0700]:
"wylbur37" wrote ...
Each WAV file is saved with certain parameters.
The following is a typical example of such parameters ...

PCM, 44.1kHz, 16bit, stereo, 172 kbps

I know what the "44.1kHz" (sampling rate) means,
and what the "172kbps" (bitrate) means.


"172kbps" sounds like a highly compressed format such as


More likely, the person doesn't understand or appreciate
using proper units:

176,400 bytes per second is the CD data rate

176400/1024 = 172.266 KB/sec (not kbps, wrong on two counts)


For the curious, 176400 bytes/sec = 44100 Hz x 2 ch x 2 bytes/sample/ch.

176400 bytes 44100 samples 2 channels 2 bytes
----------- = ------------ x x ---------
sec sec sample / channel

Channels and samples cancel, leaving bytes and sec. 176400 bytes/sec.
In kbps, that's 176400*8= 1411200/1000= 1411.2 kbps.

Here, LITTLE k (as in kbps) is 1000. Little b is bits.
BIG K, as in KB, is 1024. Big B is bytes.

Usually, you only see kb -OR- KB, but not kB, nor Kb. Why?
Because bits are usually described in 1000s (k) of,
and bytes in 1024s (K) of. The K symbol was purposely
made "close to" the standard metric k symbol (kilo = 1000)
since 1024 is "close to" 1000, but a little more (capital K).

(You'd have known this if you didn't skip that day.)

--
40th Floor - Software @ http://40th.com/
iPlay : the ultimate audio player for iPAQs
mp3, ogg, mp4, m4a, aac, wav, play & record
parametric eq, xfeed, reverb - all on a ppc
  #4   Report Post  
datanet
 
Posts: n/a
Default

Hmm, good reply, hel. Only your forgot 16 bits: That's the
width of the data stream, meaning 16 bits at a time is
transmitted in the data stream. Or, ultra-simply, a combination
of 16 zeroes and ones of data at a time is transmitted around the
system. The wider the virtual or literal buss (bit rate), the
faster things are internally and the less problem with
bottlenecks. 32 bit is twice as fast as 16 bit (well, about,
anyway, neglecting some added control bits) and so on.

Pop


wrote in message
...
| RC [Fri, 24 Sep 2004 05:53:11 -0700]:
| "wylbur37" wrote ...
| Each WAV file is saved with certain parameters.
| The following is a typical example of such parameters ...
|
| PCM, 44.1kHz, 16bit, stereo, 172 kbps
|
| I know what the "44.1kHz" (sampling rate) means,
| and what the "172kbps" (bitrate) means.
|
| "172kbps" sounds like a highly compressed format such as
|
| More likely, the person doesn't understand or appreciate
| using proper units:
|
| 176,400 bytes per second is the CD data rate
|
| 176400/1024 = 172.266 KB/sec (not kbps, wrong on two counts)
|
|
| For the curious, 176400 bytes/sec = 44100 Hz x 2 ch x 2
bytes/sample/ch.
|
| 176400 bytes 44100 samples 2 channels 2 bytes
| ----------- = ------------ x x ---------
| sec sec sample / channel
|
| Channels and samples cancel, leaving bytes and sec. 176400
bytes/sec.
| In kbps, that's 176400*8= 1411200/1000= 1411.2 kbps.
|
| Here, LITTLE k (as in kbps) is 1000. Little b is bits.
| BIG K, as in KB, is 1024. Big B is bytes.
|
| Usually, you only see kb -OR- KB, but not kB, nor Kb. Why?
| Because bits are usually described in 1000s (k) of,
| and bytes in 1024s (K) of. The K symbol was purposely
| made "close to" the standard metric k symbol (kilo = 1000)
| since 1024 is "close to" 1000, but a little more (capital K).
|
| (You'd have known this if you didn't skip that day.)
|
| --
| 40th Floor - Software @ http://40th.com/
| iPlay : the ultimate audio player for iPAQs
| mp3, ogg, mp4, m4a, aac, wav, play & record
| parametric eq, xfeed, reverb - all on a ppc


  #5   Report Post  
Codifus
 
Posts: n/a
Default

datanet wrote:
Hmm, good reply, hel. Only your forgot 16 bits: That's the
width of the data stream, meaning 16 bits at a time is
transmitted in the data stream. Or, ultra-simply, a combination
of 16 zeroes and ones of data at a time is transmitted around the
system. The wider the virtual or literal buss (bit rate), the
faster things are internally and the less problem with
bottlenecks. 32 bit is twice as fast as 16 bit (well, about,
anyway, neglecting some added control bits) and so on.

Pop


wrote in message
...
| RC [Fri, 24 Sep 2004 05:53:11 -0700]:
| "wylbur37" wrote ...
| Each WAV file is saved with certain parameters.
| The following is a typical example of such parameters ...
|
| PCM, 44.1kHz, 16bit, stereo, 172 kbps
|
| I know what the "44.1kHz" (sampling rate) means,
| and what the "172kbps" (bitrate) means.
|
| "172kbps" sounds like a highly compressed format such as
|
| More likely, the person doesn't understand or appreciate
| using proper units:
|
| 176,400 bytes per second is the CD data rate
|
| 176400/1024 = 172.266 KB/sec (not kbps, wrong on two counts)
|
|
| For the curious, 176400 bytes/sec = 44100 Hz x 2 ch x 2
bytes/sample/ch.
|
| 176400 bytes 44100 samples 2 channels 2 bytes
| ----------- = ------------ x x ---------
| sec sec sample / channel
|
| Channels and samples cancel, leaving bytes and sec. 176400
bytes/sec.
| In kbps, that's 176400*8= 1411200/1000= 1411.2 kbps.
|
| Here, LITTLE k (as in kbps) is 1000. Little b is bits.
| BIG K, as in KB, is 1024. Big B is bytes.
|
| Usually, you only see kb -OR- KB, but not kB, nor Kb. Why?
| Because bits are usually described in 1000s (k) of,
| and bytes in 1024s (K) of. The K symbol was purposely
| made "close to" the standard metric k symbol (kilo = 1000)
| since 1024 is "close to" 1000, but a little more (capital K).
|
| (You'd have known this if you didn't skip that day.)
|
| --
| 40th Floor - Software @ http://40th.com/
| iPlay : the ultimate audio player for iPAQs
| mp3, ogg, mp4, m4a, aac, wav, play & record
| parametric eq, xfeed, reverb - all on a ppc


The number of bits has nothing to do with speed. It is simply the width
of each sample. For the audio CD, there are 44,100 of these 16-bit
samples for every second of sound. For the ultimate DVD audio disc there
are 192,000 24-bit samples for every second of sound. The width of the
sample, be it 16 bits or 24, determines the dynamic range you can
capture in digital audio. The rule of thumb is to multiply the bits by 6
to get the maximum dynamic range capability. For 16 bits, that would be
16*6 or 96 db, and for 24 bits it would be 24*6=144db. In reality, due
to limitations in electronics and such, 24 bit systems haven't even come
close to 144 db capabilty. It's more like 108-110db, and 16 bit systems
are around 80 to 85 db.

CD


  #6   Report Post  
Dick Pierce
 
Posts: n/a
Default

"datanet" wrote in message ...
Each WAV file is saved with certain parameters.
The following is a typical example of such parameters ...

PCM, 44.1kHz, 16bit, stereo, 172 kbps

I know what the "44.1kHz" (sampling rate) means,
and what the "172kbps" (bitrate) means.

But what does the "16bit" refer to?

Is there a FAQ that explains these items?

Hmm, good reply, hel. Only your forgot 16 bits: That's the
width of the data stream, meaning 16 bits at a time is
transmitted in the data stream. Or, ultra-simply, a combination
of 16 zeroes and ones of data at a time is transmitted around the
system. The wider the virtual or literal buss (bit rate), the
faster things are internally and the less problem with
bottlenecks. 32 bit is twice as fast as 16 bit (well, about,
anyway, neglecting some added control bits) and so on.


Ultra simple indeed but, unfortunately, ultra wrong. The 16 bits
cited has NOTHING WHATSOEVER to do with the "width of the data
stream." All the hoo-ha above about "vritual or literal buss" and
16 vs 32 bits and bottlenecks is utterly irrelevant to the question
at hand.

It is simply declaring that each audio sample will have
a precision of 16 bits.

Now, to answer the original question definitively: A WAVE file
is one particular kind of RIFF (Resource Interchange File Format)
file. A RIFF file is composed of individual "chunks" of data, each
chunk is dientified by a 4-character chunk identifier and a 32 bit
chunk size (in bytes).

A WAVE file MUST have at least 2 such chunks to be valid:

1. The 'fmt ' chunk: this holds the parameters describing the
properties of the audio data, and, in fact, it is the contents
of the 'fmt ' chunk that you are seeing. The 'fmt ' chunk is
broken down as follows:

wFormatTag - a 16-bit integer holding a format indicator for
the file. SOme examples of which a

1 - PCM
2 - ADPCM
6 - A-Law
7 - u-Law
48 - Dolbay AC3
80 - MPEG

nChannels - 16 bint integer of the number of audio channels
in the file

nSamplesPerSecond - 32 bit unsigned integer of the sample rate
of the audio

nAvgBytesPerSecond - 32 bit unsigned integer of the average
buffer rate for the audio.

nBlockAlign - block alignment (in bytes) for the waveform
data.

And, if the wFormatTag is 1 (PCM), the fmt chunk is extended to
include:

nBitsPerSample - number of bits per sample.

If it is another format, the chunk extension will include relevant
formation for that format.

2. the 'data' chunk. This actually hold the audio data samples, as
described in the 'fmt ' chunk.

There may well be other chuncks as well. FOr example, EBU Tech 3285
describes the so-called "Broadcast Wave Format" extensions, while
AES-46 describes additional chunk information for use in radio
applications.

So your are, in essence, seeing a human readable "dump" of the
contents
of the 'fmt ' chunk. Your reading:

"PCM, 44.1kHz, 16bit, stereo, 172 kbps"

is directly mappable as follows:

wFormatTag = 1 "PCM"
nChannels = 2 "Stereo"
nSamplesPerSecond = 44100 "44.1 kHz"
nAvgBytesPerSecond = 176400 "172 kbps" (a typo, perhaps?)
nBlockAlign = 4
wBitsPerSample = 16 "16 bits"

Now, nBlockALign is calculated as follows: each sample holds
16 bits. A byte is 8 bits, so 2 bytes are required to hold
a single sample. The file is in stereo, so two samples are
required (one for each channel) at each sample block, thus
2 (bytes/channel) * 2 channels = 4 bytes.

Then, the nAvgBytesPerSecond is (in the case of PCM), simply
the number of bytes per sample block times the number of sample
blocks per second, 44,100 * 4 = 176,400. Now, they used the term
"kilo" in the wrong sense here, they used the disk convention of
1024 as opposed to the normal 1000, thus 176,400/1024 = 172.26
kiloBYTES per second.

Now, the "16 bits" describes, as I mentioned, the width of each
sample word, or its "precision." A 16 bit sample can represent,
on a per-sample basis, one of 2^16 or some 65,536 unique values.
You can also view this as describing how wide a dynamic range
that a sample can represent. In the case of 16 bits, this is a
dynamic range of around 96 dB, that is, the difference between
the lowest unambiguous value and the highest that can be represented
by a 16 bit sample stream is 96 dB.

But, please, ignore the above nonsense about "virtual busses" and
"bottlenecks," it's a load of irrelevant hooey.
  #7   Report Post  
datanet
 
Posts: n/a
Default


"Codifus" wrote in message
...
| datanet wrote:
| Hmm, good reply, hel. Only your forgot 16 bits: That's the
| width of the data stream, meaning 16 bits at a time is
| transmitted in the data stream. Or, ultra-simply, a
combination
| of 16 zeroes and ones of data at a time is transmitted around
the
| system. The wider the virtual or literal buss (bit rate),
the
| faster things are internally and the less problem with
| bottlenecks. 32 bit is twice as fast as 16 bit (well, about,
| anyway, neglecting some added control bits) and so on.
|
| Pop
|
|
| wrote in message
|
...
| | RC [Fri, 24 Sep 2004 05:53:11 -0700]:
| | "wylbur37" wrote ...
| | Each WAV file is saved with certain parameters.
| | The following is a typical example of such parameters
....
| |
| | PCM, 44.1kHz, 16bit, stereo, 172 kbps
| |
| | I know what the "44.1kHz" (sampling rate) means,
| | and what the "172kbps" (bitrate) means.
| |
| | "172kbps" sounds like a highly compressed format such as
| |
| | More likely, the person doesn't understand or appreciate
| | using proper units:
| |
| | 176,400 bytes per second is the CD data rate
| |
| | 176400/1024 = 172.266 KB/sec (not kbps, wrong on two
counts)
| |
| |
| | For the curious, 176400 bytes/sec = 44100 Hz x 2 ch x 2
| bytes/sample/ch.
| |
| | 176400 bytes 44100 samples 2 channels 2 bytes
| | ----------- = ------------ x x ---------
| | sec sec sample /
channel
| |
| | Channels and samples cancel, leaving bytes and sec. 176400
| bytes/sec.
| | In kbps, that's 176400*8= 1411200/1000= 1411.2 kbps.
| |
| | Here, LITTLE k (as in kbps) is 1000. Little b is bits.
| | BIG K, as in KB, is 1024. Big B is bytes.
| |
| | Usually, you only see kb -OR- KB, but not kB, nor Kb. Why?
| | Because bits are usually described in 1000s (k) of,
| | and bytes in 1024s (K) of. The K symbol was purposely
| | made "close to" the standard metric k symbol (kilo = 1000)
| | since 1024 is "close to" 1000, but a little more (capital
K).
| |
| | (You'd have known this if you didn't skip that day.)
| |
| | --
| | 40th Floor - Software @ http://40th.com/
| | iPlay : the ultimate audio player for iPAQs
| | mp3, ogg, mp4, m4a, aac, wav, play & record
| | parametric eq, xfeed, reverb - all on a ppc
|
|
| The number of bits has nothing to do with speed. It is simply
the width
| of each sample. For the audio CD, there are 44,100 of these
16-bit
| samples for every second of sound. For the ultimate DVD audio
disc there
| are 192,000 24-bit samples for every second of sound. The width
of the
| sample, be it 16 bits or 24, determines the dynamic range you
can
| capture in digital audio. The rule of thumb is to multiply the
bits by 6
| to get the maximum dynamic range capability. For 16 bits, that
would be
| 16*6 or 96 db, and for 24 bits it would be 24*6=144db. In
reality, due
| to limitations in electronics and such, 24 bit systems haven't
even come
| close to 144 db capabilty. It's more like 108-110db, and 16 bit
systems
| are around 80 to 85 db.
|
| CD

You either didn't read or jumped to a conclusion. Rather than
argue, why didn't you just phrase a reasonable answer for the
guy? You're wrong, but you're right. And I'm right, in the
context he wanted to know, IMO. If not, then he has to say so.


  #8   Report Post  
datanet
 
Posts: n/a
Default

Woof! EXCELlent response! Way to go.\

Pop


"Dick Pierce" wrote in message
om...
| "datanet" wrote in message
...
| Each WAV file is saved with certain parameters.
| The following is a typical example of such parameters ...
|
| PCM, 44.1kHz, 16bit, stereo, 172 kbps
|
| I know what the "44.1kHz" (sampling rate) means,
| and what the "172kbps" (bitrate) means.
|
| But what does the "16bit" refer to?
|
| Is there a FAQ that explains these items?
| Hmm, good reply, hel. Only your forgot 16 bits: That's the
| width of the data stream, meaning 16 bits at a time is
| transmitted in the data stream. Or, ultra-simply, a
combination
| of 16 zeroes and ones of data at a time is transmitted around
the
| system. The wider the virtual or literal buss (bit rate),
the
| faster things are internally and the less problem with
| bottlenecks. 32 bit is twice as fast as 16 bit (well, about,
| anyway, neglecting some added control bits) and so on.
|
| Ultra simple indeed but, unfortunately, ultra wrong. The 16
bits
| cited has NOTHING WHATSOEVER to do with the "width of the data
| stream." All the hoo-ha above about "vritual or literal buss"
and
| 16 vs 32 bits and bottlenecks is utterly irrelevant to the
question
| at hand.
|
| It is simply declaring that each audio sample will have
| a precision of 16 bits.
|
| Now, to answer the original question definitively: A WAVE file
| is one particular kind of RIFF (Resource Interchange File
Format)
| file. A RIFF file is composed of individual "chunks" of data,
each
| chunk is dientified by a 4-character chunk identifier and a 32
bit
| chunk size (in bytes).
|
| A WAVE file MUST have at least 2 such chunks to be valid:
|
| 1. The 'fmt ' chunk: this holds the parameters describing the
| properties of the audio data, and, in fact, it is the
contents
| of the 'fmt ' chunk that you are seeing. The 'fmt ' chunk is
| broken down as follows:
|
| wFormatTag - a 16-bit integer holding a format indicator for
| the file. SOme examples of which a
|
| 1 - PCM
| 2 - ADPCM
| 6 - A-Law
| 7 - u-Law
| 48 - Dolbay AC3
| 80 - MPEG
|
| nChannels - 16 bint integer of the number of audio channels
| in the file
|
| nSamplesPerSecond - 32 bit unsigned integer of the sample
rate
| of the audio
|
| nAvgBytesPerSecond - 32 bit unsigned integer of the average
| buffer rate for the audio.
|
| nBlockAlign - block alignment (in bytes) for the waveform
| data.
|
| And, if the wFormatTag is 1 (PCM), the fmt chunk is extended
to
| include:
|
| nBitsPerSample - number of bits per sample.
|
| If it is another format, the chunk extension will include
relevant
| formation for that format.
|
| 2. the 'data' chunk. This actually hold the audio data samples,
as
| described in the 'fmt ' chunk.
|
| There may well be other chuncks as well. FOr example, EBU Tech
3285
| describes the so-called "Broadcast Wave Format" extensions,
while
| AES-46 describes additional chunk information for use in radio
| applications.
|
| So your are, in essence, seeing a human readable "dump" of the
| contents
| of the 'fmt ' chunk. Your reading:
|
| "PCM, 44.1kHz, 16bit, stereo, 172 kbps"
|
| is directly mappable as follows:
|
| wFormatTag = 1 "PCM"
| nChannels = 2 "Stereo"
| nSamplesPerSecond = 44100 "44.1 kHz"
| nAvgBytesPerSecond = 176400 "172 kbps" (a typo, perhaps?)
| nBlockAlign = 4
| wBitsPerSample = 16 "16 bits"
|
| Now, nBlockALign is calculated as follows: each sample holds
| 16 bits. A byte is 8 bits, so 2 bytes are required to hold
| a single sample. The file is in stereo, so two samples are
| required (one for each channel) at each sample block, thus
| 2 (bytes/channel) * 2 channels = 4 bytes.
|
| Then, the nAvgBytesPerSecond is (in the case of PCM), simply
| the number of bytes per sample block times the number of sample
| blocks per second, 44,100 * 4 = 176,400. Now, they used the
term
| "kilo" in the wrong sense here, they used the disk convention
of
| 1024 as opposed to the normal 1000, thus 176,400/1024 = 172.26
| kiloBYTES per second.
|
| Now, the "16 bits" describes, as I mentioned, the width of each
| sample word, or its "precision." A 16 bit sample can represent,
| on a per-sample basis, one of 2^16 or some 65,536 unique
values.
| You can also view this as describing how wide a dynamic range
| that a sample can represent. In the case of 16 bits, this is a
| dynamic range of around 96 dB, that is, the difference between
| the lowest unambiguous value and the highest that can be
represented
| by a 16 bit sample stream is 96 dB.
|
| But, please, ignore the above nonsense about "virtual busses"
and
| "bottlenecks," it's a load of irrelevant hooey.


Reply
Thread Tools
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
wave file duration incorrect x-cons Pro Audio 7 December 18th 03 05:32 PM
Fostex MR-8 WAV file conversion question Jaxon Bridge Pro Audio 0 December 11th 03 06:04 PM
What is a .dat file? HiC Pro Audio 1 October 11th 03 10:43 PM
Q: SoundForge copying markers from one file to another? Mad Scientist Pro Audio 4 July 29th 03 03:23 PM
Repost: Reason 2.0 on a Celeron 2GHz laptop. Scott Elliott Birch General 17 July 7th 03 11:20 PM


All times are GMT +1. The time now is 12:11 AM.

Powered by: vBulletin
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 AudioBanter.com.
The comments are property of their posters.
 

About Us

"It's about Audio and hi-fi"