Conference PaperPDF Available

Investigation of various algorithms on multichannel audio compression

November 2017

November 2017

DOI:10.1109/ICSIMA.2017.8311985

Conference: 2017 IEEE 4th International Conference on Smart Instrumentation, Measurement and Application (ICSIMA)

Authors:

Teddy Surya Gunawan

International Islamic University Malaysia

Mira Kartiwi

International Islamic University Malaysia

5.1 Multichannel Speakers Setup [10]

…

7.1 Multichannel Speakers Setup [10] III. AUDIO COMPRESSION ALGORITHMS Of the various lossless and lossy compression algorithms available, Dolby AC3, AAC and Ogg Vorbis have been selected as lossy compression algorithms, while FLAC and MPEG-4 ALS has been chosen as lossless compression algorithms. It is selected due to its capability to encode multichannel audio and its popularity.

…

AAC encoder C. Ogg Vorbis Ogg Vorbis is a full open, non-proprietary, patent and royalty free compression audio format. Fig. 6 illustrates the possible implementation of Ogg Vorbis encoder. It is based on vector quantization and transformation with overlapping windows, i.e. modified discrete cosine transform (MDCT). Each windows can have 2048 or 512 samples. The shorter one is used only to encode a transient signals. After transformation to frequency domain, the signal is analyzed by psychoacoustic model and inaudible part of the spectrum is removed. Then the floor vector is generated for each of the channels.

…

MPEG-4 ALS encoder

…

Figures - uploaded by Teddy Surya Gunawan

Content may be subject to copyright.

Content uploaded by Teddy Surya Gunawan

Content may be subject to copyright.

Proc. of the 4th IEEE International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)

28-30 November 2017, Putrajaya, Malaysia

Investigation of Various Algorithms on

Multichannel Audio Compression

Teddy Surya Gunawan#, Siti Aisyah Abdul Rashid#, Mira Kartiwi*

#Electrical and Computer Engineering Department, International Islamic University Malaysia

*Information Systems Department, International Islamic University Malaysia

Corresponding email: tsgunawan@iium.edu.my

second.com

Abstract — Multichannel audio or surround sound

compression is rather more challenging to compress compare to

mono and stereo audio. Nowadays, many methods and

algorithms have been proposed to improve the compression

performance on multichannel audio. This book focuses on

performance evaluation of various algorithms on multichannel

audio compression. First, we identified and investigated current

state-of-the-art audio compression algorithms, both lossless and

lossy compression, which can handle mono, stereo, 5.1, and 7.1

multichannel audio. Out of various algorithms available, AC3,

AAC, and Ogg have been selected as lossy compression

algorithms, while FLAC and MPEG-4 ALS have been chosen as

lossless compression algorithms. Two performance measure were

used in the experiments, i.e. compression ratio and encoding

time. The results showed that among three lossy audio

compression algorithms, AC3 has the fastest encoding time while

Ogg Vorbis has the highest compression ratio. Furthermore,

between FLAC and MPEG-4 ALS, FLAC has faster encoding

time and MPEG-4 ALS has higher compression ratio. Overall, in

terms of encoding time and compression ratio, it has been found

that FLAC is the fastest coder while Ogg Vorbis has the highest

compression ratio among five encoders evaluated.

Keywords—multichannel audio; lossless compression; lossy

compression; encoding time; compression ratio.

I. INTRODUCTION

Multichannel audio systems are widely used in modern

sound devices. Usually, two digits separated by a decimal

point, e.g. 2.1, 4.1, 5.1, 6.1, 7.1, are used to classify the various

kinds of speaker set-up [1, 2]. This number represents the

number of audio tracks used. Some audio systems only have a

single channel or two channels (stereophonic sound or 2.0

channel sound). The first digit shows the number of primary

channels, i.e. satellite units, each of which are reproduced on a

single speaker which has the capability to handle range of

frequency between 100Hz to 22 kHz. On the other hand, the

second digit (decimal digit) represents the presence of LFE

(Low Frequency Effect) that is reproduced on a subwoofer.

Moreover, surround system describes a type of audio output in

which the sound appears to surround the listener by 360

degrees, in which it gives impression that sound are coming

from all possible directions. It has been used to provide a more

realistic and engaging experience [3].

There are two kinds of audio compression algorithm those

are lossy and lossless. Lossy audio compression is known by

their well-designed system to shrinks file sizes. Advanced

Audio Coding (AAC), MPEG-1 Layer III (MP3), Dolby AC-3,

Opus, OGG Vorbis [4] and Windows Media Audio Lossy

(WMA lossy) are the examples of prevalent foremost lossy

audio coding system [5]. AAC can be considered as the most

influential multichannel audio coding algorithm [6]. This is due

to its ability to support audio channels up to 48 channels and

contribute lossless audio for 5.1 channels at sampling rates 320

kbits/s. Meanwhile, AC3 provides high audio quality at

384kbit/s [7].

Meanwhile, the most well-known codec in lossless

algorithm are Free Lossless Audio Codec (FLAC), Apple

Lossless Audio Codec (ALAC), Waveform Audio File (WAV),

MPEG-4 Audio lossless [8], True Audio (TTA) [9]. Each of

the codec, have the own domain and advantage to encode and

decode the audio. Lossless methods do not have any loss

information and provide an exact replica of the original signal.

Although many research has been conducted on lossless

and lossy audio compression, but not many researches have

been focused on the multichannel audio coding. Therefore, the

objective of this paper is to investigate the performance of

various audio compression algorithms to encode multichannel

audio in terms of encoding time and compression ratio.

II. MULTICHANNEL AUDIO

A. Monoaural and Stereophonic Audio

Fig. 1. Stereo Speakers Setup [10]

From analog audio, sampling and quantization are

conducted to represent the sound wave into digital

representation. A stereo signal can be considered as two

independent channels of audio information, i.e. left and right

channels. Stereophonic audio provides the impression of sound

localization [10]. Fig. 1 illustrates the stereo setup in a typical

living room.

B. 5.1 Multichannel Audio

Unlike mono and stereo audio, multi-channel audio format

designates in more than two channels. This type of audio

format aims to advance the ability of sound localization. As an

example, a 5.1 multichannel loudspeakers arrangement has

been illustrated in Fig. 2. The left and right channels placed at

±30˚ like in stereo audio. Meanwhile, the rear right and left

channel located at ±110˚. Usually, they are used for extended

sound source localizations interpretation. For center channel, 0˚

commonly for playing again voice contents in moving audio.

The decimal digit (.1) channel refer to subwoofer channel

which also recognize as LFE channel. This channel is for

playing back the low frequency contents. By adding more

surround loudspeaker to the two standard channels LS and RS,

it will create larger listening zone. This setup had been widely

used in cinema [10].

Fig. 2. 5.1 Multichannel Speakers Setup [10]

C. 7.1 Multichannel Audio and Beyond

Multichannel audio 7.1 is a further enhancement to 5.1

audio channels. There are other two side-surround speaker in

the speaker configuration. Many of application used 7.1 audio

in order to greater impact of surround sound. The loudspeakers

arrangement is almost similar to 5.1 multichannel audio.

However, there are another two speaker left and right rear

which about ±135˚to surround sound. Fig. 3 shows the setup

configuration of multichannel 7.1 audio.

Beyond 7.1 multichannel audio, 10.2 channel surround

sound has been developed. It is the advanced version of 5.1

technology, but 10.2 could produce twice as good as 5.1. In this

channel configuration. 14 channels are used to including five

front speakers, five surround channels, two LFE and two

heights, plus the addition of a second sub-woofer [10].

Fig. 3. 7.1 Multichannel Speakers Setup [10]

III. AUDIO COMPRESSION ALGORITHMS

Of the various lossless and lossy compression algorithms

available, Dolby AC3, AAC and Ogg Vorbis have been

selected as lossy compression algorithms, while FLAC and

MPEG-4 ALS has been chosen as lossless compression

algorithms. It is selected due to its capability to encode

multichannel audio and its popularity.

A. Dolby AC3 Encoder

AC3 is Dolby Audio Codec 3/Advanced Codec 3/Acoustic

Coder 3 which refers to multichannel compression technology

that has been developed by Dolby Laboratories. The objective

of this codec is to compress audio as similar as possible to the

original signal while using a minimum bit rate. AC3 had been

practice at cinema due to its outstanding sound system. Fig. 4

shows Dolby AC3 encoder mechanisms.

Fig. 4. Dolby AC3 Encoder

At the AC3 encoder process, the algorithm will use MDCT

in audio transformation from time to frequency domain. Then,

transform coefficients will be grouped into non uniform

subbands. The subbands are approximately the critical bands of

human auditory system. From that, transform coefficients

within one subband are converted to a floating-point

representation, with one or more mantissas per exponent. The

exponents are encoded by a suitable strategy according to time

and frequency resolution and then go into the psychoacoustic

model. In psychoacoustic model, the perceptual resolution is

calculates according to the encoded exponents and the proper

perceptual parameters.

B. Advanced Audio Coding (AAC)

AAC leads MP3 as there is a new non-backward

compatible audio coder introduced in [1, 6]. It becomes popular

due to application in Apple iTunes. Fig. 5 illustrates an AAC

encoder. AAC operates MDCT transform only in its main

coding loop and transient detection function to detect a long

window of 2048 points or a serial set of eight 256 point

windows is ready for the MDCT transform. Thus, this give

high frequency resolution of 23Hz and 2.7ms for a signal

sampled at 48 kHz. A gain control procedure is incorporated in

the SSR profile of AAC. A Pseudo Quadrature Mirror Filter

(PQMF) filter bank is used to split the signal into four

subbands with same bandwidth. The original signal sampling

rates reduced to quarters by discarding one or more subbands.

AAC utilizes the temporal-noise-sharping technique to expel

the pre-echo effect caused by transients. Based on subjective

evaluations, AAC provides great audio for 5 channel

bandwidth at bit rate of 320kbps.

Fig. 5. AAC encoder

C. Ogg Vorbis

Ogg Vorbis is a full open, non-proprietary, patent and

royalty free compression audio format. Fig. 6 illustrates the

possible implementation of Ogg Vorbis encoder. It is based on

vector quantization and transformation with overlapping

windows, i.e. modified discrete cosine transform (MDCT).

Each windows can have 2048 or 512 samples. The shorter one

is used only to encode a transient signals. After transformation

to frequency domain, the signal is analyzed by psychoacoustic

model and inaudible part of the spectrum is removed. Then the

floor vector is generated for each of the channels.

Fig. 6. Ogg Vorbis encoder

D. Free Lossless Audio Coder (FLAC)

Free Lossless audio coding (FLAC) is quite famous among

lossless codec due its fastest decoding audio. FLAC uses a

linear prediction mathematical (LPC) operation where future

values of the digital signal are estimated as a linear function of

previous samples. The FLAC encoder first divide the input

audio signal into frames. Then, it will conduct an interchannel

decorrelation. The predictor is then attempts to find an

optimum coefficients to predict the signal. Lastly, the predictor

coefficients and its residue were passed to entropy coding.

E. MPEG-4 Audio Lossless Coding (ALS)

MPEG-4 Audio Lossless Coding (ALS) standard is derived

from MPEG-4 audio coding standard. This codecs feature is to

preserve every single bit of the original audio data. ALS

provides method for lossless coding of audio signals with

arbitrary sampling rates, resolutions of up to 32-bit and up to

216 channels, also including 32-bit floating-point signals.

Thus, virtually all known input formats from CD quality 44.1

kHz, 16-bit to high-end audio multichannel can be supported.

Fig. 6 illustrates the ALS encoder.

Fig. 6. MPEG-4 ALS encoder

IV. RESULTS AN DISCUSSION

In this section, the audio encoders implementation, audio

database, as well as performance evaluation in terms of

encoding time and compression ratio will be conducted. To

simplify the experiments, this section will focus only on 5.1

and 7.1 multichannel audio.

A. Implementation

Table I shows the audio encoder along with its software

implementation. NeroAACEnc version 1.5.4.0 release in

February 2010 was used for AAC encoding. Oggenc2 v2.88

(libvorbis 1.3.5) was used for Ogg encoding. Finally, FFmpeg

was used for AC3, FLAC, and MPEG-4 ALS encoding.

TABLE I. AUDIO ENCODER AND ITS IMPLEMENTATION

Algorithm Audio Format Extensions Software

Lossy

AAC .aac NeroAACEnc

AC3 .ac3 FFmpeg

OGG Vorbis .ogg oggenc2

Lossless FLAC .flac FFmpeg

MPEG-4 ALS .m4a FFmpeg

Fig. 7 shows the flowchart of the overall implementation

using Matlab. The program will loop through five encoders, i.e.

AAC, Ogg, AC3, FLAC, and MPEG-4 ALS, and five files

each for mono, stereo, 5.1, and 7.1 multichannel audio signals.

For each file and each encoder, the compression ratio is

recorded. System call, i.e. dos() function in Matlab, is

employed to access the encoder executable file. For accuracy,

the program will loop 10 times for each coder, in which the

average value of encoding time will be recorded.

Fig. 7. Overall Implementation using Matlab

B. Audio Database

Various audio signal in the original WAV format are

collected from internet. Table II shows the audio database used

for experimentation on 5.1 and 7.1 multichannel audio.

TABLE II. AUDIO DATABASE FOR MULTICHANNEL AUDIO

Channel File Name Details

5.1

Five1.wav 47 seconds, 44100 Hz

Five2.wav 9 seconds, 48000 Hz

Five3.wav 131 seconds, 44100 Hz

Five4.wav 300 seconds, 44100 Hz

Five5.wav 125 seconds, 44100 hz

7.1

Seven1.wav 32 seconds, 48000 Hz

Seven2.wav 14 seconds, 48000 Hz

Seven3.wav 95 seconds, 48000 Hz

Seven4.wav 19 seconds, 48000 Hz

Seven5.wav 4 seconds, 48000 Hz

C. Experiments on Lossy Compression

The AAC, AC3 and OGG are encoder that will compress

the original audio WAV format which is lossless to lossy

format. As the encoding is conducted in Matlab, the output

from each type of audio is shown in Table III and IV. The

columns ‘T_AAC’, ‘T_OGG’ and‘T_AC3’ indicate the

encoding time of audio files in AAC, OGG and AC3.

Meanwhile, the columns ‘C_AAC’, ‘C_OGG’ and ‘C_AC3’

specify the compression ratio of audio files in AAC, OGG and

AC3. In order to compare the best time processing and

compression ratio, yellow and blue highlight is used to identify

them. The blue color is for the best compression ratio in each

audio file. On the other hand, the least processing time is

shown by yellow color.

TABLE III. LOSSY COMPRESSION FOR 5.1 MULTICHANNEL AUDIO

5.1 AAC AC3 OGG

Audio T_AAC C_AAC T_AC3 C_AC3 T_OGG C_OGG

Five1 5.488 32.62 0.598 9.454 3.077 21.293

Five2 1.118 43.293 0.203 10.274 0.651 24.731

Five3 8.114 10.243 1.551 9.45 8.458 17.162

Five4 20.056 10.333 3.511 9.45 18.712 18.258

Five5 7.788 10.595 1.45 9.449 8.011 18.467

TABLE IV. LOSSY COMPRESSION FOR 7.1 MULTICHANNEL AUDIO

7.1

Audio

AAC AC3 OGG

T_AAC C_AAC T_AC3 C_A C3 T_OGG C_OGG

Seven1 4.393 20.559 0.591 13.719 3.211 11.502

Seven2 9.336 117.141 0.244 20.546 0.761 229.361

Seven3 20.404 38.271 1.442 20.569 7.132 47.283

Seven4 13.572 55.577 0.315 13.694 1.218 97.631

Seven5 1.78 38.867 0.189 12.558 0.353 91.7

D. Experiments on Lossless Compression

There are two encoders for lossless to lossless compression.

Both MPEG 4 ALS and FLAC had been implementing in

Matlab to synthesis the time processing and compression ratio

between each file. The columns ‘T_FLAC’ and ‘T_M4A’

indicates the encoding time which the file was encoded by

FLAC and MPEG 4ALS. Compression ratio is showed by

‘C_FLAC’ and ‘C_M4A’ column. The blue highlight the

highest compression ratio in each audio file. Meanwhile,

yellow highlight the smallest time processing of audio while

encoding is done by FLAC and MPEG 4 ALS.

TABLE V. LOSSLESS COMPRESSION OF 5.1 MULTICHANNEL AUDIO

5.1 Audio

FLAC MPEG4 ALS

T_FLAC C_FLAC T_M4A C_M4A

Five1 0.468 7.035 5.723 12.337

Five2 0.168 9.715 1.784 13.008

Five3 1.447 3.003 13.141 12.344

Five4 2.855 3.488 31.625 12.328

Five5 1.348 3.366 12.94 12.337

TABLE VI. LOSSLESS COMPRESSION OF 7.1 MULTICHANNEL AUDIO

7.1 Audio

FLAC MPEG4 ALS

T_FLAC C_FLAC T_M4A C_M4A

Seven1 0.474 4.002 7.45 12.454

Seven2 0.266 27.338 1.34 93.798

Seven3 1.764 4.418 21.677 23.488

Seven4 0.292 30.142 3.301 39.292

Seven5 0.142 8.437 0.598 21.469

E. Discussion

All the lossy and lossless encoders had been evaluated in

Matlab to differentiate which one is the best coder to encode

audio in terms of encoding speed (time processing) and

compression ratio. There are 100 files are being encoded in

AAC, AC3, OGG (lossy compression), and FLAC, MPEG-

ALS (lossless compression).

Table III to IV shows the results in lossy compression. The

evaluation is conducted by signify which encoder of each audio

has the possibility to have smallest result in encoding time

represent in yellow and biggest compression ratio represent in

blue. As we compare them in each file, the average best results

seem goes to AC3 encoder for encoding time. Around 100% of

audio file have AC3 for the faster processing. These amounts

show AC3 can encode audio file in the fastest way than AAC

and AC3. For compression ratio, OGG encoder compressed the

audio better among other codec. Around 80% audio files have

largest compression ratio on OGG, followed by 20% audio

files on AAC.

Table V and VI shows the results in lossless compression.

The proportion of encoding time is best at FLAC encoder. This

is due to absolute 100% of audio file in m 5.1 and 7.1 audio

have minimum time compared to MPEG4 ALS encoder. In

contrast, the best compression ratio for lossless algorithm is

MPEG 4 ALS encoder as all audio file has largest ratio in the

encoder.

From the observations of lossy encoder comparison, we can

examine that the best encoder for encoding time is AC3.This

encoder give good result in encoding time at 5.1 and 7.1

multichannel audio. AC3 is matured in term of encoding time

compared to other codec. In lossless audio, when the encoders

encode all the audio in 5.1 and 7.1 multichannel audio, the

finding for the best encoding time and compression ratio give a

consistent result. The entire 10 audio file shows similar pattern.

FLAC is the best to encode audio at smallest speed compare to

MPEG-4 ALS. However, in term of compression ratio, MPEG-

4 ALS performs better than FLAC.

V. CONCLUSIONS AND FUTURE WORKS

This paper has presented the performance evaluation of

three lossy and two lossless audio compression evaluated on

5.1 and 7.1 multichannel audio signals. It has been found that

among three lossy audio compression algorithms, AC3 has the

fastest encoding time while Ogg Vorbis has the highest

compression ratio. Furthermore, between FLAC and MPEG-4

ALS, FLAC has faster encoding time and MPEG-4 ALS has

higher compression ratio. Overall, in terms of encoding time

and compression ratio, it has been found that FLAC is the

fastest coder while Ogg Vorbis has the highest compression

ratio among five encoders evaluated. Future works could

include the optimize parameters of each audio compression

algorithm to better evaluate its performance.

ACKNOWLEDGMENT

The authors would like to express their gratitude to the

Malaysian Ministry of Higher Education (MOHE), which has

provided funding for the research through the Fundamental

Research Grant Scheme, FRGS15-194-0435.

REFERENCES

[1] M. Bosi and R. E. Goldberg, Introduction to digital audio coding and

standards, vol. 721, Springer Science & Business Media, 2012.

[2] F. Rumsey, Spatial audio, CRC Press, 2012.

[3] F. Schuh, S. Dick, R. Füg, C. R. Helmrich, N. Rettelbach, and T.

Schwegler, "Efficient multichannel audio transform coding with low

delay and complexity," in Audio Engineering Society Convention 141,

pp., 2016.

[4] J. Moffitt, "Ogg Vorbis—open, free audio—set your media free," Linux

journal, vol. 2001, pp. 9, 2001.

[5] W. Jackson, "Audio Concepts, Terminology, and Codecs," in Android

Apps for Absolute Beginners: Springer, 2014, pp. 651-663.

[6] M. Bosi, K. Brandenburg, S. Quackenbush, L. Fielder, K. Akagiri, H.

Fuchs, and M. Dietz, "ISO/IEC MPEG-2 advanced audio coding," Journal

of the Audio engineering society, vol. 45, pp. 789-814, 1997.

[7] R. Hennequin, J. Royo-Letelier, and M. Moussallam, "Codec independent

lossy audio compression detection," in Acoustics, Speech and Signal

Processing (ICASSP), 2017 IEEE International Conference on, pp. 726-

730, 2017.

[8] T. Liebchen, T. Moriya, N. Harada, Y. Kamamoto, and Y. Reznik, "The

MPEG-4 Audio Lossless Coding (ALS) standard-technology and

applications," in Proc. 119th AES Conv, pp., 2005.

[9] A. Djuric, "TTA Lossless audio codec-True audio compressor

algorithms," 2010.

[10] T. Holman, Surround sound: up and running, CRC Press, 2014.

Performance Evaluation of Multichannel Audio Compression

Article

Full-text available

Apr 2018

In recent years, multichannel audio systems are widely used in modern sound devices as it can provide more realistic and engaging experience to the listener. This paper focuses on the performance evaluation of three lossy, i.e. AAC, Ogg Vorbis, and Opus, and three lossless compression, i.e. FLAC, TrueAudio, and WavPack, for multichannel audio signals, including stereo, 5.1 and 7.1 channels. Experiments were conducted on the same three audio files but with different channel configurations. The performance of each encoder was evaluated based on its encoding time (averaged over 100 times), data reduction, and audio quality. Usually, there is always a trade-off between the three metrics. To simplify the evaluation, a new integrated performance metric was proposed that combines all the three performance metrics. Using the new measure, FLAC was found to be the best lossless compression, while Ogg Vorbis and Opus were found to be the best for lossy compression depends on the channel configuration. This result could be used in determining the proper audio format for multichannel audio systems. © 2018 Institute of Advanced Engineering and Science. All rights reserved.

RED: An Intelligent Edge based Speaker System with Ambient Sensing Technology

Conference Paper

Jul 2022

Comparative analysis of the quality of recorded sound in the function of different recording formats

Article

Full-text available

Sep 2022

In article, the quality of the following encoders was analyzed: mp3, AAC, wma and OGG Vorbis. An original graphic method was used to carry out the quantitative research. It consists in comparing the number of pixels (representing data) between the spectrogram of a wav file and the spectrograms of files compressed with different codecs and bit rates. It has been shown that the Ogg Vorbis encoder retains the most data from the uncompressed wav sample in all tested bit rates (128KBit / s, 160KBit / s, 320KBit / s).

Subjective Evaluation of Music Compressed with the ACER Codec Compared to AAC, MP3, and Uncompressed PCM

Article

Full-text available

Jul 2019

Audio data compression has revolutionised the way in which the music industry and musicians sell and distribute their products. Our previous research presented a novel codec named ACER (Audio Compression Exploiting Repetition), which achieves data reduction by exploiting irrelevancy and redundancy in musical structure whilst generally maintaining acceptable levels of noise and distortion in objective evaluations. However, previous work did not evaluate ACER using subjective listening tests, leaving a gap to demonstrate its applicability under human audio perception tests. In this paper, we present a double-blind listening test that was conducted with a range of listeners (N=100). The aim was to determine the efficacy of the ACER codec, in terms of perceptible noise and spatial distortion artefacts, against de facto standards for audio data compression and an uncompressed reference. Results show that participants reported no perceived differences between the uncompressed, MP3, AAC, ACER high quality, and ACER medium quality compressed audio in terms of noise and distortions but that the ACER low quality format was perceived as being of lower quality. However, in terms of participants’ perceptions of the stereo field, all formats under test performed as well as each other, with no statistically significant differences. A qualitative, thematic analysis of listeners’ feedback revealed that the noise artefacts that produced the ACER technique are different from those of comparator codecs, reflecting its novel approach. Results show that the quality of contemporary audio compression systems has reached a stage where their performance is perceived to be as good as uncompressed audio. The ACER format is able to compete as an alternative, with results showing a preference for the ACER medium quality versions over WAV, MP3, and AAC. The ACER process itself is viable on its own or in conjunction with techniques such as MP3 and AAC.

Application of Smart Audio Based on Mixed Reality Technology in Media Fusion

Article

Jan 2021
MICROPROCESS MICROSY

A company of gadgets is the leading representative of complex reality technology. Do have differences in technical design and application of measurement and comparison of technology. For each technical specification and representation, and support the software functions to compare and consider. Smart Audio is based on the complex reality in the complicated fact of media convergence first worker. Because he worked for as long as he is in this Computer, the most widely used operating system on media convergence is worn on the head. It supports a full range of programs that run on this platform. It works equipment is necessary, as long as the device itself is used as a monitor, only the complex reality, and has a strong enough PC. It is very similar and does not require media integration, attached to a Personal Computer (PC) connected to a separate device. Van is very similar. It does not require the media's integration, an additional stand-alone device connected to the PC. Currently, it uses its operating system, which limits its range of small software. Virtual reality technology, complex reality, should be noted and allows to add digital content with the real world without losing visual contact with others. As a result, technology in education, where the application is called Fusion Applications media. Fusion Media will support through video link and SMS data analysis tools to encourage communication and reflective learning.

Multichannel audio steganography based on MPEG surround using direct sequence spread spectrum

Article

Full-text available

Sep 2019

Audio steganography is a technique for embedding hidden message on the audio signal. Several techniques are currently available, proposed as methods to hide secret messages on audio signals and integrated into the audio encoding system. In this paper, a data hiding technique is proposed working based on MPEG Surround (MPS), a multichannel audio encoding standard that is very popular for spatial or three-dimensional (3D) audio coding. Direct sequence spread spectrum (DSSS) is integrated with MPS to embed a secret message into the downmix signal of multichannel audio that generated from MPS encoder. The result of experiments shows that the audio that produced by the proposed system is still in acceptable quality and signal synchronization can run smoothly. The mean value of the signal to noise ratio (SNR) of the audio signal is 16.67 dB. The secret message can be successfully extracted with a mean value of bit error rate (BER) of 3.38%, and normalized correlation (NC) is 95%.

Codec independent lossy audio compression detection

Conference Paper

Mar 2017

Audio Concepts, Terminology, and Codecs

Chapter

Aug 2014

Wallace Jackson

This appendix will help get you to get up to speed on the foundation of audio, as well as on digital audio concepts, terminology, and codecs (file formats) supported in the Android OS.

The MPEG-4 audio lossless coding (ALS) standard - Technology and applications

Article

Jan 2005

MPEG-4 Audio Lossless Coding (ALS) is a new extension of the MPEG-4 audio coding family. The ALS core codec is based on forward-adaptive linear prediction, which oers remarkable compression together with low complexity. Additional features include long-term prediction, multichannel coding, and compression of floating-point audio material. In this paper authors who have actively contributed to the standard describe the basic elements of the ALS codec with a focus on prediction, entropy coding, and related tools. We also present latest developments in the standardization process and point out the most important applications of this new lossless audio format.

ISO/IEC MPEG-2 advanced audio coding

Article

Oct 1997

The ISO/IEC MPEG-2 advanced audio coding (AAC) system was designed to provide MPEG-2 with the best audio quality without any restrictions due to compatibility requirements. The main features of the AAC system (ISO/IEC 13818-7) are described. MPEG-2 AAC combines the coding efficiency of a high-resolution filter bank, prediction techniques, and Huffman coding with additional functionalities aimed to deliver very high audio quality at a variety of data rates.

Ogg Vorbis—Open, Free Audio—Set Your Media Free

Article

Jan 2001

Jack Moffitt

Ogg Vorbis is the Open Source Community's hot alternative to MP3.

Efficient multichannel audio transform coding with low delay and complexity

Jan 2016

F Schuh
S Dick
R Füg
C R Helmrich
N Rettelbach
T Schwegler

F. Schuh, S. Dick, R. Füg, C. R. Helmrich, N. Rettelbach, and T. Schwegler, "Efficient multichannel audio transform coding with low delay and complexity," in Audio Engineering Society Convention 141, pp., 2016.

TTA Lossless audio codec-True audio compressor algorithms

Jan 2010

A Djuric

A. Djuric, "TTA Lossless audio codec-True audio compressor algorithms," 2010.

Investigation of various algorithms on multichannel audio compression

Figures

Recommended publications

Perancangan dan Analisis Kinerja Pengkodean Audio Multichannel Dengan Metode Closed Loop

Multichannel audio coding based on minimum audible angles

Progressive multichannel audio codec (PMAC) with rich features

Performance Evaluation of Multichannel Audio Compression

On the Characteristics of Various Quranic Recitation for Lossless Audio Coding Application

Performance analysis of IEEE 1857.2 lossless audio compression linear predictor algorithm

On the Comparison of Line Spectral Frequencies and Mel-Frequency Cepstral Coefficients Using Feedfor...