{"product_id":"mpeg-7-audio-and-beyond-isbn-9780470093344","title":"MPEG-7 Audio and Beyond","description":"Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data, by providing an established framework for effective multimedia management. \u003ci\u003eMPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval\u003c\/i\u003e is a unique insight into the technology, covering the following topics:  \u003cul\u003e \u003cli\u003ethe fundamentals of MPEG-7 audio, principally low-level descriptors and sound classification and similarity;\u003c\/li\u003e \u003cli\u003espoken content description, and timbre, melody and tempo music description tools;\u003c\/li\u003e \u003cli\u003eexisting MPEG-7 applications and those currently being developed;\u003c\/li\u003e \u003cli\u003eexamples of audio technology beyond the scope of MPEG-7.\u003c\/li\u003e \u003c\/ul\u003e \u003cp\u003eEssential reading for practising electronic and communications engineers designing and implementing MPEG-7 compliant systems, this book will also be a useful reference for researchers and graduate students working with multimedia database technology.\u003c\/p\u003e  \u003cb\u003eList of Acronyms.\u003c\/b\u003e  \u003cp\u003e\u003cb\u003eList of Symbols.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e\u003cb\u003e1. Introduction.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e1.1 Audio Content Description.\u003c\/p\u003e \u003cp\u003e1.2 MPEG-7 Audio Content Description – An Overview.\u003c\/p\u003e \u003cp\u003e1.2.1 MPEG-7 Low-Level Descriptors.\u003c\/p\u003e \u003cp\u003e1.2.2 MPEG-7 Description Schemes.\u003c\/p\u003e \u003cp\u003e1.2.3 MPEG-7 Description Definition Language (DDL).\u003c\/p\u003e \u003cp\u003e1.2.4 BiM (Binary Format for MPEG-7).\u003c\/p\u003e \u003cp\u003e1.3 Organization of the Book.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e2. Low-Level Descriptors.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e2.1 Introduction.\u003c\/p\u003e \u003cp\u003e2.2 Basic Parameters and Notations.\u003c\/p\u003e \u003cp\u003e2.2.1 Time Domain.\u003c\/p\u003e \u003cp\u003e2.2.2 Frequency Domain.\u003c\/p\u003e \u003cp\u003e2.3 Scalable Series.\u003c\/p\u003e \u003cp\u003e2.3.1 Series of Scalars.\u003c\/p\u003e \u003cp\u003e2.3.2 Series of Vectors.\u003c\/p\u003e \u003cp\u003e2.3.3 Binary Series.\u003c\/p\u003e \u003cp\u003e2.4 Basic Descriptors.\u003c\/p\u003e \u003cp\u003e2.4.1 Audio Waveform.\u003c\/p\u003e \u003cp\u003e2.4.2 Audio Power.\u003c\/p\u003e \u003cp\u003e2.5 Basic Spectral Descriptors.\u003c\/p\u003e \u003cp\u003e2.5.1 Audio Spectrum Envelope.\u003c\/p\u003e \u003cp\u003e2.5.2 Audio Spectrum Centroid.\u003c\/p\u003e \u003cp\u003e2.5.3 Audio Spectrum Spread.\u003c\/p\u003e \u003cp\u003e2.5.4 Audio Spectrum Flatness.\u003c\/p\u003e \u003cp\u003e2.6 Basic Signal Parameters.\u003c\/p\u003e \u003cp\u003e2.6.1 Audio Harmonicity.\u003c\/p\u003e \u003cp\u003e2.6.2 Audio Fundamental Frequency.\u003c\/p\u003e \u003cp\u003e2.7 Timbral Descriptors.\u003c\/p\u003e \u003cp\u003e2.7.1 Temporal Timbral: Requirements.\u003c\/p\u003e \u003cp\u003e2.7.2 Log Attack Time.\u003c\/p\u003e \u003cp\u003e2.7.3 Temporal Centroid.\u003c\/p\u003e \u003cp\u003e2.7.4 Spectral Timbral: Requirements.\u003c\/p\u003e \u003cp\u003e2.7.5 Harmonic Spectral Centroid.\u003c\/p\u003e \u003cp\u003e2.7.6 Harmonic Spectral Deviation.\u003c\/p\u003e \u003cp\u003e2.7.7 Harmonic Spectral Spread.\u003c\/p\u003e \u003cp\u003e2.7.8 Harmonic Spectral Variation.\u003c\/p\u003e \u003cp\u003e2.7.9 Spectral Centroid.\u003c\/p\u003e \u003cp\u003e2.8 Spectral Basis Representations.\u003c\/p\u003e \u003cp\u003e2.9 Silence Segment.\u003c\/p\u003e \u003cp\u003e2.10 Beyond the Scope of MPEG-7.\u003c\/p\u003e \u003cp\u003e2.10.1 Other Low-Level Descriptors.\u003c\/p\u003e \u003cp\u003e2.10.2 Mel-Frequency Cepstrum Coefficients.\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e3. Sound Classification and Similarity.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e3.1 Introduction.\u003c\/p\u003e \u003cp\u003e3.2 Dimensionality Reduction.\u003c\/p\u003e \u003cp\u003e3.2.1 Singular Value Decomposition (SVD).\u003c\/p\u003e \u003cp\u003e3.2.2 Principal Component Analysis (PCA).\u003c\/p\u003e \u003cp\u003e3.2.3 Independent Component Analysis (ICA).\u003c\/p\u003e \u003cp\u003e3.2.4 Non-Negative Factorization (NMF).\u003c\/p\u003e \u003cp\u003e3.3 Classification Methods.\u003c\/p\u003e \u003cp\u003e3.3.1 Gaussian Mixture Model (GMM).\u003c\/p\u003e \u003cp\u003e3.3.2 Hidden Markov Model (HMM).\u003c\/p\u003e \u003cp\u003e3.3.3 Neural Network (NN).\u003c\/p\u003e \u003cp\u003e3.3.4 Support Vector Machine (SVM).\u003c\/p\u003e \u003cp\u003e3.4 MPEG-7 Sound Classification.\u003c\/p\u003e \u003cp\u003e3.4.1 MPEG-7 Audio Spectrum Projection (ASP) Feature Extraction.\u003c\/p\u003e \u003cp\u003e3.4.2 Training Hidden Markov Models (HMMs).\u003c\/p\u003e \u003cp\u003e3.4.3 Classification of Sounds.\u003c\/p\u003e \u003cp\u003e3.5 Comparison of MPEG-7 Audio Spectrum Projection vs. MFCC Features.\u003c\/p\u003e \u003cp\u003e3.6 Indexing and Similarity.\u003c\/p\u003e \u003cp\u003e3.6.1 Audio Retrieval Using Histogram Sum of Squared Differences.\u003c\/p\u003e \u003cp\u003e3.7 Simulation Results and Discussion.\u003c\/p\u003e \u003cp\u003e3.7.1 Plots of MPEG-7 Audio Descriptors.\u003c\/p\u003e \u003cp\u003e3.7.2 Parameter Selection.\u003c\/p\u003e \u003cp\u003e3.7.3 Results for Distinguishing Between Speech, Music and Environmental Sound.\u003c\/p\u003e \u003cp\u003e3.7.4 Results of Sound Classification Using Three Audio Taxonomy Methods.\u003c\/p\u003e \u003cp\u003e3.7.5 Results for Speaker Recognition.\u003c\/p\u003e \u003cp\u003e3.7.6 Results of Musical Instrument Classification.\u003c\/p\u003e \u003cp\u003e3.7.7 Audio Retrieval Results.\u003c\/p\u003e \u003cp\u003e3.8 Conclusions.\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e4. Spoken Content.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e4.1 Introduction.\u003c\/p\u003e \u003cp\u003e4.2 Automatic Speech Recognition.\u003c\/p\u003e \u003cp\u003e4.2.1 Basic Principles.\u003c\/p\u003e \u003cp\u003e4.2.2 Types of Speech Recognition Systems.\u003c\/p\u003e \u003cp\u003e4.2.3 Recognition Results.\u003c\/p\u003e \u003cp\u003e4.3 MPEG-7 \u003ci\u003eSpokenContent\u003c\/i\u003e Description.\u003c\/p\u003e \u003cp\u003e4.3.1 General Structure.\u003c\/p\u003e \u003cp\u003e4.3.2 \u003ci\u003eSpokenContentHeader.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e4.3.3 \u003ci\u003eSpokenContentLattice.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e4.4 Application: Spoken Document Retrieval.\u003c\/p\u003e \u003cp\u003e4.4.1 Basic Principles of IR and SDR.\u003c\/p\u003e \u003cp\u003e4.4.2 Vector Space Models.\u003c\/p\u003e \u003cp\u003e4.4.3 Word-Based SDR.\u003c\/p\u003e \u003cp\u003e4.4.4 Sub-Word-Based Vector Space Models.\u003c\/p\u003e \u003cp\u003e4.4.5 Sub-Word String Matching.\u003c\/p\u003e \u003cp\u003e4.4.6 Combining Word and Sub-Word Indexing.\u003c\/p\u003e \u003cp\u003e4.5 Conclusions.\u003c\/p\u003e \u003cp\u003e4.5.1 MPEG-7 Interoperability.\u003c\/p\u003e \u003cp\u003e4.5.2 MPEG-7 Flexibility.\u003c\/p\u003e \u003cp\u003e4.5.3 Perspectives.\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e5. Music Description Tools.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e5.1 Timbre.\u003c\/p\u003e \u003cp\u003e5.1.1 Introduction.\u003c\/p\u003e \u003cp\u003e5.1.2 \u003ci\u003eInstrumentTimbre.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.1.3 \u003ci\u003eHarmonicInstrumentTimbre.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.1.4 \u003ci\u003ePercussiveInstrumentTimbre.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.1.5 Distance Measures.\u003c\/p\u003e \u003cp\u003e5.2 Melody.\u003c\/p\u003e \u003cp\u003e5.2.1 \u003ci\u003eMelody.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.2.2 \u003ci\u003eMeter.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.2.3 \u003ci\u003eScale.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.2.4 \u003ci\u003eKey.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.2.5 \u003ci\u003eMelodyContour.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.2.6 \u003ci\u003eMelodySequence.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.3 Tempo.\u003c\/p\u003e \u003cp\u003e5.3.1 \u003ci\u003eAudioTempo.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.3.2 \u003ci\u003eAudioBPM.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e5.4 Application Example: Query-by-Humming.\u003c\/p\u003e \u003cp\u003e5.4.1 Monophonic Melody Transcription.\u003c\/p\u003e \u003cp\u003e5.4.2 Polyphonic Melody Transcription.\u003c\/p\u003e \u003cp\u003e5.4.3 Comparison of Melody Contours.\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e6. Fingerprinting and Audio Signal Quality.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e6.1 Introduction.\u003c\/p\u003e \u003cp\u003e6.2 Audio Signature.\u003c\/p\u003e \u003cp\u003e6.2.1 Generalities on Audio Fingerprinting.\u003c\/p\u003e \u003cp\u003e6.2.2 Fingerprint Extraction.\u003c\/p\u003e \u003cp\u003e6.2.3 Distance and Searching Methods.\u003c\/p\u003e \u003cp\u003e6.2.4 MPEG-7-Standardized \u003ci\u003eAudioSignature.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3 Audio Signal Quality.\u003c\/p\u003e \u003cp\u003e6.3.1 \u003ci\u003eAudioSignalQuality\u003c\/i\u003e Description Scheme.\u003c\/p\u003e \u003cp\u003e6.3.2 \u003ci\u003eBroadcastReady.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.3 \u003ci\u003eIsOriginalMono.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.4 \u003ci\u003eBackgroundNoiseLevel.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.5 \u003ci\u003eCrossChannelCorrelation.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.6 \u003ci\u003eRelativeDelay.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.7 \u003ci\u003eBalance.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.8 \u003ci\u003eDcOffset.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.9 \u003ci\u003eBandwidth.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.10 \u003ci\u003eTransmissionTechnology.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003e6.3.11 \u003ci\u003eErrorEvent\u003c\/i\u003e and \u003ci\u003eErrorEventList.\u003c\/i\u003e\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e7. Application.\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e7.1 Introduction.\u003c\/p\u003e \u003cp\u003e7.2 Automatic Audio Segmentation.\u003c\/p\u003e \u003cp\u003e7.2.1 Feature Extraction.\u003c\/p\u003e \u003cp\u003e7.2.2 Segmentation.\u003c\/p\u003e \u003cp\u003e7.2.3 Metric-Based Segmentation.\u003c\/p\u003e \u003cp\u003e7.2.4 Model-Selection-Based Segmentation.\u003c\/p\u003e \u003cp\u003e7.2.5 Hybrid Segmentation.\u003c\/p\u003e \u003cp\u003e7.2.6 Hybrid Segmentation Using MPEG-7 ASP.\u003c\/p\u003e \u003cp\u003e7.2.7 Segmentation Results.\u003c\/p\u003e \u003cp\u003e7.3 Sound Indexing and Browsing of Home Video Using Spoken Annotations.\u003c\/p\u003e \u003cp\u003e7.3.1 A Simple Experimental System.\u003c\/p\u003e \u003cp\u003e7.3.2 Retrieval Results.\u003c\/p\u003e \u003cp\u003e7.4 Highlights Extraction for Sport Programmes Using Audio Event Detection.\u003c\/p\u003e \u003cp\u003e7.4.1 Goal Event Segment Selection.\u003c\/p\u003e \u003cp\u003e7.4.2 System Results.\u003c\/p\u003e \u003cp\u003e7.5 A Spoken Document Retrieval System for Digital Photo Albums.\u003c\/p\u003e \u003cp\u003eReferences.\u003c\/p\u003e \u003cp\u003e\u003cb\u003eIndex.\u003c\/b\u003e\u003c\/p\u003e  \u003cb\u003eHyoung-Gook Kim\u003c\/b\u003e, Researcher of the MPEG-7 Audio Project at the Communication Systems Group, Technical University of Berlin, Communication Systems Group, Sekr. EN 1, Einsteinufer 17, D-10587 Berlin  \u003cp\u003e\u003cb\u003eNicolas Moreau\u003c\/b\u003e, Researcher of the MPEG-7 Audio Project at the Communication Systems Group, Technical University of Berlin, Communication Systems Group, Sekr. EN 1, Einsteinufer 17, D-10587 Berlin\u003c\/p\u003e \u003cp\u003e\u003cb\u003eThomas Sikora\u003c\/b\u003e, Professor and head of the Communication Systems Group, Technical University of Berlin, Communication Systems Group, Sekr. EN 1, Einsteinufer 17, D-10587 Berlin\u003c\/p\u003e  Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data, by providing an established framework for effective multimedia management.   \u003ci\u003eMPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval\u003c\/i\u003e is a unique insight into the technology, covering the following topics:  \u003cul\u003e \u003cli\u003ethe fundamentals of MPEG-7 audio, principally low-level descriptors and sound classification and similarity;\u003c\/li\u003e \u003c\/ul\u003e \u003cul type=\"disc\"\u003e \u003cli\u003espoken content description, and timbre, melody and tempo music description tools;\u003c\/li\u003e \u003c\/ul\u003e \u003cul type=\"disc\"\u003e \u003cli\u003eexisting MPEG-7 applications and those currently being developed;\u003c\/li\u003e \u003c\/ul\u003e \u003cul type=\"disc\"\u003e \u003cli\u003eexamples of audio technology beyond the scope of MPEG-7.\u003c\/li\u003e \u003c\/ul\u003e \u003cp\u003eEssential reading for practising electronic and communications engineers designing and implementing MPEG-7 compliant systems, this book will also be a useful reference for researchers and graduate students working with multimedia database technology.\u003c\/p\u003e","brand":"Wiley","offers":[{"title":"Default Title","offer_id":47989657567461,"sku":"NP9780470093344","price":149.95,"currency_code":"USD","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/1842\/7735\/files\/9780470093344.jpg?v=1761784987","url":"https:\/\/k12savings.com\/products\/mpeg-7-audio-and-beyond-isbn-9780470093344","provider":"K12savings","version":"1.0","type":"link"}