Program of WASPAA2007

Program

Sunday, October 21

Registration (October 21, 16:00-18:00) Hospitality Room

Dinner (October 21, 18:00-20:00) West Dining Room

Open Bar (October 21, 20:00-22:00) West Dining Room

Monday, October 22

Keynote Address1: Simon Haykin (McMaster University) (October 22, 8:00-9:00) Conference House

Chair: Shoji Makino

8:00-9:00
[Keynote1] Coherent ICA: Implications for Auditory Signal Processing
Simon Haykin, Kevin Kan

Lecture ML1: Microphone Array Signal Processing (October 22, 9:00-10:00) Conference House

Chair: Gary W. Elko

9:00-9:20
[ML1-1] Enhanced Microphone-Array Beamforming Based on Frequency-Domain Spatial Analysis-Synthesis
Michael M. Goodwin
9:20-9:40
[ML1-2] Real Time Capture of Audio Images and Their Use with Video
Adam O'Donovan, Ramani Duraiswami, Nail A. Gumerov
9:40-10:00
[ML1-3] Subband Method for Multichannel Least Squares Equalization of Room Transfer Functions
Nikolay D. Gaubitch, Mark R. P. Thomas, Patrick A. Naylor

Break (October 22, 10:00-10:20) West Dining Room

Poster MP1 (October 22, 10:20-12:20) West Dining Room

Chair: W. Bastiaan Kleijn

[MP1-01] Broadband Music: Opportunities and Challenges for Multiple Source Localization
Jacek P. Dmochowski, Jacob Benesty, Sofiène Affes
[MP1-02] Energy-Based Position Estimation of Microphones and Speakers for ad hoc Microphone Arrays
Minghua Chen, Zicheng Liu, Li-Wei He, Phil Chou, Zhengyou Zhang
[MP1-03] Linear Regression on Sparse Features for Single-Channel Speech Separation
Mikkel N. Schmidt, Rasmus K. Olsson
[MP1-04] Sound Source Separation using Null-Beamforming and Spectral Subtraction for Mobile Devices
Shintaro Takada, Satoshi Kanba, Tetsuji Ogawa, Kenzo Akagiri, Tetsunori Kobayashi
[MP1-05] On Dealing with Sampling Rate Mismatches in Blind Source Separation and Acoustic Echo Cancellation
Enrique Robledo-Arnuncio, Ted S. Wada, Biing-Hwang (Fred) Juang
[MP1-06] Signal Deflation and Paraunitary Constraints in Spatio-Temporal FastICA-Based Convolutive Blind Source Separation of Speech Mixtures
Malay Gupta, Scott C. Douglas
[MP1-07] Fast Convergence Blind Source Separation Based on Frequency Subband Interpolation by Null Beamforming
Keiichi Osako, Yoshimitsu Mori, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano
[MP1-08] Electronic Pop Protection for Microphones
Gary W. Elko, Jens Meyer, Steven Backer, Jürgen Peissig
[MP1-09] A Practical Multichannel Dereverberation Algorithm using Multichannel DYPSA and Spatiotemporal Averaging
Mark R. P. Thomas, Nikolay D. Gaubitch, Jon Gudnason, Patrick A. Naylor
[MP1-10] Isotropic Noise Suppression in the Power Spectrum Domain by Symmetric Microphone Arrays
Hikaru Shimizu, Nobutaka Ono, Kyosuke Matsumoto, Shigeki Sagayama
[MP1-11] Acoustic Echo Cancelation for Dynamically Steered Microphone Array Systems
Matti Hämäläinen, Ville Myllylä
[MP1-12] A New Approach to Digital Audio Equalization
S. Cecchi, L. Palestini, E. Moretti, F. Piazza
[MP1-13] Implementation of Directional Sources in Wave Field Synthesis
Jens Ahrens, Sascha Spors
[MP1-14] A Comparsion of Acoustic and Psychoacoustic Measurements of Pass-Through Hearing Protection Devices
Douglas S. Brungart, Brian W. Hobbs, James T. Hamil
[MP1-15] Improvement in Detectability of Alarm Signals in Noisy Environments by Utilizing Spatial Cues
Hideaki Uchiyama, Masashi Unoki, Masato Akagi
[MP1-16] Estimation Model for the Speech-Quality Dimension "Directness / Frequency Content"
Lu Huo, Marcel Wältermann, Kirstin Scholz, Alexander Raake, Ulrich Heute, Sebastian Möller
[MP1-17] Probabilistic Model Based Similarity Measures for Audio Query-By-Example
Tuomas Virtanen, Marko Helén
[MP1-18] Improving Generalization for Classification-Based Polyphonic Piano Transcription
Graham E. Poliner, Daniel P. W. Ellis
[MP1-19] Acoustic Signal Processing for Degradation Analysis of Rotating Machinery to Determine the Remaining Useful Life
Patricia Scanlon, Alan M. Lyons, Alan O'Loughlin
[MP1-20] Single-Frame Discrimination of Non-Stationary Sinusoids
Jeremy J. Wells, Damian T. Murphy

Lunch (October 22, 12:20-15:40) West Dining Room

Break (October 22, 15:40-16:00) Conference House

Lecture ML2: Source Localization and Blind Source Separation (October 22, 16:00-18:00) Conference House

Chair: Scott Douglas

16:00-16:20
[ML2-1] Modeling of Motion Dynamics and its Influence on the Performance of a Particle Filter for Acoustic Speaker Tracking
Eric A. Lehmann, Anders M. Johansson, Sven Nordholm
16:20-16:40
[ML2-2] Multi Target Acoustic Source Tracking using Track Before Detect
Maurice Fallon, Simon Godsill
16:40-17:00
[ML2-3] Blind Sparse-Nonnegative (BSN) Channel Identification for Acoustic Time-Difference-Of-Arrival Estimation
Yuanqing Lin, Jingdong Chen, Youngmoo Kim, Daniel D. Lee
17:00-17:20
[ML2-4] Blind Criterion and Oracle Bound for Instantaneous Audio Source Separation using Adaptive Time-Frequency Representations
Emmanuel Vincent, Rémi Gribonval
17:20-17:40
[ML2-5] Monaural Speech Separation using Source-Adapted Models
Ron J. Weiss, Daniel P. W. Ellis
17:40-18:00
[ML2-6] A Soft Masking Strategy Based on Multichannel Speech Probability Estimation for Source Separation and Robust Speech Recognition
Eugen Hoffmann, Dorothea Kolossa, Reinhold Orglmeister

Dinner (October 22, 18:00-20:00) West Dining Room

Demo Session 1 (October 22, 20:00-22:00) West Dining Room

Tuesday, October 23

Keynote Address2: Albert S. Bregman (McGill University) (October 23, 8:00-9:00) Conference House

Chair: Daniel P.W. Ellis

8:00-9:00
[Keynote2] Progress in the Study of Auditory Scene Analysis
Albert S. Bregman

Lecture TL1: Signal Enhancement (October 23, 9:00-10:00) Conference House

Chair: Patric A. Naylor

9:00-9:20
[TL1-1] Single-Channel Impact Noise Suppression with No Auxiliary Information for its Detection
Akihiko Sugiyama
9:20-9:40
[TL1-2] Aliasing Reduction for Modified Discrete Cosine Transform Domain Filtering and its Application to Speech Enhancement
Fabian Kuech, Bernd Edler
9:40-10:00
[TL1-3] Example-Driven Bandwidth Expansion
Paris Smaragdis, Bhiksha Raj

Break (October 23, 10:00-10:20) West Dining Room

Poster TP1 (October 23, 10:20-12:20) West Dining Room

Chair: Paris Smaragdis

[TP1-01] A Two-Stage Frequency-Domain Blind Source Separation Method for Underdetermined Convolutive Mixtures
Hiroshi Sawada, Shoko Araki, Shoji Makino
[TP1-02] Long-Term Gain Estimation in Model-Based Single Channel Speech Separation
M. H. Radfar, R. M. Dansereau
[TP1-03] Sparseness-Based 2ch BSS using the EM Algorithm in Reverberant Environment
Yosuke Izumi, Nobutaka Ono, Shigeki Sagayama
[TP1-04] Prior Structures for Time-Frequency Energy Distributions
Ali Taylan Cemgil, Paul Peeling, Onur Dikmen, Simon Godsill
[TP1-05] Fast Time-Domain Spherical Microphone Array Beamforming
Zhiyun Li, Ramani Duraiswami
[TP1-06] Reverberation-Time Prediction Method for Room Impulse Responses Simulated with the Image-Source Model
Eric A. Lehmann, Anders M. Johansson, Sven Nordholm
[TP1-07] Overfitting-Resistant Speech Dereverberation
Takuya Yoshioka, Tomohiro Nakatani, Takafumi Hikichi, Masato Miyoshi
[TP1-08] Novel and Efficient Download Test for Two Path Echo Canceller
Mohammad Asif Iqbal, Steven L. Grant
[TP1-09] An Approach to Massive Multichannel Broadband Feedforward Active Noise Control using Wave-Domain Adaptive Filtering
Sascha Spors, Herbert Buchner
[TP1-10] Enhancement of Residual Echo for Improved Frequency-Domain Acoustic Echo Cancellation
Ted S. Wada, Biing-Hwang (Fred) Juang
[TP1-11] Effects of Pre-Processing Filters on a Wavelet Packet-Based Algorithm to Identify Speech Transients
Daniel M. Rasetshwane, J. Robert Boston, Ching-Chung Li, John D. Durrant
[TP1-12] Modeling Spot Microphone Signals using the Sinusoidal Plus Noise Approach
Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides
[TP1-13] A Modified Spatio-Temporal Orthogonal Iteration Method for Multichannel Audio Signal Representation
Scott C. Douglas, Malay Gupta
[TP1-14] A Low-Delay Audio Coder with Constrained-Entropy Quantization
Minyue Li, W. Bastiaan Kleijn
[TP1-15] Extending Fine-Grain Scalable Audio Coding to Very Low Bitrates using Overcomplete Dictionaries
Emmanuel Ravelli, Gaël Richard, Laurent Daudet
[TP1-16] Spectral Band Replication Tool for Very Low Delay Audio Coding Applications
Tobias Friedrich, Gerald Schuller
[TP1-17] Methods for 2nd Order Spherical Harmonic Spatial Encoding in Digital Waveguide Mesh Virtual Acoustic Simulations
Alex Southern, Damian Murphy
[TP1-18] Solo Voice Detection via Optimal Cancellation
Christine Smit, Daniel P. W. Ellis
[TP1-19] Fast Sequential LS Estimation for Sinusoidal Modeling and Decomposition of Audio Signals
Bertrand David, Roland Badeau
[TP1-20] Speech-To-Singing Synthesis: Converting Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices
Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
[TP1-21] Convolutional Synthesis of Wind Instruments
Tamara Smyth, Jonathan S. Abel

Lunch (October 23, 12:20-15:40) West Dining Room

Break (October 23, 15:40-16:00) Conference House

Lecture TL2: Speech and Audio Coding and Hearing Aid (October 23, 16:00-18:00) Conference House

Chair: Thomas F. Quatieri

16:00-16:20
[TL2-1] Comparison of Reduced-Bandwidth MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids
Simon Doclo, Tim van den Bogaert, Jan Wouters, Marc Moonen
16:20-16:40
[TL2-2] Distributed Spatial Audio Coding in Wireless Hearing Aids
Olivier Roy, Martin Vetterli
16:40-17:00
[TL2-3] A Time-Frequency Modulation Model of Speech Quality
James M. Kates, Kathryn H. Arehart
17:00-17:20
[TL2-4] Low Delay Filterbanks for Enhanced Low Delay Audio Coding
Markus Schnell, Ralf Geiger, Markus Schmidt, Markus Multrus, Michael Mellar, Jürgen Herre, Gerald Schuller
17:20-17:40
[TL2-5] Lossless Audio Coding with Bandwidth Extension Layers
Stephen Voran
17:40-18:00
[TL2-6] Rate Distribution between Model and Signal
W. Bastiaan Kleijn, Alexey Ozerov

Dinner (October 23, 18:00-20:00) West Dining Room

Demo Session 2 (October 23, 20:00-22:00) West Dining Room

Wednesday, October 24

Lecture WL1: Music and Signal Analysis and Synthesis (October 24, 8:00-10:00) Conference House

Chair: Simon Godsill

8:00-8:20
[WL1-1] Sinewave Analysis/Synthesis Based on the Fan-Chirp Transform
Robert Dunn, Thomas F. Quatieri
8:20-8:40
[WL1-2] Spectral Refinement and its Application to Fundamental Frequency Estimation
Mohamed Krini, Gerhard Schmidt
8:40-9:00
[WL1-3] A Novel Method for Decomposition of Multicomponent Nonstationary Signals
A. Goli, D. M. McNamara, A. K. Ziarani
9:00-9:20
[WL1-4] Using Stereo Information for Instrument Identification in Polyphonic Mixtures
David Sodoyer, Pierre Leveau, Laurent Daudet
9:20-9:40
[WL1-5] Bauer Method of MVDR Spectral Factorization for Pitch Modification in the Source Domain
M. Ravi Shanker, R. Muralishankar, A. G. Ramakrishnan
9:40-10:00
[WL1-6] Waveguide Modeling of Lossy Flared Acoustic Pipes: Derivation of a Kelly-Lochbaum Structure for Real-Time Simulations
Thomas Hélie, Rémi Mignot, Denis Matignon

Break (October 24, 10:00-10:20) West Dining Room

Poster WP1 (October 24, 10:20-12:20) West Dining Room

Chair: Jingdong Chen

[WP1-01] Sound Source Distance Learning Based on Binaural Signals
Sampo Vesa
[WP1-02] EM Localization and Separation using Interaural Level and Phase Cues
Michael I. Mandel, Daniel P. W. Ellis
[WP1-03] Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering
Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama
[WP1-04] Joint Iterative Multi-Speaker Identification and Source Separation using Expectation Propagation
John MacLaren Walsh, Youngmoo E. Kim, Travis M. Doll
[WP1-05] Audio Source Separation with Matching Pursuit and Content-Adaptive Dictionaries (MP-CAD)
Namgook Cho, Yu Shiu, C.-C. Jay Kuo
[WP1-06] Post-Filter Design for Superdirective Beamformers with Closely Spaced Microphones
Heinrich W. Löllmann, Peter Vary
[WP1-07] A Fast Microphone Array SRP-PHAT Source Location Implementation using Coarse-To-Fine Region Contraction (CFRC)
Hoang Do, Harvey F. Silverman
[WP1-08] Importance of Energy and Spectral Features in Gaussian Source Model for Speech Dereverberation
Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Masato Miyoshi
[WP1-09] A Variable Step-Size for Frequency-Domain Acoustic Echo Cancellation
Yin Zhou, Xiaodong Li
[WP1-10] A Novel Approach to Active Noise Control Based on Wave Domain Adaptive Filtering
P. Peretti, S. Cecchi, L. Palestini, F. Piazza
[WP1-11] Semantic Colouration Space Investigation: Controlled Colouration in the Bark-Sone Domain
Jimi Y. C. Wen, Patrick A. Naylor
[WP1-12] Robustness Analysis of Binaural Hearing Aid Beamformer Algorithms by Means of Objective Perceptual Quality Measures
Thomas Rohdenburg, Volker Hohmann, Birger Kollmeier
[WP1-13] Privacy-Preserving Musical Database Matching
Madhusudana Shashanka, Paris Smaragdis
[WP1-14] A Multichannel Linear Prediction Method for the MPEG-4 ALS Compliant Encoder
Yutaka Kamamoto, Noboru Harada, Takehiro Moriya
[WP1-15] Enhanced Resampling for Sinusoidal Modeling Parameters
Martin Raspaud, Sylvain Marchand
[WP1-16] Compressive Coding of Stereo Audio Signals Extracting Sparseness Among Sound Sources with Independent Component Analysis
Shigeki Miyabe, Tadashi Mihashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura
[WP1-17] Distortion-Aware Query-By-Example for Environmental Sounds
Gordon Wichern, Jiachen Xue, Harvey Thornburg, Andreas Spanias
[WP1-18] Multi-Object Tracking of Sinusoidal Components in Audio with the Gaussian Mixture Probability Hypothesis Density Filter
Daniel Clark, Ali-Taylan Cemgil, Paul Peeling, Simon Godsill
[WP1-19] Separation of Harmonic and Speech Signals using Sinusoidal Modeling
Peter Jančovič, Münevver Köküer
[WP1-20] An Instrument Timbre Model for Computer Aided Orchestration
Damien Tardieu, Xavier Rodet

Lunch (October 24, 12:20-14:00) West Dining Room