
Books and Book Chapters
- P. Maragos, A. Potamianos and P. Gros (eds),
Multimodal Processing and Interaction: Audio, Video, Text
,
Springer-Verlag, 2008.
- A. Potamianos and M. Perakakis, "
Human-Computer Interafaces to Multimedia Content: A Review
,'' in Multimodal Processing and Interaction: Audio, Video, Text,
Springer-Verlag, 2008.
- A. Potamianos and M. Perakakis, "
Design Principles for Multimodal Spoken Dialogue Systems
,'' in Multimodal Processing and Interaction: Audio, Video, Text,
Springer-Verlag, 2008.
- G. Evangelopoulos, K. Rapatzikos, P. Maragos,
Y. Avrithis, A. Potamianos,
"Audiovisual Attention Modeling and Salient Event Detection",
in Multimodal Processing and Interaction: Audio, Video, Text,
Springer-Verlag, 2008.
Journal Publications
- P. Tsiakoulis, A. Potamianos, and D. Dimitriadis,
"Spectral Moment Features Augmented by Low Order Cepstral Coefficients for Robust ASR
," IEEE Signal Processing Letters, submitted, 2009.
- M. Perakakis and A. Potamianos,
"Synergy and Modality Efficiency in Multimodal Dialogue Systems
," IEEE Transactions on Audio, Speech
and Language Processing, submitted, 2009.
- S. Yildirim, S. Narayanan, and A. Potamianos,
"Detecting Emotional State of a Child in a Conversational Computer Game
," Computer, Speech and Language , to appear, 2010.
- D. Nion, K. Mokios, N. Sidiropoulos, A. Potamianos, "
Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech
Mixtures
," IEEE Transactions on Audio, Speech
and Language Processing, to appear, 2010.
- E. Iosif, and A. Potamianos, "
Unsupervised Semantic Similarity Computation Between Terms Using
Web Documents
,''
IEEE Transactions on Knowledge and Data Engineering, to appear, 2010.
- D. Dimitriadis, A. Potamianos, and P. Maragos, "
A Comparison of the Squared Energy and Teager-Kaiser Operators for Short-Time
Energy Estimation in Noise
,''
IEEE Transactions on Signal Processing , vol. 57, no. 7, pp. 2569-2581, July 2009.
- E. Sanchez-Soto, A. Potamianos, and K. Daoudi, "
Unsupervised stream weight computation in classification and detection tasks
,'' IEEE Transactions on Audio, Speech
and Language Processing, vol. 17, no. 3, pp. 436-445, Mar. 2009.
- M. Perakakis and A. Potamianos, "
A study in efficiency and modality usage in multimodal form filling systems
,'' IEEE Transactions on Audio, Speech
and Language Processing,
Vol. 16, pp. 1194 - 1206, Aug. 2008.
- E. Ammicht, E. Fosler-Lussier, and A. Potamianos, "
Information seeking spoken dialogue systems - Part I: Semantics and pragmatics
,'' IEEE
Transactions on Multimedia, vol. 9, no.3, Apr. 2007.
Vol. 9, pp. 532 - 549, April 2007.
- A. Potamianos, E. Fosler-Lussier, E. Ammicht, and M. Perakakis, "
Information seeking spoken dialogue systems - Part II: Multimodal dialogue
,'' IEEE Transactions on Multimedia,
Vol. 9, pp. 550 - 566, April 2007.
- D. Dimitriadis, P. Maragos, and A. Potamianos, "
Robust AM-FM features for speech recognition
,'' IEEE Signal Processing Letters, vol. 12,
pp. 621-624, Sept. 2005.
- A. Potamianos, S. Narayanan, and G. Riccardi, "
Adaptive categorical understanding for spoken dialogue systems
,'' IEEE Transactions on Speech
and Audio Processing,
Vol. 13, pp. 321 - 329, May 2005.
- A. Pargellis, E. Fosler-Lussier, C.-H. Lee, A. Potamianos, and A. Tsai,
"Auto-induced semantic classes,'' Speech Communication, vol. 43,
pp. 183-203, Aug. 2004.
- A. Potamianos and S. Narayanan, "
Robust recognition of children's speech
,''
IEEE Transactions on Speech and Audio Processing, vol. 11,
pp. 603-616, Nov. 2003.
- V. Weerackody, W. Reichl, and A. Potamianos, "
An error-protected speech recognition system for wireless communications
,'' IEEE Transactions on
Wireless Communications, vol. 1, pp. 282-291, Apr. 2002.
- S. Narayanan and A. Potamianos, "
Creating conversational interfaces for children
,'' IEEE Transactions on Speech and Audio Processing, vol. 10,
pp. 65-78, Feb. 2002. [IEEE Signal Processing Society Best Paper Award 2005]
- D. Dimitriadis, V. Pitsikalis, P. Maragos, and A. Potamianos, "
Modulation and chaotic features for speech recognition
,'' invited paper to the Journal
of Control and Intelligent Systems, Special Issue on Nonlinear Speech
Processing, vol. 30, pp. 19-26, Jan. 2002.
- A. Potamianos and P. Maragos, "
Time-frequency distributions for automatic speech recognition
,'' IEEE Transactions on Speech and Audio Processing,
vol. 9, pp. 196-200, Mar. 2001.
- A. Potamianos and P. Maragos, "
Speech analysis and synthesis using an AM-FM modulation model
,'' Speech Communication, vol. 28,
pp. 195-209, July 1999.
- S. Lee, A. Potamianos, and S. Narayanan, "
Acoustics of children's speech: Developmental changes of temporal and spectral parameters
,'' Journal of
the Acoustical Society of America, pp. 1455-1468, Mar. 1999. [Selected Research Article by JASA]
- P.Maragos and A. Potamianos, "
Fractal dimensions of speech sounds: Computation
and application to automatic speech recognition
,'' Journal of the
Acoustical Society of America, pp. 1925-1932, Mar. 1999.
- A. Potamianos and P. Maragos, "
Speech formant frequency and bandwidth tracking using multiband energy demodulation
,'' Journal of the Acoustical Society
of America, vol. 99, pp. 3795-3806, June 1996.
- P. Maragos and A. Potamianos, "
Higher-order differential energy operators
,''
IEEE Signal Processing Letters, vol. 2, Aug. 1995.
- H. M. Hanson, P. Maragos, and A. Potamianos, "
A system for finding speech formants and modulations via energy separation
,'' IEEE Transactions on
Speech and Audio Processing, vol. 2, pp. 436-443, July 1994.
- A. Potamianos and P. Maragos, "
A comparison of the energy operator and the Hilbert transform approach to signal and speech demodulation
,'' Signal
Processing, vol. 37, pp. 95-120, May 1994.
Conference Publications
- S. Dimopoulos, C.H. Lee, E. Fosler-Lussier, and A. Potamianos, "
Transition features for CRF-based recognition and boundary detection
,"
in Proc. Automatic Speech Recogn. and Underst. Workshop (ASRU-2009),
Merano, Italy, Dec. 2009.
- P. Tsiakoulis, A. Potamianos, and D. Dimitriadis, "
Short-time instantaneous frequency and bandwidth features for speech recognition
,"
in Proc. Automatic Speech Recogn. and Underst. Workshop (ASRU-2009),
Merano, Italy, Dec. 2009.
- T. Kannetis, and A. Potamianos, "
Towards Adapting Fantasy, Curiosity and Challenge in Multimodal Dialogue
Systems for Preschoolers
,"
in Proc. Int'l Conf. on Multimodal Interfaces (ICMI-2009), Boston, MA, Nov. 2009.
- T. Kannetis, A. Potamianos, and G.N. Yannakakis, "
Fantasy, Curiosity and Challenge as Adaptation Indicators in Multimodal Dialog
Systems for Preschoolers
,"
in Proc. Workshop of Child, Computer and Interaction (WOCCI-2009), Boston, MA, Nov. 2009.
- M. Gerosa, D. Giuliani, S. Narayanan, and A. Potamianos, "
A Review of ASR Technologies for Children's Speech
,"
in Proc. Workshop of Child, Computer and Interaction (WOCCI-2009), Boston, MA, Nov. 2009.
- P. Tsiakoulis, and A. Potamianos, "
Statistical Analysis of Amplitude Modulation in Speech Signals using an AM-FM Model
,"
in Proc. Intl. Conf. on Acoustics, Speech and Signal Processing
(ICASSP-2009), Taipei, Taiwan, Apr. 2009.
-
G. Evangelopoulos, A. Zlatintsi, G. Skoumas, K. Rapantzikos, A. Potamianos, P. Maragos, and Y. Avrithis, "
Video Event Detection and Summarization Using Audio, Visual and Text Saliency
,"
in Proc. Intl. Conf. on Acoustics, Speech and Signal Processing
(ICASSP-2009), Taipei, Taiwan, Apr. 2009.
-
S. Dimopoulos, A. Potamianos, E. Fosler-Lussier, and C.-H. Lee, "
Multiple Time Resolution Analysis of Speech Signals Using MCE Training with Application to Speech Recognition
,"
in Proc. Intl. Conf. on Acoustics, Speech and Signal Processing
(ICASSP-2009), Taipei, Taiwan, Apr. 2009.
- V. Farantouri, A. Potamianos, and S. Narayanan, "
Linguistic Analysis of Spontaneous Children Speech
",
in Proc. Workshop of Child, Computer and Interaction (WOCCI-2008), Chania, Greece, Oct. 2008.
- M. Perakakis and A. Potamianos, "
Multimodal System Evaluation using Modality Efficiency and Synergy Metrics
",
in Proc. Int'l Conf. on Multimodal Interfaces (ICMI-2008), Chania, Greece, Oct. 2008.
- G. Evangelopoulos, K. Rapantzikos, A. Potamianos, P. Maragos, A. Zlatintsi and Y. Avrithis, "
Movie Summarization Based On Audio-Visual Saliency Detection
",
in Proc. Intl Conference on Image Processing (ICIP-2008),
San Diego, California, Oct. 2008.
- A. Tegos, V. Karkaletsis, and A. Potamianos, "
Learning of Semantic Relations Between Ontology Concepts Using Statistical Techniques
",
in Proc. HLIE Workshop , Antwerp, Belgium, Sept. 2008.
- M. Maragakis and A. Potamianos, "
Region-Based Vocal Tract Length Normalization for ASR
",
in Proc. Interspeech , Brisbane, Australia, Sept. 2008.
- K. Mokios, A. Potamianos, and N.  Sidiropoulos, "
On the Effectiveness of PARAFAC-Based Estimation in Blind Speech Separation
",
in Proc. Intl. Conf. on Acoustics, Speech and Signal Processing
(ICASSP-2008), Las Vegas, Nevada, Apr. 2008.
- E. Iosif and A. Potamianos, "
Unsupervised semantic similarity computation using web search engines
,'' in Proc. Intern. Conf. on Web Intelligence,
(Silicon Valley, USA), Nov. 2007.
- E. Sanchez-Soto, K. Daoudi, and A. Potamianos, "
Unsupervised stream weight computation in a segmentation task: Application to audio-visual speech
recognition
,'' in Proc. Intern. Conf. on Signal Proc. and
Communications, (Dubai, UAE), Nov. 2007.
- M. Perakakis and A. Potamianos, "
The effect of input mode on inactivity and interaction times of multimodal systems
,'' in Internat. Conf. on
Multimodal Interfaces, (Nagoya, Japan), Nov. 2007.
- A. Potamianos and S. Narayanan, "
A review of the acoustic and linguistic
properties of children's speech
,'' in Proc. Intern. Workshop on
Multimedia Signal Processing, (Chania, Greece), Oct. 2007.
- S. Siltanen et al, "
Multimodal user interface for augmented assembly
,'' in
Proc. Intern. Workshop on Multimedia Signal Processing, (Chania,
Greece), Oct. 2007.
- E. Iosif and A. Potamianos, "
A soft-clustering algorithm for automatic induction of semantic classes
,'' in Proc. Interspeech, (Antwerp,
Belgium), Aug. 2007.
- D. Dimitriadis, J. Segura, L. Garcia, A. Potamianos, P. Maragos, and
V. Pitsikalis, "
Advanced front-end for robust speech recognition in
extremely adverse environments
,'' in Proc. Interspeech, ( Antwerp, Belgium), Aug. 2007.
- A. Katsamanis, P. Tsiakoulis, P. Maragos, and A. Potamianos, "
Investigations in articulatory synthesis
,'' in Proc. Intenat. Conf. on Phonetics,
(Saarbrucken, Germany), Aug. 2007.
- E. Sanchez-Soto, A. Potamianos, and K. Daoudi, "
Unsupervised stream weight computation using anti-models
,'' in Proc. Internat. Conf. on Acoust.,
Speech, and Signal Process., (Hawaii, USA), Apr. 2007.
- E. Iosif, A. Tegos, A. Pangos, E. Fosler-Lussier, and A. Potamianos, "
Unsupervised combination of metrics for semantic class induction
,'' in IEEE/ACM Workshop on Spoken Language Technology, (Aruba), Dec. 2006.
- M. Perakakis, M. Toutoudakis, and A. Potamianos, "
Blending speech and visual input in multimodal dialogue systems
,'' in IEEE/ACM Workshop on Spoken
Language Technology, (Aruba), Dec. 2006.
- A. Potamianos et al, "
Towards speaker and enviromental robustness in ASR: the HIWIRE project
,'' in ITRW Workshop on Speech Recognition and
Intrinsic Variation, (Toulouse, France), May 2006.
- A. Potamianos, E. Sanchez-Soto, and K. Daoudi, "
Stream weight computation for multi-stream classifiers
,'' in Proc. Internat. Conf. on Acoust., Speech,
and Signal Process., (Toulouse, France), May 2006.
- K. Mokios, N. Sidiropoulos, and A. Potamianos, "
Blind speech separation algorithm using PARAFAC and integer least squares
,'' in Proc.
Internat. Conf. on Acoust., Speech, and Signal Process., (Toulouse, France),
May 2006.
- P. Karageorgakis, A. Potamianos, and I. Klasinas, "
Towards incorporating language morphology into statistical machine translation systems
,'' in Proc. Automatic Speech Recogn. and Underst. Workshop, (Cancun, Mexico), Dec.
2005.
- A. Pangos, E. Iosif, A. Potamianos, and E. Fosler-Lussier,
Combining statistical similarity measures for automatic induction of semantic
classes
,'' in Proc. Automatic Speech Recogn. and Underst. Workshop,
(Cancun, Mexico), Dec. 2005.
-
D. Dimitriadis, P. Maragos, and A. Potamianos, "
Auditory Teager energy cepstrum coefficients for robust speech recognition
,'' in Proc. European
Conf. on Speech Communication and Technology, (Lisbon, Portugal), Sept.
2005.
-
S. Yildirim, C. Lee, S. Lee, A. Potamianos, and S. Narayanan, "
Detecting politeness and frustration state of a child in a conversational computer game
,'' in Proc. European Conf. on Speech Communication and Technology,
(Lisbon, Portugal), Sept. 2005.
- A. Potamianos, E. Ammicht, and E. Fosler-Lussier, "
Modality tracking in the multimodal Bell Labs Communicator
,'' in Proc. Automatic Speech
Recogn. and Underst. Workshop, (St. Thomas, U.S. Virgin Islands), Dec. 2003.
- A. Potamianos, "Novel features for robust speech recognition,'' in invited presentation to the Conf. of the Acoustical Society of America,
(Cancun, Mexico), Dec. 2002.
- M. Walker et al, "
DARPA Communicator: Cross-system results for the 2001
evaluation
,'' in Internat. Conf. Speech Language Processing,
(Colorado), Sept. 2002.
- M. Walker et al, "DARPA Communicator evaluation: Progress from 2000 to
20001,'' in Internat. Conf. Speech Language Processing, (Colorado),
Sept. 2002.
- R. Argiles-Solsona, E. Fosler-Lussier, J. Kuo, A. Potamianos, and I. Zitouni, "
Adaptive language models for spoken dialogue systems
,'' in Proc.
Internat. Conf. on Acoust., Speech, and Signal Process., (Orlando, Florida),
May 2002.
- D. Dimitriadis, P. Maragos, and A. Potamianos, "
Modulation features for speech
recognition
,'' in Proc. Internat. Conf. on Acoust., Speech, and Signal
Process., (Orlando, Florida), May 2002.
- S. Lee, E. Ammicht, E. Fosler-Lussier, J. Kuo, and A. Potamianos, "
Spoken dialogue evaluation for the Bell Labs Communicator system
,'' in Proc. Human Language Technology Conf., (San Diego, California), Mar. 2002.
- M. Tsangaris and A. Potamianos, "
AGORA: A GUI approach to multimodal user interfaces
,'' in Proc. Human Language Technology Conf., (San Diego,
California), Mar. 2002.
- E. Ammicht, A. Potamianos, and E. Fosler-Lussier, "
Ambiguity representation and resolution in spoken dialogue systems
,'' in Proc. European Conf. on
Speech Communication and Technology, (Aalborg, Denmark), Oct. 2001.
- M. Galley, E. Fosler-Lussier, and A. Potamianos, "
Hybrid natural language generation for spoken dialogue systems
,'' in Proc. European Conf. on
Speech Communication and Technology, (Aalborg, Denmark), Oct. 2001.
- A. Pargellis, E. Fosler-Lussier, A. Potamianos, and C.-H. Lee, "
Metrics for measuring domain-independence of semantic classes
,'' in Proc. European
Conf. on Speech Communication and Technology, (Aalborg, Denmark), Oct. 2001.
- M. Walker et al, "DARPA Communicator dialog travel planning systems: the
June 2000 evaluation,'' in Proc. European Conf. on Speech
Communication and Technology, (Aalborg, Denmark), Oct. 2001.
- A. Potamianos and V. Weerackody, "
Soft-feature decoding for speech recognition
over wireless channels
,'' in Proc. Internat. Conf. on Acoust., Speech,
and Signal Process., (Salt Lake City, Utah), May 2001.
- A. Pargellis and A. Potamianos, "
Cross-domain classification using generalized
domain acts
,'' in Internat. Conf. Speech Language Processing, (Beijing,
China), Oct. 2000.
- A. Potamianos, E. Ammicht, and H.-K. Kuo, "
Dialogue management in the Bell Labs communicator system
,'' in Internat. Conf. Speech Language
Processing, (Beijing, China), Oct. 2000.
- A. Potamianos and H.-K. Kuo, "
Speech understanding using finite state transducers
,'' in Internat. Conf. Speech Language Processing, (Beijing,
China), Oct. 2000.
- W. Reichl, V. Weerackody, and A. Potamianos, "A codec for speech recognition
in a wireless system,'' in Proc. EUROCOMM, (Munich, Germany), May 2000.
- A. Potamianos and P. Maragos, "
Time-frequency distributions for automatic speech recognition
,'' in Proc. Workshop on Automatic Speech Recognition
and Understanding, (Keystone, Colorado), Dec. 1999.
- S. Narayanan, A. Potamianos, and H. Wang, "Multimodal systems for children:
Building a prototype,'' in Proc. European Conf. on Speech Communication
and Technology, (Budapest, Hungary), Sept. 1999.
- G. Potamianos and A. Potamianos, "Speaker adaptation for audio-visual speech
recognition,'' in Proc. European Conf. on Speech Communication and
Technology, (Budapest, Hungary), Sept. 1999.
- A. Potamianos, G. Riccardi, and S. Narayanan, "Categorical understanding using
statistical n-gram models,'' in Proc. European Conf. on Speech
Communication and Technology, (Budapest, Hungary), Sept. 1999.
- A. Potamianos et al, "
Design principles and tools for multimodal dialog
systems
,'' in Proc. ESCA Workshop Interact. Dialog. Multi-Modal Syst.,
(Kloster Irsee, Germany), June 1999.
- G. Riccardi, A. Potamianos, and S. Narayanan, "
Language model adaptation for spoken dialog systems
,'' in Internat. Conf. Speech Language Processing,
(Australia), Oct. 1998.
- S. Okawa, E. Brocchieri, and A. Potamianos, "
Multi-band speech recognition in noisy environments
,'' in Proc. Internat. Conf. on Acoust., Speech, and
Signal Process., (Seattle, Washington), May 1998.
- A. Potamianos and S. Narayanan, "
Spoken dialog systems for children
,'' in Proc. Internat. Conf. on Acoust., Speech, and Signal Process.,
(Seattle,Washington), pp. 197-201, May 1998.
- S. Lee, A. Potamianos, and S. Narayanan, "
Analysis of children's speech:
Duration, pitch and formants,'' in Proc. European Conf. on Speech
Communication and Technology, (Rhodes, Greece), pp. 473-476, Sept. 1997.
- P. Maragos and A. Potamianos, "On using fractal features of speech sounds in
automatic speech recognition,'' in Proc. European Conf. on Speech
Communication and Technology, (Rhodes, Greece), pp. 2531-2534, Sept. 1997.
- A. Potamianos and P. Maragos, "Speech analysis and synthesis using an
AM-FM modulation model,'' in Proc. European Conf. on Speech
Communication and Technology, (Rhodes, Greece), pp. 1355-1358, Sept. 1997.
- A. Potamianos, S. Narayanan, and S. Lee, "Automatic speech recognition for
children,'' in Proc. European Conf. on Speech Communication and
Technology, (Rhodes, Greece), pp. 2371-2374, Sept. 1997.
- I. Zeljkovic, S. Narayanan, and A. Potamianos, "Unsupervised HMM
adaptation based on speech-silence discrimination,'' in Proc. European
Conf. on Speech Communication and Technology, (Rhodes, Greece),
pp. 2055-2058, Sept. 1997.
- A. Potamianos and R. C. Rose, "
On combining frequency warping and spectral
shaping in HMM-based speech recognition
,'' in Proc. Internat. Conf. on
Acoust., Speech, and Signal Process., (Munich, Germany), Apr. 1997.
- R. C. Rose and A. Potamianos, "Improving robustness in HMM based speech
recognition through simultaneous frequency warping and spectral shaping,'' in
ESCA-NATO Workshop on Robust Speech Recognition, (Pont-a-Mousson,
France), Apr. 1997.
- A. Potamianos and R. C. Rose, "A feature-space transformation for telephone
based speech recognition,'' in Proc. European Conf. on Speech
Communication and Technology, (Madrid, Spain), Sept. 1995.
- P. Maragos, A. Potamianos, and B. Santhanam, "
Instantaneous energy operators:
Applications to speech processing and communications
,'' in IEEE Workshop
on Nonlinear Signal and Image Processing, (Thessaloniki, Greece), June 1995.
- A. Potamianos and P. Maragos, "
Speech formant frequency and bandwidth tracking
using multiband energy demodulation
,'' in Proc. Internat. Conf. on
Acoust., Speech, and Signal Process., (Detroit, MI), May 1995.
- A. Potamianos and P. Maragos, "
Applications of speech processing using an
AM-FM modulation model and energy operators
,'' in Proc. European
Signal Process. Conf., (Edinburgh, Scotland), pp. III: 1669-1672, Sept.
1994.
- P. Maragos, T. F. Quatieri, J. F. Kaiser, and A. Potamianos, "Demodulation of
AM-FM resonances in speech using energy separation,'' in presentation to the Conf. of the Acoustical Society of America, (Boston,
MA), June 1994.
- H. M. Hanson, P. Maragos, and A. Potamianos, "Finding speech formants and
modulations via energy separation: With application to a vocoder,'' in Proc. Internat. Conf. on Acoust., Speech, and Signal Process., (Minneapolis,
MN), Apr. 1993.
- J. Diamesis and A. Potamianos, "Tridiagonal state-space realization of a class
of 2-D transfer functions,'' in Proc. of the 25th Conf. on Information
Sciences and Systems, (Baltimore, MD), pp. 249-253, Mar. 1991.
Thesis
A. Potamianos,
Speech Processing Applications Using an AM-FM Modulation Model,
Harvard University, 1996
Back to Home Page