AVID-MODE

“Efficient Location-Aware Audio-Visual Delivery of High-Quality Content to Mobile Devices”, 06/2009 – 05/2013.

Funded by: European Commission, FP7, Marie Curie Industry-Academia Partnerships and Pathways (IAPP).

Partners: FORTH-ICS (Coordinator, A. Mouchtaris), University of Thessaly (PI I. Katsavounidis), TWS (Italy), CIDANA (China).

Funding: €1.323.240 (FORTH-ICS: €716.000).

 SPL Contact Person: Prof. Athanasios Mouchtaris.

Summary:

Nowadays, apart from mobile phones, we have in our disposal geographic positioning devices, personal organisers, email and text communication assistants, music players which can store thousands of songs, and video players with medium-definition capabilities. It is becoming increasingly apparent that mobile devices should achieve a greater level of integration regarding the services they offer. The goal is that soon it will be possible to own a single inexpensive high-quality mobile gizmo, which can be used as a tele- and video- phone, email client, MP3 player, document and spreadsheet processor, GPS locator, and so forth. In addition, most mobile devices now include some sort of navigation capability, either through the integration of a standard GPS module, or through the clever use of the cellular network topology and signal strength. In this project, our focus is on the enhancement of high-quality video, sound, and television delivery capabilities of mobile devices that will enable them to provide interactive and immersive media applications. We address limitations to current audio and video coding methods which restrict the use of mobile devices to offering only a low-quality streaming media presentation to the user. An important aspect of this project is the conscious exploitation of the location awareness that these new devices offer, for dynamically adapting the delivery of multimedia content. The proposed consortium consists of partners with complementary expertise which covers all the involved research areas, from the point of capturing the media content to the point that it is dynamically presented through a location aware mobile device to the user.

The role of SPL:

  • Audio coding for mobile devices.
  • Real-time immersive audio presentation with mobile devices, taking into consideration the location aspect.

Related Publications:

Pavlidi, Despoina; Griffin, Anthony; Puigt, Matthieu; Mouchtaris Athanasios, Real-Time Multiple Sound Source Localization and Counting using a Circular Microphone Array, IEEE Trans. Audio, Speech, and Language Processing, vol. 21, no. 10, October 2013, pp. 2193 – 2206.

Alexandridis, Anastasios; Griffin, Anthony; Mouchtaris, Athanasios, Directional Coding of Audio Using a Circular Microphone Array, in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, May 26-31, 2013.

Griffin, Anthony; Hirvonen, Tony; Tzagkarakis, Christos; Mouchtaris, Athanasios; Tsakalides, Panagiotis, Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing, IEEE Trans. Audio, Speech, and Language Processing, vol. 19, no. 5, July 2011, pp. 1382-1395.

Sarailidis, Giorgos; Katsavounidis, Ioannis, A Multiscale Error Diffusion Technique for Digital Multitoning, IEEE Trans. Image Processing, vol. 21, no. 5, May 2012, pp. 2693 – 2705.

Achievements:

  • Developed novel real-time algorithms for directional audio coding.
  • Developed hand-gesture recognition algorithms suitable for real-time applications and mobile devices.
  • Examined in depth the new Chinese video coding standard AVS.