“Sound is the vocabulary of our world” @ the Abbey Road Hackathon

Pheobe participated in the Abbey Road Music Tech Hackathon, winning two awards: Best Example of Spatial Audio and Best Use of Audio Commons.

Her team, Hear The World, built a platform that sonifies the physical world, leveraging spatial sound techniques to allow those who cannot see to feel the world around them.

Well done to Pheobe and the Hear The World team for delivering a novel audio quality of experience!

The Hackathon, which took place over 9–10 November, has had QxLab members participating for the last two years, with Alessandro joining the action in 2018.

How Do Different Browsers Handle Network Delays?

Hamed is in China at the IEEE International Conference on Communication Technology (ICCT), where he will present his work “Quantifying the Influence of Browser, OS and Network Delay on Time Instant Metric Measurements for a Web Mapping Application”.

Many factors influence the quality of experience when using interactive web-based applications such as Google Maps. In this work, Hamed quantifies how the browser, operating system and network delay affect time instant metric measurements.

QxLab Grows with Two New Grants for 5G and Speech Research

Dr Wissam Jassim and Dr Alcardo (Alex) Barakabitze joined QxLab in October.

Wissam is a research fellow working on quality prediction for GAN-based codecs. His research is funded by a new grant awarded to QxLab by Google, and he will work closely with the Google Chrome team to develop a new speech quality model for generative speech codecs.

Alex is an IRC Government of Ireland Postdoctoral Fellow developing QoE-centric management of multimedia services and orchestration of resources in 5G networks using software-defined networking (SDN) and network function virtualisation (NFV).


Wissam and Alex will be based in the Insight Centre at UCD.

Alessandro at the Turing

Alessandro has temporarily relocated to London, where he will be based at the Alan Turing Institute as part of a Turing-Insight research collaboration. The Enrichment programme sees a cohort of 54 students based at the Turing in London for 2019/20.

Spending time in London will give Alessandro access to the facilities of the Turing, housed in the British Library. He will also have the opportunity to visit Queen Mary University of London, where his co-supervisor Emmanouil Benetos is based.

While in London, Alessandro will continue his PhD research using deep learning models to predict quality for audio archives.


3D Sound and Vision with the Oculus Quest

Matthew Parker from Texas Tech University spent the summer at QxLab, hosted by Insight and the School of Computer Science. Over eight weeks, he developed a virtual reality environment to explore audio-visual fusion using the newest-generation Oculus Quest wireless VR headset.

Matthew presenting his poster titled “Perception Deception: Exploring Audio Localization in Virtual Reality Using The McGurk Effect” at the UCD Science poster session.

The audio and video immersion provided by virtual reality headsets is what makes virtual reality so enticing, and a better understanding of that immersion is needed, particularly in streaming applications where bandwidth is limited. The full 360-degree scenes that are the trademark of virtual reality require copious amounts of data to produce at high quality. Matthew's work examined audio localization as a possible candidate for data compression by exploiting an audio-visual phenomenon known as the McGurk Effect. The McGurk Effect occurs when someone experiences mismatched audio and video stimuli and perceives a sound that differs from either stimulus. The classic example is a video of a talker saying /ga/ dubbed with audio of someone saying /ba/, which usually results in the perception of /da/.

For audio he used Ambisonics, a spatial audio format that can be rendered over headphones using head-related transfer functions (HRTFs) to mimic how humans naturally hear sounds in acoustic environments.
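
To make the encoding side of this concrete, here is a minimal sketch (an illustration under standard first-order conventions, not Matthew's actual code) of how a mono source is panned into first-order ambisonic B-format, the four-channel W/X/Y/Z representation that an HRTF-based renderer would then convert to binaural audio:

```python
import numpy as np

def encode_first_order_bformat(mono, azimuth, elevation):
    """Pan a mono signal into first-order B-format (W, X, Y, Z).

    Angles are in radians; W carries the omnidirectional component,
    attenuated by 1/sqrt(2) in the traditional B-format convention.
    """
    w = mono / np.sqrt(2.0)                          # omnidirectional
    x = mono * np.cos(azimuth) * np.cos(elevation)   # front-back axis
    y = mono * np.sin(azimuth) * np.cos(elevation)   # left-right axis
    z = mono * np.sin(elevation)                     # up-down axis
    return np.stack([w, x, y, z])

# Example: place a 440 Hz tone 90 degrees to the listener's left.
fs = 48000
t = np.arange(fs) / fs
tone = 0.5 * np.sin(2 * np.pi * 440 * t)
bformat = encode_first_order_bformat(tone, azimuth=np.pi / 2, elevation=0.0)
print(bformat.shape)  # (4, 48000)
```

Rendering these four channels binaurally then amounts to decoding them to virtual loudspeakers and convolving each feed with the matching HRTF pair.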


Today is World Archive Day

Today is the UNESCO World Archives Day, highlighting the important work of archives and archivists in preserving our cultural heritage. The date commemorates the founding of the International Council on Archives (ICA) on 9 June 1948 under the auspices of UNESCO. According to the ICA, “[a]rchives represent an unparalleled wealth. They are the documentary product of human activity and as such constitute irreplaceable testimonies of past events. They ensure the democratic functioning of societies, the identity of individuals and communities and the defense of human rights.”

Following quickly after the 6 June D-Day commemorations, today is a good day to highlight the important work that has been taking place to digitise and preserve the audio archives of the Nuremberg trials. Witnesses, lawyers and judges were recorded in their native tongues, together with recordings of the live translations. This resulted in 775 hours of original trial audio recorded on 1,942 Presto gramophone discs, with the translations on embossed tape, a clear-colored film also known as Amertape. While the tape degraded, the discs survived. The digitised archive will be published next year, but the fascinating story was recently told in articles for The Verge and PRI by Christopher Harland-Dunaway. The University of Fribourg’s Ottar Johnsen worked with Stefano Cavaglieri, a colleague at the Swiss National Sound Archives, and the International Court of Justice’s archivists, using imaging and audio digital signal processing to capture the archive material.

Last week, at the 11th International Conference on Quality of Multimedia Experience (QoMEX), QxLab PhD student Alessandro Ragano presented our work on how audio archive stakeholders perceive quality in archive material. Examining the lifecycle from digitisation through restoration to consumption, the work highlights the relevant influence factors and stakeholders. At QxLab we are interested in how audio digital signal processing techniques can be used in conjunction with data-driven machine learning to capture, enhance and explore audio archives.

Alessandro’s research is supported in part by a research grant from Science Foundation Ireland (SFI) and is co-funded under the European Regional Development Fund under Grant No. 17/RC-PhD/3483. This publication has emanated from research supported by Insight, which is supported by SFI under grant number 12/RC/2289. EB is supported by RAEng Research Fellowship RF/128 and a Turing Fellowship.


PhD opportunities in Machine Learning and Digital Content QoE

QxLab is participating in two Science Foundation Ireland Centres for Research Training. These centres will recruit cohorts of students into innovative, industry-partnered research training programmes.

If you are interested in machine learning for multimedia quality of experience or health applications for quality of life, apply to the ML-Labs centre. If you are interested in speech and audio applications for Augmented or Virtual Reality, take a look at the D-Real centre.

The ML-Labs and D-Real centres for research training are recruiting now, with the first cohorts starting in September 2019.

This is the biggest single funding scheme for a cohort-focused PhD training programme in Ireland, with an investment by SFI of €100 million, and QxLab is part of two of the five training centres.

Media Coverage: Silicon Republic | Irish Tech News | Business World


AES Ireland Section AGM at UCD

The first AES Ireland Section meeting will take place in room B1.09 in the Computer Science building at UCD on Friday, February 15th. The meeting will begin with a lecture at 17:00 by Dr Andrew Hines (details below), followed by the first AGM at 18:00.

If anyone would like to put themselves forward for election to any of the committee roles, please contact Enda Bates (e.bates@tcd.ie). Please note: you must be an AES member to qualify for these roles but non-members are welcome to attend.

Speaker: Dr Andrew Hines, Assistant Professor, School of Computer Science, University College Dublin (qxlab.ucd.ie)

Title: Quality Assessment for Compressed Ambisonic Audio

Description:

Spatial audio with a high degree of sound quality and accurate localization is essential for creating a sense of immersion in a virtual environment. VR content creators can use spatial audio to attract the audience's attention in relation to their story, or to guide the audience through a narrative in VR, relying on our hearing something before we see it to focus attention. Delivering spatial audio over networks requires efficient encoding techniques that compress the raw audio content without compromising quality. Lossy compression schemes such as Opus typically reduce the amount of data to be sent by discarding some information. For ambisonic spatial audio, this discarded information can be important to listening quality or localization accuracy. Streaming service providers such as YouTube typically transcode uploaded content into various bit rates and need a perceptually relevant audio quality metric to monitor users’ perceived quality and spatial localization accuracy. This talk will present subjective listening test experiments that explore the effect of Opus codec compression on quality as perceived by listeners. It will also introduce AMBIQUAL, a full-reference objective spatial audio quality metric that derives both listening quality and localization accuracy metrics directly from B-format ambisonic audio.
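
As a rough sketch of the kind of bitrate sweep such experiments involve (the file name and bitrates below are illustrative assumptions, not the actual test setup), one could transcode a B-format recording at several Opus bitrates using ffmpeg's libopus encoder, assumed to be installed, and keep each output for comparison against the reference:

```python
import subprocess

# Illustrative bitrate sweep; the reference file name is hypothetical.
bitrates_kbps = [32, 64, 128, 256]

for kbps in bitrates_kbps:
    out_file = f"ambisonic_{kbps}k.opus"
    # Compress the 4-channel B-format WAV with the Opus codec via ffmpeg.
    subprocess.run(
        ["ffmpeg", "-y", "-i", "ambisonic_ref.wav",
         "-c:a", "libopus", "-b:a", f"{kbps}k", out_file],
        check=True,
    )
```

Each decoded output can then be scored against the uncompressed reference with a metric such as AMBIQUAL to see how listening quality and localization accuracy degrade as the bitrate drops.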

What do you tell a room full of PhD students?

When I was asked to give the talk, I went through many of the stages of the “PhD roller-coaster” compressed into several hours. I accepted the request without reflection (other than a “sure, that needs no preparation…”) and then panicked that my PhD experiences were stale and possibly no longer relevant. Then I realised that I wasn’t being asked for advice through the lens of a student; the actual question was what advice I could offer as someone who has experienced both sides of the student-advisor relationship. Getting the research question right was an important first step. Next I read a few other blogs, papers and tweets. There is already a large body of work in the area of PhD advice, so I decided to skip the exhaustive literature review and take a case-study approach, focusing purely on my own experience.

Having “mastered the topic” (or at least as much as I was going to!), I scribbled a few notes on areas I thought I might want to cover: literature review, self-management, research network building, and developing your identity as a researcher. I then wondered how to present it. I considered what might make it engaging – neat slides, video examples – and decided to deliver the advice without aids, as an example of how, if the content of your talk is of interest to the audience, they will remain engaged even if they have nothing more interesting to look at than the speaker. To tie it together (and to help me remember what I planned to say), I decided to present it in the format of twelve tips. If you are interested in reading them, they were recently posted on the school blog.


No more 1 minute poster intros!

Alessandro and Andrew travelled to London for the 13th Digital Music Research Network One-day Workshop, hosted at Queen Mary University of London. The event followed a format similar to recent ISMIR conferences: presenters at the poster sessions each gave a 4-minute overview of their research before the main poster session, so you could get a feel for the work and then go and talk to the presenters to find out more about their results. This format is a nice middle ground between the regular 10+5-minute oral presentation with questions and the rapid-fire 1-minute ‘poster madness’ sessions. Hopefully it will catch on at more conferences!

At the poster session, Alessandro presented his work on “What happens to the musical works of the past?” An SFI-funded PhD student in the Insight Centre at UCD, Alessandro is co-supervised by Emmanouil Benetos from Queen Mary University of London. He will spend one year of his PhD based in London at the Alan Turing Institute.