06/19/04

Home
Biographical information
Research interests
Resume
Publications
Teaching
Demos

 

 

Audio-visual documents contain much more semantic information than textual data, on the other hand, computers can now manipulate and store a large amount of Audio-visual data. Therefore, the amount of audio-visual data is increasing in an exponential way. As for textual information, the more information is available, the more important is the content description. However, searching in textual data using textual queries is a very natural way since the data and the query are from the same type. Unfortunately, describing the semantics of audio-visual documents to enable good search capabilities is not easy for machines. Such semantic description makes searching audio-visual documents easy and effective . For example, typical audio-visual queries can be:

1.      Find me the video file when I said:” I’m leaving this town”

2.      Find me the video file when the president of France promises the people a low unemployment rate.

3.      Find me the video file where “Einstein” presented his relativity theory in the USA.

4.      Find me a video containing a mountain close to the sea, where some children are playing.

5.      Find me a music excerpt which is happy, fast, sad, loud….

 

These queries are very probable in the near future for a society that relies more and more on Knowledge.

 

This is the context of my thesis, studying the ability of machines to listen to the audio-visual data and to describe the content.           

I am working essentially on the audio data, so I am studying machines that “listen” and describe.

 

 

Research Interests:

 

1.      Multimedia description and content-based indexing.

2.      Artificial perception systems, artificial intelligence (life).

3.      Neural Networks.

4.      Audio signal analysis.

 

Home | Biographical information | Research interests | Resume | Publications | Teaching | Demos

This site was last updated 06/19/04