Rozpoznawanie Mowy: 2013

30 gru 2013

Postdocs and PhD studentships are Tampere University of Technology, Tampere, Finland

Open positions at the Multimedia Research Group (the MUVIS Team, http://muvis.cs.tut.fi) at the Department of Signal Processing, Tampere University of Technology (TUT), Finland

2 POSTDOCS
1 PHD STUDENTSHIP

Tampere University of Technology (TUT) is an active scientific community of 2,000 employees and more than 10,000 students. The University operates in the form of a foundation and has a long-standing tradition of collaboration with research institutions and business life. Many of the fields of research and study represented play a key role in addressing global challenges. International collaboration is an inherent part of all the University's activities.

The Department of Signal Processing belongs to the Faculty of Computing and Electrical Engineering. Signal processing has been chosen as one of the top strategic fields of research in TUT and was the host of two Academy of Finland Centers of Excellence. Nearly 200 faculty, staff and researchers work at the Department and nearly half are international.

Job description:
The positions are for scientific research in externally funded projects related to “big” learning in Big Data. The Postdocs are expected to perform independent research, collaborate with team members and supervise PhD and MSc students in addition to project management duties.

Doctoral Students will work toward a dissertation as a member of the research team and they will be supervised by the senior members of the group.

Requirements:
We are looking for creative and highly motivated researchers. Suitable disciplines for all open positions include Signal Processing, Artificial Intelligence, Machine Learning, Computer Vision and other related areas. Fluent written and spoken English and solid programming (Matlab/C/C++) skills are required. Excellent skills in computer vision, machine learning (deep learning and graph theory) and content-based multimedia retrieval is essential. Java and web programing skill is valuable.

Applicants to the postdoc position should have completed (or close to) the PhD degree. Candidates applying for the doctoral student position must hold a MSc degree in a related engineering field and are expected to enroll as a PhD student at TUT.

Salary:
The salary will be set in accordance with the University Salary System. Starting Doctoral Students receive a monthly salary of 2200 euros and Postdoctoral Researcher 3200 euros.

For more information, contact
Academy Professor Moncef Gabbouj, moncef.gabbouj AT tut.fi (http://www.cs.tut.fi/~moncef/open-positions.htm )

How to apply:
Applications can be submitted in PDF format by email to moncef.gabbouj AT tut.fi. The positions will remain open until filled. The target starting date is 1 February 2014 (or earlier).

The (preferably single document) application should include the following items:
- Letter of motivation
- CV (including names and contact details of at least two references, one of which is preferably the MSc or PhD thesis supervisor)
- Copy of MSc/PhD degree certificate
- List of publications
- Research abstract

24 gru 2013

Przewidywania 25 lat temu

Dziennik telewizyjny sprzed 25 lat zdumiewająco trafnie przewidział rozwój komputerów, także na temat rozpoznawania mowy:

18 gru 2013

A tym czasem w Sheffield ...

... takie rzeczy się dzieją : http://www.bbc.co.uk/news/technology-25120697

10 gru 2013

Kolejny ciekawy artykuł ...

... o systemach dialogowych, wirtualnych doradcach, Siri itd. - http://www.economist.com/news/technology-quarterly/21590760-predictive-intelligence-new-breed-personal-assistant-software-tries

Polska wersja Interaction Analayser

"Firma Interactive Intelligence wprowadziła do oferty polską wersję językową aplikacji do analizy mowy w czasie rzeczywistym. Oprogramowanie Interaction Analyser (przeznaczone przede wszystkim dla centrów obsługi klientów, ale również dla średnich i dużych przedsiębiorstw) pozwala centrom obsługi klienta monitorować pracę konsultantów i oceniać nagrania." - Computerworld

9 gru 2013

Film z 9 Studenckiego Festiwalu Informatyki

Ostatnio odkryłem, że byłem nagrywany ;)

6 gru 2013

O głosie Koala

Kolejny ciekawy artykuł na BBC o głosie, tym razem zwierząt - link. Podobno Koala mają dodatkową parę strun głosowych.

5 gru 2013

BBC o niewidzialnych komputerach

Ciekawy artykuł o ubiquitous computing, Google glass itp. na BBC.

3 gru 2013

AKUSTYKA W JĘZYKOZNAWSTWIE - JĘZYKOZNAWSTWO W AKUSTYCE

"Szanowni Państwo,

serdecznie zapraszamy do udziału w organizowanej przez Zakład Historii Języka Polskiego i Dialektologii Wydziału Polonistyki Uniwersytetu Warszawskiego konferencji naukowej:

Konferencja odbędzie się w dniach 26-27 września 2014 r. na Wydziale Polonistyki UW (ul. Krakowskie Przedmieście 26/28). Szczegółowe informacje znajdują się w załącznikach oraz na stronie www.zhjpid.uw.edu.pl (zakładka: Konferencja Akustyka w językoznawstwie...").

Z wyrazami szacunku

dr Justyna Garczyńska

dr Monika Kresa"

29 lis 2013

Mgr inż. Sandra Imiela

W dniu dzisiejszym odbyła się obrona pracy "KOMPUTEROWA GRA FABULARNA OPARTA NA SYSTEMACH DIALOGOWYCH", której byłem promotorem i na którą niechlubnie się spóźniłem. Autorce - mgr inż. Sandrze Imieli serdecznie gratuluję i życzę powodzenia.

28 lis 2013

Przedłużenie terminu przyjmowania zgłoszeń na PVC

Na Pacific Voice Conference napłynęło już około 60 zgłoszeń. Pomimo tego zdecydowaliśmy się przedłużyć termin przyjmowania streszczeń do 20 grudnia.

www.dsp.agh.edu.pl/pvc

23 lis 2013

Dlaczego warto uczyć się przedmiotów prowadzonych przez DSP AGH

Za zgodę autora upubliczniamy list absolwenta pierwszego rocznika Inżynierii Akustycznej do prof. Ziółki.

www.dsp.agh.edu.pl

19 lis 2013

Sarmata 2.0

Nowa wersja Sarmaty zrealizowana przez Techmo sp. z o.o. osiągnęła średnią skuteczność 97,7% w testach na przeszło 5 000 nagranych wypowiedziach. W 99,6% przypadków prawidłowa hipoteza była w pierwszej trójce listy najsilniejszych hipotez. W grudniu system będzie testowany w ADESCOM Polska sp. z o.o.

www.dsp.agh.edu.pl

17 lis 2013

Balbus Speech

Jack McDermott opracował dwie aplikacje, pomocne przy leczeniu zaburzeń mowy. Speech 4 Good oraz Fluently oparto na sprawdzonych metodach terapeutycznych. Do tej pory pobrano je już ponad 10 tys. razy. Aplikacje kosztują 15 USD i 10 USD, podczas gdy według inc.com cena detaliczna innych narzędzi tego typu idzie w tysiące dolarów.

(cytat za firma.pb.pl)

15 lis 2013

Kanał DSP AGH na youtubie

Zapraszamy do subskrybowania naszego kanału youtube.

P.S. Co zrobić żeby mieć ładny link na youtubie typu youtube.com/dsp_agh?

11 lis 2013

Przykłady ofert pracy w dziedzinie technologii mowy

Job Title: Speech Scientist

Organization: Synchronoss VoiceCare R&D

Office: Bridgewater, NJ preferred

Primary Mission: Overall responsibility for maintaining & improving the speech recognition performance & accuracy of one or more existing VoiceCare solutions, and to create new grammars for new solutions or new functionalities in existing solutions.

Key Responsibilities:

Work closely with Product/Project Managers, Architects, Business Analysts, VUI Designers, Software/Systems Engineers, Customers, & Partners to understand and define the product roadmap & requirements for VoiceCare solutions from a speech recognition perspective

End-to-end grammar development and performance of existing & new VoiceCare solutions

Streamlining our existing grammars to maximize and promote reusability and common code

Innovating and improving our continuous application grammar tuning process – especially around Natural Language Grammars & Statistical Language Models

Maintaining, Supporting, Enhancing, & Improving our grammars

Learning to first use all existing Speech Science tools, and then helping to maintain and improve our tools

As a key member of a cross-functional team, monitor, analyze, & tune our VoiceCare products on a regular basis to improve performance against 3 key dimensions:

Application Effectiveness (how well is the application performing against the business objectives, for example, maximizing call-completion-rate)

Caller Experience (how good is the overall caller’s experience in using the application)

Speech Recognition Accuracy (how well does the application understand what the callers are saying)

Identifying, Defining, Improving, and Applying best-practices in grammar design, development, & tuning

Delivering high-quality application/grammar releases to QA and into production with zero or minimal defects

Providing constructive feedback and suggestions for improvements on our products, platforms, tools, & business processes to the appropriate departments

Working professionally, productively, and effectively with all cross-functional team-members both within the company and outside the company (e.g. customers, partners, & vendors)

Staying current on the latest industry trends, technologies, practices, & innovation in speech recognition technology – introducing relevant ideas into the company as appropriate

Representing the company as a Speech Science Subject Matter Expert in customer facing meetings, conferences, trade-shows, & seminars as appropriate

Promoting the company’s brand & presence in the Speech Science community by publishing technical papers and articles as appropriate

Job Requirements:

Minimum: A Bachelors of Science Degree in Computer Science, Machine Learning, Linguistics, or a related field

Preferred: Masters or PhD

4 to 8 years of direct experience in speech science research and development

Experience in developing large-scale complex grammars with very large vocabulary and/or statistical language models

Experience working with industry-leading Speech Recognition Engines (e.g. Nuance, AT&T, Microsoft, etc.)

Experience working with industry-leading VoiceXML Browsers (e.g. Genesys, Avaya, Nortel, etc.)

Excellent communications skills – written and oral

Willingness to travel as required

Desirable:

experience in the Communications Services Industry

published author in speech science (books, papers, journals, etc.)

Desired Skills and Experience