Database of references - search results

Database of references - Multimedia Systems Department, TUG

784 record found

Entry No. 1

Entry type conference paper

Authors A. Czyżewski, P. Odya, J. Smulko, G. Lentka, B. Kostek, M. Kotarski

English title Scent Emitting Multimodal Computer Interface for Learning Enhancement

Polish title Wspomaganie procesu uczenia z wykorzystaniem Multimodalnego Interfejsu Aromatowego

Conference MIMIC 2010 - Fourth International Workshop on Management and Interaction with Multimodal Information Content

Preprint

Number

Volume

Pages 142 - 146

Conference site Bilbao, Hiszpania

Conference date 30.8.10- 3.9.10

Notes DEXA 2010 - 21. Int. Workshops on Databse and Expert System Applications

Abstract A scent emitting multimodal computer interface is an important supplement of the polysensoric stimulation process. Such stimulation plays an essential role in education and therapy in children with developmental disorders (e.g. in the case of autism or ADHD). The interface engineered may be a part of the equipment of the so-called world experience rooms but can also be used separately, therefore enhancing substantially educational software. Emitting scents facilitate learning new material during the science lessons and/or make them more attractive; e.g. biology. The device developed uses USB and Bluetooth interfaces and supports a standard PC. Details about the interface are described in the paper, along with possible fields of application.

Streszczenie Komputerowy interfejs aromatyczny stanowi ważne uzupełnienie procesu stymulacji polisensorycznej. Stymulacja ta odgrywa kluczową rolę w terapii i kształceniu dzieci z zaburzeniami rozwoju (np. w przypadku autyzmu czy ADHD). Opracowany interfejs może stać się elementem wyposażenia tzw. sal doświadczania świata, ale może być także stosowany niezależnie stanowiąc znaczące wzbogacenie komputerowych programów edukacyjnych. Dzięki możliwości emitowania zapachów można urozmaicać i uatrakcyjniać lekcje np. biologii czy materiałoznawstwa. Opracowane urządzenie korzysta z interfejsu USB lub Bluetooth i współpracuje z typowym komputerem klasy PC. Możliwe jest także uzyskanie informacji zwrotnych, które pozwalają sterować nasyceniem zapachu w pomieszczeniu.

Entry No. 2

Entry type conference paper

Authors B. Kostek, A. Czyżewski, S. K. Zieliński

English title Artificial Intelligence Approach to the Detection of Events in Musical Signal

Polish title Wykorzystanie sztucznej inteligencji do wykrywania zdarzeń w sygnale muzycznym

Conference 96th AES Convention

Preprint 3822 (P7.8)

Number

Volume

Pages

Conference site Amsterdam, Holland

Conference date 26.2.1994- 1.3.1994

Abstract Experiments with events detection in musical signal with the use of the learning algorithm were described. Particular attention was paid to the recognition of transient states in pipe organ sound. The concept of the computer pipe organ control system based on artificial intelligence approach raised from these experiments. Applications of learning algorithm to musical pattern recognition and analyzing of transient states in musical sound were shown.

Streszczenie Zostały opisane eksperymenty z wykrywaniem zdarzeń w sygnale muzycznym z użyciem algorytmu uczącego się. Szczególna uwaga została poświęcona rozpoznawaniu stanów transjentowych w dźwięku piszczałki organowej. Z eksperymentów tych powstała koncepcja komputerowego systemu sterującego piszczałką organową bazującego na sztucznej inteligencji. Zostały przedstawione zastosowania algorytmu uczącego się do rozpoznawania fraz muzycznych i analizy stanów transjentowych w dźwiękach muzycznych.

Entry No. 3

Entry type conference paper

Authors A. Czyżewski, B. Kostek, S. K. Zieliński

English title New approach to the synthesis of organ pipe sound

Polish title Nowa metoda syntezy dźwięku organowego

Conference 98th AES Convention

Preprint 3957 (E2)

Number

Volume

Pages

Conference site Paris, France

Conference date 25.2.1995- 28.2.1995

Abstract Apperance of cheap digital signal processors on the electronics market opens a new domain of applications related to the real-time synthesis of sound of wind instruments basing directly on their physical models. Problems related to the implementation of physical model based synthesis of organ sound are discussed. Results of some experiments with this kind of synthesis are presented. Special features of the practically implemented method are quoted.

Streszczenie Pojawienie się tanich cyfrowych procesorów sygnałowych na rynku elektronicznym otwiera nową dziedzinę zastosowań związanych z syntezą w czasie rzeczywistym dźwięków instrumentów dętych bazujących bezpośrednio na ich modelach fizycznych. Przedyskutowano problemy związane z implementacją modeli fizycznych, na których bazuje synteza dźwięku organowego. Przedstawiono rezultaty niektórych eksperymentów z tego rodzaju syntezą. Wspomniano o specjalnych właściwościach praktycznie zaimplementowanych metod.

Entry No. 4

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Noise Reduction in Audio Employing Auditory Masking Approach

Polish title Redukcja szumu w sygnałach fonicznych z wykorzystaniem zjawiska maskowania

Conference 106th Audio Engineering Society Convention

Preprint 4930

Number

Volume

Pages

Conference site Monachium, Germany

Conference date 8.5.1999- 11.5.1999

Abstract A new method of noise reduction which exploits some features of the human auditory system is proposed by the authors. The noise suppression is obtained twofold: by uplifting masking thresholds and by keeping noisy components just beneath these thresholds. The foundations of the engineered method are discussed extensively in the paper, and some engineered perceptual noise reduction algorithms are described. The way of introduction of the noise reduction features into an MPEG encoder is demonstrated.

Streszczenie W referacie zaproponowano metodę redukcji szumu polegającą na wykorzystaniu niektórych cech systemu słuchowego człowieka. Tłumienie szumu może być uzyskane na dwa sposoby: poprzez podniesienie progów maskowania lub też przez składowych szumowych tuż poniżej tych progów. Szczegółowo przedstawiono podstawy opracowanej metody, jak również opisano algorytmy perceptualnej redukcji szumu. W referacie zamieszczono także sposób włączenia tej metody do schematu funkcjonalnego kodera w standardzie MPEG.

Entry No. 5

Entry type conference paper

Authors S.K. Zieliński, A. Czyżewski

English title A Method for Echo Cancellation in Audio Signals Using the Genetic Algorithm

Polish title Metoda tłumienia echa przy użyciu algorytmów genetycznych

Conference Joint Meeting, 137th regular meeting of the Acoustical Society of America and the 2nd convention of the EAA: Forum Acusticum - integrating the 25th German Acoustics DAGA Conference

Preprint

Number

Volume

Pages

Conference site Berlin, Germany

Conference date 14.3.1999- 19.3.1999

Abstract In this paper, a new method of echo cancellation is proposed. This method is based on the use of models of systems causing the echo. Parameters of such models are optimized using the genetic algorithm. The computational cost of the proposed method can be minimized by the application of the correlation function.

Streszczenie W komunikacie konferencyjnym przedstawiono nową metodę tłumienia echa. Metoda ta wykorzystuje modele systemów wytwarzających echo. Parametry modeli optymalizowane są przy użyciu algorytmów genetycznych. Złożoność obliczeniowa zaproponowanego algorytmu może być zmniejszona poprzez zastosowanie przetwarzania wstępnego sygnału wykorzystującego funkcje korelacji.

Entry No. 6

Entry type conference paper

Authors A. Czyżewski, J. Lasecki, B. Kostek

English title Neural network-based beamformer

Polish title Beamformer oparty o sieć neuronową

Conference 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS99)

Preprint pp. 195-198

Number

Volume 3

Pages

Conference site Orlando, USA

Conference date 31.7.1999- 5.8.1999

Abstract A simple and robust spatial filtration algorithm working in the frequency domain is described. This algorithm uses neural network trained to decide whether a spectral component comes from forward or lateral/backward directions. Some details explaining how the neural network is acquiring knowledge about the signal directivity are presented. A discussion of the algorithm efficiency and limitations is also included.

Entry No. 7

Entry type conference paper

Authors H. Skarżyński, A. Czyżewski, B. Kostek

English title Electrostimulation Tests as a Tool in Cochlear Implant Preoperative Diagnostics

Polish title Testy elektrostymulacji jako badania kwalifikujące do implantowania osób niesłyszących

Conference 137 Acoust. Soc. of Amer. Meeting

Preprint

Number 5aPPb

Volume

Pages

Conference site Berlin,

Conference date 13.3.1999- 18.3.1999

Abstract The procedures developed at the Institute of Physiology and Pathology of Hearing in Warsaw allow to determine some vital characteristics of the hearing sense that help to make decisions regarding the cochlear implantation. Apart from standard pre-examination procedures a test based on the electrical stimulation via the external auditory canal filled with saline can be performed. In order to evaluate the test results both the dynamics range defined by auditory threshold and uncomfortable loudness level and the Time Difference Limen Test are considered. Moreover in some deaf patients a speech communication was achieved with the use of the ball shaped electrode and the spectral compression of speech signal. In this way, the interpretation of the electrical stimulation test results for the new diagnosed cases was made more reliable.

Streszczenie W referacie przedstawiono bezwzględne i względne kryteria kwalifikacji do implantacji wszczepów ślimakowych. Przedstawiono również nową metodę badania kwalifikującego do operacji.

Entry No. 8

Entry type conference paper

Authors A. Czyżewski, J. Lasecki, B. Kostek

English title Computational Approach to Spatial Filtering

Polish title Realizacja algorytmu filtracji przestrzennej opartego o sztuczne sieci neuronowe

Conference 7th European Congress on Intelligent Techniques and Soft Computing

Preprint pp.242

Number

Volume CD-ROM Proceedings

Pages

Conference site Aachen, Germany

Conference date 13.9.1999- 16.9.1999

Abstract Hearing impaired persons have difficulty in understanding speech in cocktail-party conditions. Spatial filtering may be very helpful for such people. This feature should be applied to the hearing aid, thus the computational complexity of spatial filtering-based algorithm must allow real time implementation. In order to meet this assumption some investigations were made and neural network-based algorithm was proposed. This algorithm is presented in this paper.

Streszczenie yyyy

Entry No. 9

Entry type conference paper

Authors B. Kostek, A. Czyżewski, J. Lasecki

English title Spatial Filtration of Sound for Multimedia Systems

Polish title Filtracja przestrzenna dźwięku w systemach multimedialnych

Conference IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing

Preprint 209-213

Number

Volume CD-ROM Proceedings

Pages

Conference site Copenhagen, Denmark

Conference date 13.9.1999- 15.9.1999

Abstract This paper deals with the problem of receiving of a desired signal in noisy or cocktail-party" conditions. This problem is vital in many domains, such as communications, multimedia (multimodal interaction), speech recognition, and psychoacoustics (hearing prostheses). It can be partially solved by classical filtering techniques, however these techniques often introduce distortions into the filtered signal. On the other hand, as it results form experiments performed by the authors, a spatial filtration can be performed based on the Artificial Neural Network (ANN). Such an algorithm was elaborated, and some details concerning its implementation are described. Moreover, results of experiments are presented. These results demonstrate that ANN-based nonlinear filter increases the signal-to-noise ratio and improves speech intelligibility.

Streszczenie W pracy przedstawiono nowe rozwiązanie filtracji przestrzennej sygnału w obecności dookólnego szumu. W odróżnieniu od klasycznych podejść, w których stosuje się typowe filtry cyfrowe, w podejściu opisywanym w niniejszej pracy jako element filtrujący zastosowano sztuczną sieć neuronową. W pracy zaprezentowano wyniki pomiarów opracowanego filtru przestrzennego.

Entry No. 10

Entry type conference paper

Authors A. Czyżewski, R. Królikowski, S.K. Zieliński, B. Kostek

English title Intelligent Echo and Noise Reduction

Polish title Inteligenta metoda tłumienia echa i szumu

Conference 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS99)

Preprint pp.234-238

Number

Volume 4

Pages

Conference site Orlando, USA

Conference date 31.7.1999- 4.8.1999

Abstract New concepts of echo cancellation and reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels are proposed. In the both cases, some methods originated form artificial intelligence domain, i.e.: genetic algorithms, neural networks, rough sets are applied. In turn, in the noise reduction method, some features of the human auditory system are presented in the paper. Furthermore, a number of experiments have been carried out, and a brief discussion on some of them is included in the paper.

Streszczenie W referacie zaproponowano nową koncepcję redukcji echa oraz szumu niestacjonarnego, zakłócających sygnały foniczne transmitowane w kanałach telekomunikacyjnych. W obu przypadkach wykorzystano niektóre metody z dziedziny sztucznej inteligencji, tj.: algorytmy genetyczne, sieci neuronowe i zbiory przybliżone. Z kolei, w metodzie redukcji szumu wykorzystywane są pewne cechy systemu słuchowego człowieka. W referacie przedstawiono podstawy opracowanej metody wraz z opisem zastosowanych algorytmów inteligentnych. Ponadto, przeprowadzono szereg eksperymentów i zamieszczono krótką dyskusja nad uzyskanymi wynikami.

Entry No. 11

Entry type conference paper

Authors G. Szwoch, B. Kostek, A. Czyżewski

English title Designing Waveguide Elements of a Hearing Aid Using the Physical Modeling Techniques

Polish title Projektowanie falowodowych elementów aparatów słuchowych przy użyciu metody modelowania fizycznego

Conference 106th AES Convention

Preprint 4870

Number

Volume

Pages

Conference site Munich, Germany

Conference date 8.5.1999- 11.5.1999

Abstract The aim of this paper is to model a desired transfer function of a hearing aid. For this purpose physical modeling techniques were used in order to change parameters of a model in real time. The main features of a system allow one to design waveguide elements of a hearing aid. Such a system may be helpful in the process of fitting some hearing aid elements.

Streszczenie W artykule opisano metodę projektowania akustycznych elementów aparatów słuchowych, posiadających pożądane charakterystyki częstotliwościowe. Metoda modelowania fizycznego została wykorzystana w celu umożliwienia regulacji parametów modelu w czasie rzeczywistym. System ten pozwoli na projektowanie falowodowych elementów aparatów słuchowych. Może on być pomocny w procesie dobierania aparatu słuchowego.

Entry No. 12

Entry type conference paper

Authors A. Czyżewski, R. Królikowski, S.K. Zieliński, B. Kostek

English title Echo and Noise Reduction Methods for Multimedia Communication Systems

Polish title Metody redukcji echa i szumu dla multimedialnych systemów komunikacyjnych

Conference IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing

Preprint pp. 239-244

Number

Volume

Pages

Conference site Copenhagen, Denmark

Conference date 13.9.1999- 15.9.1999

Abstract New concepts of echo cancellation and reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels are proposed. In the both cases, some methods originated form artificial intelligence domain, i.e.: genetic algorithms, neural networks, rough sets are applied. Moreover, in the noise reduction method, some features of the human auditory system are exploited. A number of experiments have been carried out, and a brief discussion on some of them is included in the paper.

Streszczenie W referacie zaproponowano nową metodę redukcji echa oraz szumu niestacjonarnego, zakłócających sygnały foniczne transmitowane w kanałach telekomunikacyjnych. W obu przypadkach wykorzystano niektóre metody z dziedziny sztucznej inteligencji, tj.: algorytmy genetyczne, sieci neuronowe i zbiory przybliżone. Dodatkowo, w metodzie redukcji szumu wykorzystywane są pewne cechy systemu słuchowego człowieka. Przeprowadzono szereg eksperymentów i zamieszczono krótką dyskusja nad uzyskanymi wynikami.

Entry No. 13

Entry type conference paper

Authors H. Skarżyński, A. Czyżewski, B. Kostek, G. Szwoch

English title Computer Techniques in Electrostimulation Testing of Hearing and Hearing Aid Modeling

Polish title Wybrane zastosowania techniki komputerowej w testowaniu i modelowaniu charakterystyk słyszenia

Conference 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI'99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS'99)

Preprint pp. 216-221

Number

Volume 8

Pages

Conference site Orlando, USA

Conference date 31.7.1999- 5.8.1999

Abstract In this paper two aplications of computer techniques in audiology are presented. In the first part of the paper, a new method to examine electrostimulation of structure of the auditory tract, developed in the Institute of Physiology and Pathology of Hearing, is described. The study is dedicated to the problem of an evaluation of the auditory nerve electrical sensitivity in deaf people, assisted by computer technology. A new method is proposed, which enables an assessment of both hearing loss in a given moment of time and the future benefits of the cochlear implant to the patient. In the second part of the paper, a new method of fitting acoustical elements of a hearing aid is proposed. A digital waveguide model of these elements is designed. Next,on the basis of this model computer simulations are performed. It is possible to obtain the desired shape of transfer function of the model by changing the values of its parameters. Resulting a computer simulation dimensions of the physical system can be calculated. This method can be used to design acoustical elements of a hearing aid, having desired acoustical properties. Both applications, although aimed at different group of patient

Entry No. 14

Entry type conference paper

Authors R. Królikowski, A. Czyżewski

English title Noise Reduction in Telecommunication ChannelsUsing Rough Sets and Neural Networks

Polish title Redukcja szumu w kanałach telekomunikacyjnych z wykorzystaniem zbiorów przybliżonych i sieci neuronowych

Conference 7th International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing

Preprint pp. 100-108

Number

Volume

Pages

Conference site Ube, Yamaguchi, Japonia,

Conference date 9.11.1999- 11.11.1999

Abstract A new concept of reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels is proposed. This concept exploits some features of the human auditory system as well as some methods originated from soft computing domain, i.e. rough set-based reasoning and neural processing. The foundations of the engineered method and a description of applied decision algorithms are presented. A number of experiments have been prepared, and some of them have already been carried out. A brief discussion of these experiments' results and some conclusions are also included.

Streszczenie W referacie zaproponowano nową koncepcję redukcji szumu niestacjonarnego zakłócającego sygnały foniczne w kanałach telekomunikacyjnych. Koncepcja ta wykorzystuje pewne cechy systemu słyszenia, jak również pewne metody z dziedziny soft-computingu, tj. wnioskowanie w oparciu o zbiory przybliżone oraz przetwarzanie neuronowe. Przedstawiono podstawy opracowanej metody oraz opis zastosowanych algorytmów. Przygotowano i wykonano pewną liczbę eksperymentów. W referacie zamieszczono krótką dyskusję nad otrzymanymi wynikami wraz wnioskami końcowymi.

Entry No. 15

Entry type conference paper

Authors A. Czyżewski, J. Lasecki, B. Kostek

English title Computational Approach to Spatial Filtering

Polish title Realizacja algorytmu filtracji przestrzennej opartego o sztuczne sieci neuronowe

Conference 7th European Congress on Intelligent Techniques and Soft Computing

Preprint pp.242

Number

Volume CD-ROM Proceedings

Pages

Conference site Aachen, Germany

Conference date 13.9.1999- 16.9.1999

Abstract Hearing impaired persons have difficulty in understanding speech in cocktail-party conditions. Spatial filtering may be very helpful for such people. This feature should be applied to the hearing aid, thus the computational complexity of spatial filtering-based algorithm must allow real time implementation. In order to meet this assumption some investigations were made and neural network-based algorithm was proposed. This algorithm is presented in this paper.

Streszczenie yyyy

Entry No. 16

Entry type conference paper

Authors A. Czyżewski, J. Lasecki, B. Kostek

English title Computational Approach to Spatial Filtering

Polish title Realizacja algorytmu filtracji przestrzennej opartego o sztuczne sieci neuronowe

Conference 7th European Congress on Intelligent Techniques and Soft Computing

Preprint pp.242

Number

Volume CD-ROM Proceedings

Pages

Conference site Aachen, Germany

Conference date 13.9.1999- 16.9.1999

Abstract Hearing impaired persons have difficulty in understanding speech in cocktail-party conditions. Spatial filtering may be very helpful for such people. This feature should be applied to the hearing aid, thus the computational complexity of spatial filtering-based algorithm must allow real time implementation. In order to meet this assumption some investigations were made and neural network-based algorithm was proposed. This algorithm is presented in this paper.

Streszczenie yyyy

Entry No. 17

Entry type conference paper

Authors S.K. Zieliński, A. Czyżewski

English title A Novel Approach to Echo Cancellation

Polish title Nowe podejście do tłumienia echa

Conference 106th Audio Engineering Society Convention

Preprint 4901

Number

Volume

Pages

Conference site Munich, Germany

Conference date 8.5.1999- 11.5.1999

Abstract In this paper a new method of echo cancellation is proposed. This method is based on the genetic algorithm.

Streszczenie W pracy zaproponowano nową metodę tłumienia echa. Metoda ta opiera się na wykorzystaniu algorytmów genetycznych.

Entry No. 18

Entry type book

Authors A. Czyżewski, R. Królikowski

English title Application of Fuzzy Logic and Rough Sets to Audio Signal Enhancement

Polish title Zastosowanie logiki rozmytej i zbiorów przybliżonych do poprawy jakości sygnału fonicznego s. 397-409

Editor Springer-Verlag

Pages 379

Abstract A method of noise reduction, related to spectral subtraction and controlled by intelligent algorithms, is described in the paper. A decision system based on fuzzy logic and rough sets is presented. The engineered inference algorithm exploiting rough sets is also included.

Streszczenie W artykule opisano metodę redukcji szumu, zbliżoną do odejmowania widmowego, sterowaną przy pomocy algorytmów inteligentnych. Przedstawiono zastosowanie systemu decyzyjnego pracującego w logice rozmytej oraz wykorzystującego wnioskowanie oparte na zbiorach przybliżonych. Zamieszczono również opracowany algorytm decyzyjny działający przy użyciu zbiorów przybliżonych.

Entry No. 19

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Perceptual Approach to Noise Reduction

Polish title Perceptualna redukcja szumu

Conference 8th International Symposium on Sound Engineering and Mastering

Preprint s. 53-58

Number

Volume

Pages

Conference site Gdańsk, Poland

Conference date 9.9.1999- 11.9.1999

Abstract A perceptual approach to noise reduction problem is presented in the paper. According to this, the noise suppression can be obtained by exploiting some masking properties of the auditory system. In the paper, mathematical foundations of the engineered method with relevant algorithms, and some observations related to the carried out experiments are briefly presented.

Streszczenie W referacie przedstawiono podejście perceptualne do problemu redukcji szumu, zgodnie z którym, redukcja szumu może być uzyskana wskutek wykorzystania właściwości maskowania ucha ludzkiego. W referacie krótko omówiono matematyczne podstawy opracowanej metody wraz z odpowiednimi algorytmami, a także zamieszczono niektóre obserwacje dotyczące wykonanych eksperymentów.

Entry No. 20

Entry type conference paper

Authors R. Królikowski, A. Czyżewski

English title Noise Reduction in Acoustic Signals Using the Perceptual Coding

Polish title Redukcja szumu w sygnałach fonicznych z wykorzystaniem kodowania perceptualnego

Conference 137th Regular Meeting of the Acoustical Society of America

Preprint S49 (1pSCa8)

Number

Volume CD Proceedings

Pages

Conference site Berlin, Germany

Conference date 14.3.1999- 19.3.1999

Abstract A new method of noise reduction exploiting some features of the human auditory system is proposed by the authors. The noise suppression is obtained twofold: by uplifting masking thresholds and by keeping noisy components just beneath these thresholds. The foundations of the engineered method are described, and some results of the carried out experiments are briefly discussed in the paper.

Streszczenie W referacie, zamieszczono propozycję metody redukcji szumu z wykorzystaniem pewnych cech systemu słyszenia człowieka. Wg niej, redukcję szumu obecnego w sygnale można uzyskać w trakcie perceptualnego kodowania w dwojaki sposób: poprzez podniesienie progów maskowania lub też utrzymując składowe szumowe poniżej tych progów. W referacie opisano podstawy opracowanej metody oraz krótko przedyskutowano wyniki przeprowadzonych eksperymentów.

Entry No. 21

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Noise Reduction in Audio Signals Based on the Perceptual Coding Approach

Polish title Redukcja szumu w sygnałach fonicznych w oparciu o perceptualne kodowanie sygnałów

Conference IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Preprint pp. 147-150

Number

Volume

Pages

Conference site New Paltz, NY, USA

Conference date 17.10.1999- 20.10.1999

Abstract A new concept of the reduction of noise affecting audio signals transmitted in telecommunication channels is proposed. This concept is exploiting some features of the human auditory system. A strong subjective effect of noise suppression in noisy audio can be obtained by uplifting masking thresholds above the estimated level of noisy components or by reducing this level in such a way that the components be maintained just below masking thresholds. The foundations of the engineered method together with the appropriate algorithms are described in the paper. A brief discussion on the results of carried out experiments and some conclusions are also included in the paper. The main focus is put on perceptual foundations of the noise reduction method.

Streszczenie W referacie zaproponowano nową koncepcję redukcji szumu, wykorzystującą pewne cechy systemu słyszenia człowieka. Silny subiektywny efekt tłumienia szumu można uzyskać albo przez podniesienie progów maskowania powyżej oszacowanego poziomu składowych szumowych, albo przez redukcję poziomu tych składowych poniżej progów maskowania. W referacie opisano podstawy opracowanej metody wraz z odpowiednimi algorytmami oraz zamieszczono krótką dyskusję na temat uzyskanych rezultatów wraz z wnioskami. Skupiono się w nim głównie na perceptualnych podstawach opracowanej metody redukcji szumu.

Entry No. 22

Entry type conference paper

Authors R. Królikowski, A. Czyżewski

English title Applications of Rough Sets and Neural Nets to Noisy Audio Enhancemen

Polish title Zastosowanie zbiorów przybliżonych oraz sieci neuronowych do poprawy jakości sygnału fonicznego

Conference 7th European Congress on Intelligent Techniques and Soft Computing

Preprint pp. 240

Number

Volume CD-ROM Proceedings

Pages

Conference site Aachen, Germany

Conference date 13.9.1999- 16.9.1999

Abstract A new concept of reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels is proposed. This concept exploits some features of the human auditory system as well as some methods originated from artificial intelligence domain, i.e. reasoning based on rough sets and neural processing. The foundations of the engineered method together with a description of applied intelligent decision algorithms are presented in the paper. A number of experiments have been prepared, and some of them have already been carried out. Hence, a brief discussion on the results of these experiments and some conclusions are also included in the paper.

Streszczenie W referacie zaproponowano nową metodę redukcji szumu niestacjonarnego, zakłócającego sygnały foniczne transmitowane w kanałach telekomunikacyjnych. Wykorzystuje ona zarówno pewne cechy systemu słuchowego człowieka, jak również pewne metody sztucznej inteligencji, tj. wnioskowanie oparte na zbiorach przybliżonych i przetwarzanie neuronowe. Przedstawiono podstawy opracowanej metody wraz z opisem zastosowanych inteligentnych systemów decyzyjnych. Przygotowano szereg eksperymentów i zamieszczono krótką dyskusję nad ich wynikami wraz z wnioskami.

Entry No. 23

Entry type journal paper

Authors A. Kaczmarek, A. Czyżewski, B. Kostek

English title Investigating Polynomial Approximation of Spectra of the Pipe

Polish title Aproksymacja wielomianowa widma dźwięków piszczałek organowych

Journal Archives of Acoustics

Volume 24

Number 1

Pages 3 - 24

Abstract A precise method for the determination of the spectral representation of pipe sounds was introduced. The polynomial approximation of the spectral envelope was found to be an effective tool, allowing the study of differences between sounds produced by organ pipes of various types belonging to some selected instruments. The paired comparison subjective testing procedure was applied in order to assess the similarities between sounds synthesized using polynomial smoothed spectra and the original organ sound patterns. The statistical processing of test results revealed that a direct relation exists between the type of organ pipe and the minimum order of the approximating polynomial that can be used to represent the pipe sound spectrum, as determined by the positive opinions of the experts. The applied pipe organ sound recording and processing methods, subjective testing procedures and experiment results are discussed in the paper.

Entry No. 24

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński, B. Kostek, R. Królikowski

English title Rough Set Analysis of Electrostimulation Test Database for the Prediction of Post-Operative Profits in Cochlear Implanted Patients

Polish title Analiza bazy danych testów elektrostymulacji w oparciu o zbiory przybliżone w celu przewidywania pooperacyjnych korzyści u pacjentów z implantami ślimakowym

Conference 7th Int. Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing

Preprint pp. 109-117

Number

Volume

Pages

Conference site Ube, Yamaguchi, Japonia,

Conference date 9.11.1999- 11.11.1999

Abstract A new method of examining the hearing nerve in deaf people is presented. It consists in testing deaf people with a speech signal delivered via a microelectrode connected to a current source and attached to the promontory. The current delivered to the electrode is modulated with the speech signal, transposed downwards the frequency scale. A database of patients data and electrostimulation test results was created, and analyzed using a rough set method in order to find rules allowing prediction of hearing recovery of cochlear implantation candidates.

Streszczenie Przedstawiono nową metodę badania nerwu słuchowego u osób głuchych, polegającą na badaniu reakcji takich osób przy podaniu sygnału mowy poprzez mikroelektrodę, połączoną ze źródłem prądu i umieszczoną na promontorium. Prąd modulowany jest sygnałem mowy, transponowanym w dół skali częstotliwości. Utworzono bazę danych pacjentów i wyników testów elektrostymulacji, która przetworzono w oparciu o metodę zbiorów przybliżonych w celu znalezienia reguł umożliwiających predykcję polepszania słyszenia u kandydatów do implantacji.

Entry No. 25

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Neural Algorithm-Based Modelling of Some Properties of the Human Auditory System

Polish title Modelowanie pewnych właściwości systemu słuchowego człowieka w oparciu o algorytm neuronowy

Conference IV Konferencja Sieci Neuronowych i Ich Zastosowań

Preprint s. 117-122

Number

Volume

Pages

Conference site Zakopane,

Conference date 18.5.1999- 22.5.1999

Abstract A parallel algorithm derived form the standard neural net was applied to simulation of simultaneous masking of sound. Due to proposed innovations, the neural topology is more strictly related to the chosen model, and the standard model of an neuron is extended by a synaptic function for each biased input. These modifications lead to a dedicated structure for which the traditional training can be replaced by the data storage. The extended neurons model and the topology of the auditory masking-oriented net are discussed. A brief description of the chosen perceptual model is also included.

Streszczenie Na podstawie standardowej struktury neuronowej, opracowano równoległy algorytm i zastosowano do symulacji maskowania jednoczesnego dźwięku. Przez wprowadzenie, topologia proponowanej sieci jest ściślej związana z wybranym modelem psychoakustycznym. Poza tym, standardowy model neuronu został rozszerzony przez wprowadzenie funkcji synaptycznej dla każdego wejścia, które zostało obciążone wartością progową. Te modyfikacje prowadzą do dedykowanej struktury, dla której tradycyjny trening może zostać zastąpiony zapamiętywaniem danych. W referacie omówiono rozszerzony model neuronu, topologię sieci zorientowanej na system słuchowy oraz zamieszczono krótki opis wybranego modelu psychoakustycznego.

Entry No. 26

Entry type conference paper

Authors H. Skarżyński, A. Czyżewski, B. Kostek

English title Prediction of Post-Operative Profits in Cochlear Implanted Patients Using the Electricostimulation Procedure

Polish title Kwalifikacja pacjentów do operacji wszczepu ślimakowego na podstawie wyników elektroatymulacji nerwu słuchowego

Conference IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Preprint pp. 239-242

Number

Volume

Pages

Conference site New Paltz, NY, USA

Conference date 17.10.1999- 20.10.1999

Abstract The presented research is devoted to the problem of evaluation of the auditory nerve electrical sensitivity in deaf people. In the case of profound hearing impairments the assessment of the degree of hearing loss by using standard acoustic tests such as tonal or vocal audiometry, ABR testing, impedance audiometry, etc. would often conclude in a complete lack of response to an acoustic stimulus in the patient. That is why other diagnostic methods that would enable evalualion of the auditory nerve electrical sensitivity have been designed and introduced to the clinical practice. The method of speech signal transmission to the auditory nerve before cochlear implantation was conceived and tested. This method uses spectral transposition of signal delivered to the external electrode allowing to stimulate the auditory nerve.

Entry No. 27

Entry type conference paper

Authors B. Kostek, A. Czyżewski, P. Suchomski

English title Multimedia Fitting System for Hearing Impaired People.

Polish title Multimedialny system doboru protez.

Conference 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI'99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS'99), pp. 169-175,6 fig., 1 tab., 9

Preprint

Number

Volume 8

Pages

Conference site Orlando, USA,,

Conference date 31.7.1999- 5.8.1999

Abstract One of the most important stages in the recovery of hearing impaired people is the choice of n adequate hearing aid. The elaborated Multimedia Hearing Aid Fitting System (MHAFS) is an experimental software that allows to find the characterictics of a hearing aid matching patient's needs and to choose automatically a suitable hearing device. It is planned that this system will be made available in the Internet, so it can be used by anybody who is willing to experience a remote approximate testing of hearing characteristics and receive sounds processed like in some well fitted hearing aids. The key issues related to the engineered system will be presented in the paper.

Entry No. 28

Entry type conference paper

Authors A. Czyżewski, B. Kostek, S.K Zieliński

English title Common Hearing Tester and Computer-Based Audiometer

Polish title Tester słuchu i audiomets komputerowy

Conference Warsztaty Naukowo-Szkoleniowe Audiologii: Nowoczesna diagnostyka zaburzeń słuchu, Wystapienie seminaryjne - ustne

Preprint

Number

Volume

Pages

Conference site Warszawa,

Conference date 8.6.2000- 10.6.2000

Abstract Warsztaty Naukowo-Szkoleniowe Audiologii: Nowoczesna diagnostyka zaburzeń słuchu W celu pomiaru słuchu można wykorzystać odtwarzacz płyt kompaktowych lub komputer osobisty. Do celów pomiarowych wykorzystano metodę adudiometrii tonalnego oraz metodę audiometrii mowy w szumie.

Streszczenie It is possible to use CD player and/or the personal computer for hearing evaluation. The tonal audiometry and the speech in noise audiometry were employed for these purposes.

Entry No. 29

Entry type conference paper

Authors R. Królikowski, A. Czyżewski, B. Kostek

English title Localization of Sound Sources by Means of Recurrent Neural Networks

Polish title Lokalizacja źródeł dźwięku przy pomocy rekurencyjnych sieci neuronowych

Conference The Second International Conference on Rough Sets and Current Trends in Computing (RSCTC

Preprint

Number

Volume

Pages 564 - 573

Conference site Banff, Kanada

Conference date 16.10.2000- 19.10.2000

Abstract The issue of localization of sound sources for videoconferencing is discussed in the paper. A new algorithm for estimating speaker locations, based on recurrent neural networks (RNN), is introduced and described. The scheme of experiments carried out in an acoustically adopted chamber, exploiting the engineered method is detailed.

Streszczenie W referacie poruszono kwestie lokalizacji źródeł dźwięku dla potrzeb wideokonferencji. Przedstawiono i opisano nowy algorytm estymacji miejsca położenia mówcy, działający w oparciu o rekurencyjne sieci neuronowe. Ponadto, zamieszczono schemat eksperymentów przeprowadzonych w pomieszczeniu dopasowanym akustycznie i wykorzystujących opracowaną metodę.

Entry No. 30

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya, S. Zieliński

English title Determining Influence of Visual Cues on the Perception of Surround Sound Using Soft Computing

Polish title Percepcja obrazu i dźwięku w systemie dookólnym

Conference RSCTC'2000

Preprint

Number

Volume

Pages 507 - 516

Conference site Banff, Canada,

Conference date 16.10.2000- 19.10.2000

Abstract Contemporary digital video, film or multimedia presentations are often accompanied by the surround sound. Techniques and standards involved in digital video processing are much more developed than concepts underlying creating recording and mixing of the multichannel sound. The main challenge in the sound processing in the multichannel system is to create an appropriate basis for the relating multimodal context of visual and sound domains. Therefore, one of the purposes of experiments is to study in which way and how the surround sound interferes or is associated with the visual context. This kind of study was hitherto carried out when two-channel sound technique was associated with a stereo TV. However, there is not much study done yet that associates surround sound and digital video presented at the TV screen. The main issue in such experiments is the analysis of the influence of visual cues on perception of the surround sound. This problem will be solved with the application of fuzzy logic to the processing of subjective test results.

Streszczenie Referat pokazuje eksperymenty mające na celu ocenę wpływu obrazu na percypowany dźwięk w systemie dookólnym. Przedsykutowano metodykę badań oraz przedstawiono wyniki przeprowadzonych testów subiektywnych. W referacie zawarto też dyskusję sposobów obróbki uzyskanych wyników. W przeprowadzonych eksperymentach posłużóno się logiką rozmytą.

Entry No. 31

Entry type conference paper

Authors B. Kostek, A. Czyżewski, P. Odya

English title Shift in Localization of Phantom Sound Sources in Surround Sound versus Video Context

Polish title Wpływ obrazu na lokalizację źródeł pozornych w systemie dookólnym

Conference 21st Tonmeistertagung

Preprint

Number

Volume

Pages 1 - 8

Conference site Hanower, Niemcy,

Conference date 24.11.2000- 27.11.2000

Abstract Contemporary digital video, film or multimedia presentations are often accompanied by the surround sound. The visual objects displayed on the screen can affect perception of the phantom sound sources in surround panorama. Therefore, one of the purposes of experiments is to study in which way and how the surround sound interferes or is associated with the visual context. The main issue in such experiments is the analysis of the influence of visual cues on perception of the surround sound. This problem will be solved with the application of fuzzy logic to the processing of subjective test results.

Streszczenie Celem referatu było zbadanie wpływu obrazu na lokalizację źródeł pozornych w systemie dookólnym. W referacie przedstawiono założenia eksperymentów i sposób ich przeprowadzenia oraz kalibrację systemu złożonego z czterech głośników, komputera wyposażonego w kartę DVD oraz programu komputerowego symulującego zmianę położenia źródeł pozornych. Przedyskutowano również otzrymane wyniki i pdano wnioski.

Entry No. 32

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya, S. Zieliński

English title Determining Influence of Visual Cues on the Perception of Surround Sound Using Soft Computing

Polish title Percepcja obrazu i dźwięku w systemie dookólnym

Conference RSCTC'2000

Preprint

Number

Volume

Pages 1 - 10

Conference site Banff, Canada,

Conference date 16.10.2000- 19.10.2000

Abstract Contemporary digital video, film or multimedia presentations are often accompanied by the surround sound. Techniques and standards involved in digital video processing are much more developed than concepts underlying creating recording and mixing of the multichannel sound. The main challenge in the sound processing in the multichannel system is to create an appropriate basis for the relating multimodal context of visual and sound domains. Therefore, one of the purposes of experiments is to study in which way and how the surround sound interferes or is associated with the visual context. This kind of study was hitherto carried out when two-channel sound technique was associated with a stereo TV. However, there is not much study done yet that associates surround sound and digital video presented at the TV screen. The main issue in such experiments is the analysis of the influence of visual cues on perception of the surround sound. This problem will be solved with the application of fuzzy logic to the processing of subjective test results.

Streszczenie Referat pokazuje eksperymenty mające na celu ocenę wpływu obrazu na percypowany dźwięk w systemie dookólnym. Przedsykutowano metodykę badań oraz przedstawiono wyniki przeprowadzonych testów subiektywnych. W referacie zawarto też dyskusję sposobów obróbki uzyskanych wyników. W przeprowadzonych eksperymentach posłużóno się logiką rozmytą.

Entry No. 33

Entry type conference paper

Authors P. Odya, A. Kornacki, S. Zieliński, A. Czyżewski

English title Influence of Video Context on the Localization of Phantom Sound Sources

Polish title Wpływ obrazu na lokalizację pozornych źródeł dźwięku

Conference VII Sympozjum Nowości w Technice Audio

Preprint

Number

Volume

Pages 13 - 20

Conference site Warszawa,

Conference date 6.10.2000- 7.10.2000

Streszczenie We współczesnych produkcjach filmowych i multimedialnych wykorzystywany jest coraz częściej dźwięk w formacie dookólnym. Dość trudnym zagadnieniem z punktu widzenia realizacji dźwięku jest odpowiednie rozmieszczenie źródeł dźwięku w panoramie. Zawartość obrazu może mieć wpływ na lokalizację pozornych źródeł dźwięku. Stąd też celem opisanych eksperymentów było zbadanie zjawiska interakcji pomiędzy zawartością obrazu wizyjnego a lokalizacją źródeł pozornych dźwięku wokół słuchacza.

Entry No. 34

Entry type conference paper

Authors H. Skarżyński, A. Czyżewski, L. śliwa, A. Senderski, S.K. Zieliński

English title Multimedia System for Screening of Hearing via Internet

Polish title Multimedialny internetowy system powszechnych badań uszkodzeń słuchu.

Conference II Konferencja Naukowo-Techniczna "Technologie Internet i Intranet"

Preprint

Number

Volume

Pages

Conference site Kielce,

Conference date 27.10.2000- 27.10.2000

Abstract A System for Screening of Hearing via Internet was presented. Its structure and the way of working was discussed.

Streszczenie Zaprezentowano pierwszy w Polsce system do badań przesiewowych słuchu u dzieci i młodzieży. Przedstawiono założenia leżące u podstawo opracowanego systemu.

Entry No. 35

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title An Approach to the Automatic Classification of Musical Souns

Polish title Automatyczna klasyfikacja dźwięków instrumentów muzycznych

Conference 108th AES Convention

Preprint 5115

Number

Volume

Pages 1 - 33

Conference site Paris, France

Conference date 19.2.2000- 22.2.2000

Abstract A study on the automatic classification of musical instrument sounds is presented. For this purpose a large database of musical instrument sounds was built, which consists of both solo and duet stereo recordings. The classification process of musical instrument sounds is done on the basis of some soft computing techniques, such as neural networks. The results of the classification are given as a percentage of musical instrument sounds properly recognized by the system. A discussion of the system efficiency and of its limitations is presented. Conclusions and remarks concerning further development of this study are included.

Streszczenie W pracy przedstawiono założenia dotyczące automatycznego rozpoznawania dźwięków instrumentów muzycznych. Problem ten jest istotny przy automatycznym przeszukiwaniu baz komputerowych. Sparametryzowane dźwięki tworzą wektory parametrów, które są nastepnie podawane na wejście sieci neuronowej. W fazie treningu przebudowano różne konfiguracje wektorów parametrów i sieci neuronowych. W pracy zawarto wyniki automatycznej klasyfikacji uzyskane w oparciu o sieci neuronowe i podano wnioski. (20 rys., 5 tab., bibl. 19 poz.)

Entry No. 36

Entry type conference paper

Authors B. Kostek, A. Czyżewski, H. Skarżyński, J. Mazur

English title Multimedia Hearing Aids Fitting System

Polish title System dopasowania protez

Conference 4th World Automation Congress, WAC 2000

Preprint IFMIP011

Number

Volume

Pages 47 - 48

Conference site Maui, Hawaii, USA,

Conference date 11.6.2000- 15.6.2000

Abstract The application described in the paper is concerned with automatic finding the dynamic characteristics of the hearing aid matching patients needs. The multimedia computer technology makes it practical to organize hearing aid fitting basing on the computer software. Consequently, the proposed method of testing hearing abilities and finding the adequate hearing aid dynamical processing characteristics can be based entirely on multimedia computer technology.The subject of the application is the method of hearing aids fitting employing compressed speech understanding tests in noise and the way of organizing such procedure of hearing aids fit in.

Entry No. 37

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Suchomski, H. Skarżyński

English title Multimedia Hearing Aids Fitting System

Polish title Multimedialny system doboru protez słuchu

Conference Konferencja Naukowo-Techniczna Systemy i Technologie Telekomunikacji Multimedialnej STM 2000

Preprint

Number

Volume

Pages 237 - 242

Conference site Łódź,

Conference date 14.3.2000- 15.3.2000

Abstract The obtaining an optimum dynamic hearing characteristic for suitable hearing device is the main task of the elaborated Multimedia Hearing Aids Fitting System (MHAFS). To achieve the dynamic characteristic of the impaired hearing the system uses loudness scaling test results. The system creates proper compressor characteristics (used in typical hearing aids), based on the obtained hearing dynamic characteristic. Next the system tests the compressor characteristics exploiting speech in noise audiometry. The main features of the elaborated system will be presented in this paper

Streszczenie Zasadniczym zadaniem opracowywanego Multimedialnego Systemu Doboru Protez Słuchu (MSDPS) jest wyznaczenie optymalnej charakterystyki dynamiki poszukiwanego aparatu słuchowego. Punktem wyjścia do pracy systemu jest przeprowadzenie badania wrażenia narastania głośności, którego wyniki pozwalają określić charakterystykę dynamiki uszkodzonego słuchu. Na podstawie charakterystyki dynamiki słuchu system wyznacza poszukiwaną charakterystykę kompresji stosowaną w aparatach słuchowych. Oprogramowanie pozwala również przetestować wyznaczone charakterystyki na podstawie badania mowy w szumie. W niniejszym komunikacie zostaną zaprezentowane istotne zasady działanie tworzonego systemu doboru protez słuchu. (5 rys, bibl. 8 poz.)

Entry No. 38

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title Multimedia Technology Based Orientation System for Visually Impaired People

Polish title Multimedialny system orientacji dla osób niedowidzacych i niewidomych

Conference 4th World Automation Congress, WAC 2000., Abstr.,book, full-paper CD-ROM.

Preprint IFMIP012

Number

Volume

Pages 47 - 47

Conference site Maui, Hawaii, USA

Conference date 11.6.2000- 15.6.2000

Abstract The research performed by authors consisted in multimedia system that aimed at enabling people orientating in their surrounding and to avoid any kind of obstacles and theats. The latter aim may be achieved by an intelligently controlled synthesis of the acoustic field based on the digital image analysis. One of the features of such a system should be the ability to identify the location and to describe the dimension of obstacles in the environment. The idea of perceiving the "sound picture" instead of the visual one brings with itself many issues that are closely related to the research subject discussed in the paper.

Entry No. 39

Entry type conference paper

Authors G. Szwoch, B. Kostek, A. Czyżewski

English title Simulating Acoustics of Hearing Aid Employing Non Linear Signal Filtering and Waveguide Modeling

Polish title Symulacja akustyki aparatów słuchowych przy użyciunieliniowej filtracji sygnału i modelowania falowodowego

Conference 108th AES Convention

Preprint 5087

Number

Volume

Pages 1 - 15

Conference site Paris, France

Conference date 19.2.2000- 22.2.2000

Abstract A model of hearing aid is designed and used to perform computer simulations that include signal processing (amplification, filtering and compression) as well as transmitting the sound to the ear by the acoustical waveguide. The method and some results of simulations are presented applicable to the process of fitting the hearing aid to the individual patients needs.

Streszczenie W referacie przedstawiono cyfrowy model protezy słuchu, w oparciu o który przeprowadzono szereg symulacji charakterystyk akustycznych. Zaimplementowane algorytmy wykorzystują nieliniową filtrację sygnału i modelowanie falowodowe. Wykorzystywany model może być pomocny przy projektowaniu elementów akustycznych protezy słuchu o zadanych właściwościach. ( 9 rys., 11 poz. bibl.)

Entry No. 40

Entry type conference paper

Authors A. Czyżewski, A. Kornacki, B. Kostek, P. Odya, S. Zieliński

English title Influence of visual cues on the perception of surround sound

Polish title Korelacja wzrokowo-słuchowa w systemach dookólnych

Conference 139th Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer.

Preprint 3aPP14

Number 5

Volume 107

Pages 2851

Conference site Atlanta, GA, USA,

Conference date 30.5.2000- 3.6.2000

Abstract Contemporary digital video, film or multimedia presentations are often accompanied by the surround sound. Techniques and standards involved in digital video processing are much more developed than concepts underlying creating recording and mixing of the multichannel sound. The main challenge in the sound processing in the multichannel system is to create an appropriate basis for connecting multimodal context of visual and sound domains. Therefore one of the purposes of experiments is to study in which way and how the surround sound interfere or is associated with the visual context. This kind of study was hitherto carried out when two-channel sound technique was associated with a stereo TV. However, there is not much study done yet that associate surround sound and digital video presented at the TV screen. The main issue in such experiments is the analysis of the influence of visual cues on perception of the surround sound. This problem will be addressed in the paper.

Streszczenie Obecne produkcje multimedialne często zą wyposażane w dźwięk dookólny. Ze względu na to, że obraz wpływa na subiektywnie postrzeganą panoramę dźwięku dookólnego, zjawisko to należy poddać dokładnym badaniom. (str. ..., 4 rys., 1 tab., 9 poz. bibl.)

Entry No. 41

Entry type conference paper

Authors B. Kostek, A. Czyżewski, H. Skarżyński, J. Mazur

English title Multimedia Hearing Aids Fitting System

Polish title System dopasowania protez

Conference 4th World Automation Congress, WAC 2000., Abstr.,book, full-paper CD-ROM Proc.

Preprint

Number

Volume

Pages 47 - 47

Conference site Maui, Hawaii, USA

Conference date 11.6.2000- 15.6.2000

Abstract The application described in the paper is concerned with automatic finding the dynamic characteristics of the hearing aid matching patients needs. The multimedia computer technology makes it practical to organize hearing aid fitting basing on the computer software. Consequently, the proposed method of testing hearing abilities and finding the adequate hearing aid dynamical processing characteristics can be based entirely on multimedia computer technology.The subject of the application is the method of hearing aids fitting employing compressed speech understanding tests in noise and the way of organizing such procedure of hearing aids fit in.

Entry No. 42

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Suchomski

English title Expert System for Hearing Aids Fitting

Polish title Automatyczny dobór charakterystyk protezy słuchowej w oparciu o system ekspercki

Conference 108th AES Convention

Preprint 5094

Number

Volume

Pages 1 - 14

Conference site Paris, France

Conference date 19.2.2000- 22.2.2000

Abstract The engineered experimental software allows to find the characteristics of a fearing aid matching patients needs and to choose automatically a suitable hearing device characteristics. The key issues related to the engineered application are based on the expert system implementation. This expert system uses both fuzzy logic and rough set processing of analytical data. The principles of the engineered expert system application and some details of the rough set and fuzzy logic implementation will be presented in the paper.

Streszczenie W referacie opisano założenia eksperymentalnego systemu eksperckiego, pozwalającego na automatyczny dobór charakterystyk protezy słuchowej. W systemie zaimplementowano moduł testujący różne charakterystyki elektronicznej protezy słuchu w oparciu o podawany sygnał mowy w szumie. Opracowany system wykorzystuje algorytmy pracujące w oparciu o wnioskowanie rozmyte i analizę danych w oparciu o metodę zbiorów przybliżonych.(11 rys., bibl. 12 poz.)

Entry No. 43

Entry type conference paper

Authors A. Czyżewski, A. Senderski, B. Kostek, S.K Zieliński, P. Klimek

English title Testing Hearing with Noisy Speech Employing Multimedia Computers

Polish title Testowanie słuchu za pomocą sygnału mowy generowanego w systemie multimedialnym

Conference Warsztaty Naukowo-Szkoleniowe Audiologii: Nowoczesna diagnostyka zaburzeń słuchu, Wystapienie seminaryjne - ustne

Preprint

Number

Volume

Pages

Conference site Warszawa,

Conference date 8.6.2000- 10.6.2000

Abstract Warsztaty Naukowo-Szkoleniowe Audiologii.

Streszczenie Warsztaty Naukowo-Szkoleniowe Audiologii.

Entry No. 44

Entry type journal paper

Authors A. Czyżewski, R. Królikowski

English title Neuro-Rough Control of Masking Thresholds for Audio Signal Enhancement

Polish title Sterowanie poziomami progów maskowania dla potrzeb redukcji szumu. Journal of Neurocomputing

Journal

Volume w druku

Number

Pages 1 - 26

Abstract The paper addresses the problem of neuro-rough hybridisation applied to non-stationary noise reduction. The goal of the intelligent controller is to estimate the current statistics of corrupting noise on the basis of the analysis of signals taken from telecommunication channel. Thereafter, the noise estimate enables determining the masking threshold levels which allow making the noise inaudible in the audio. Since the implemented decision algorithm requires quantised data, thus the Kohonens self-organising maps extended by various distance metrics were used as data quantisers. Some results of the experiments in the domain of non-stationary noise reduction in speech are discussed in the paper. (Journal of Neurocomputing)

Streszczenie (str. 26, rys. 7, tab. 7, bibl. 46 poz.)

Entry No. 45

Entry type conference paper

Authors A. Czyżewski, S.K. Zieliński, R. Królikowski

English title Intelligent methods of noise and echo reduction in audio signals

Polish title Inteligentne metody redukcji szumu i echa w sygnale fonicznym

Conference Konferencja Nukowo-Techniczna: Systemy i Technologie Telekomunikacji Multimedialnej (STM2000)

Preprint

Number

Volume

Pages 279 - 284

Conference site Łódź,

Conference date 14.3.2000- 15.3.2000

Abstract Artificial intelligence algorithms (neural nets, fuzzy logic, rough sets, genetic algorithms) can be applied to noise and echo reduction. Results of experiments cencerning applications of the mentioned alogrithms are presented.

Streszczenie Algorytmy sztucznej inteligencji (sieci neuronowe, logika rozmyta, zbiory przybliżone, algorytmy genetyczne) mogą być zastosowane do tłumienia echa i szumu w sygnałach fonicznych. W pracy przedstawiono nową metodę redukcji echa i niestacjonarnego szumu zakłócającego sygnał foniczny. (str. 6, rys. 4, tab. 0, bibl. 6 poz.)

Entry No. 46

Entry type journal paper

Authors A. Czyżewski, R. Królikowski

English title Neuro-Rough Control of Masking Thresholds for Audio Signal Enhancement

Polish title Sterowanie poziomami progów maskowania dla potrzeb redukcji szumu

Journal

Volume 36

Number 1-4

Pages 5 - 27

Abstract The paper addresses the problem of neuro-rough hybridisation applied to non-stationary noise reduction. The goal of the intelligent controller is to estimate the current statistics of corrupting noise on the basis of the analysis of signals taken from telecommunication channel. Thereafter, the noise estimate enables determining the masking threshold levels which allow making the noise inaudible in the audio. Since the implemented decision algorithm requires quantised data, thus the Kohonen’s self-organising maps extended by various distance metrics were used as data quantisers. Some results of the experiments in the domain of non-stationary noise reduction in speech are discussed in the paper.

Streszczenie Artykuł niniejszy odnosi się do problemu hybrydyzacji systemów opartych na zbiorach przybliżonych oraz przetwarzaniu neuronowym dla potrzeb redukcji szumu niestacjonarnego. Zadaniem inteligentnego kontrolera jest oszacowanie aktualnej statystyki zakłócającego szumu na podstawie analizy sygnału w kanale telekomunikacyjnym. Estymata szumu służy następnie do wyznaczenia poziomów maskowania w sygnale, a następnie do redukcji szumu. Dla potrzeb kwantyzacji zastosowano samoorganizujące się mapy Kohonena, które zostały zmodyfikowane poprzez dodanie różnych metryk. W artykule zamieszczono szczegółowy opis opracowanego systemu, a także przedstawiono przykładowe wyniki przeprowadzonych eksperymentów.

Entry No. 47

Entry type conference paper

Authors A. Czyżewski, B. Kostek, S. Zieliński

English title Waveguide Modeling of Ancient, Japanese Musical Instruments

Polish title Modelowanie fizyczne dawnych instrumentów japońskich

Conference ISMA'2001

Preprint

Number

Volume

Pages

Conference site Perugia, Italy

Conference date 9.2001- 9.2001

Abstract Problems related to the implementation of physical modeling-based synthesis of two traditional Japanese instruments are discussed. Examples of computer analyses of sounds of shakuhachi and koto are presented. On the basis of these analyses some assumptions concerning waveguide models were made. Physical modeling principles of musical instrument sounds generation were also shortly reviewed. Main differences in modeling wind and string instruments were highlighted. The process of constructing models of these two musical instruments was explained. A short discussion concerning problems occurred while creating such models was given. Some general conclusions concerning real-time implementation of the digital waveguide models were also included.

Streszczenie W referacie przedyskutowano problemy związane z konstrukcją modelu fizycznego wybranych instrumentów japońskich. Przeprowadzono szereg analiz dźwięków naturalnych pochodzących z shakuhachi i koto. Nastepnie zaprojektowano modele fizycznych tych instrumentów i przeprowadzono analizę uzyskanych dźwięków. Podano wnioski.

Entry No. 48

Entry type conference paper

Authors B. Kostek, A. Czyżewski, H. Skarżyński, K. Kochanek

English title Internet-Based Automatic Hearing Assessment System

Polish title Internetowy system badania słuchu

Conference 46 Internationales Wissenschaftliches Kolloquium Ilmenau

Preprint

Number

Volume

Pages 87 - 89

Conference site Ilmeanu, Germany

Conference date 24.9.2001- 27.9.2001

Abstract The aim of this paper is to show the new media application to the domain of health care. In the paper the Internet-based system that allows for automatic testing of hearing is described. Hearing impairment is one of the fastest growing diseases of modern society. Therefore it is very important to organize mass screening tests to identify people suffering from this kind of impairment. The described application provides a test that uses automatic questionnaire analysis, audiometric tone test procedures, and assesses speech intelligibility in noise. When all the testing is completed, the system automatically analyzes the results for each person examined. Based on the number of incorrect answers, the decision is made automatically by the expert system: does the person have normal hearing or does he or she have hearing problems and requires to be examined in one of the consulting centers? Those whose hearing impairment is confirmed are referred to treatment in rehabilitation centers. All these centers are connected via the Internet and are provided with special distributed database access allowing them to automatically register and track the patient discovered during the remote screening.

Streszczenie Celem referatu jest prezentacja możliwości wykorzystania multimediów w medycynie. Opisano w nim system internetowy umożliwiający automatyczne badanie słuchu. Wady słuchu stanowią jedną z najszybciej postępujących chorób we współczesnym społeczeństwie. W tym świetle bardzo ważnym staje się umożliwienie przeprowadzania masowych testów wykrywających ubytki słuchu. Przedstawiona aplikacja zawiera audiometryczny test tonalny, test ilustrowany dla dzieci oraz test rozumienia mowy w szumie. Po zakończeniu testów system automatycznie analizuje wyniki dla każdej badanej osoby. Osoby z wykrytą wadą słuchu kierowane są do specjalistycznych centrów rehabilitacyjnych na dalsze badania. Ośrodki te są połączone przy pomocy łączy internetowych z bazą danych systemu "Słyszę...".

Entry No. 49

Entry type conference paper

Authors A. Czyżewski, R. Królikowski, B. Kostek

English title Neural Networks Applied to Sound Source Localization

Polish title Zastosowanie sieci neuronowych do lokalizacji źródeł dźwięku

Conference 110th Audio Engineering Society Convention

Preprint 5375

Number

Volume

Pages

Conference site Amsterdam, Netherlands

Conference date 12.5.2001- 15.5.2001

Abstract The primary aim of this paper is to show that it is possible to localise the direction of the incoming acoustical signal based on the neural network trained for that purpose. Consequently, the automatically localised acoustical signal may be attenuated if it obscures the desired target sound. A set of parameters was formulated in order to localise target source and unwanted signals. In order to process acoustical signals incoming from various directions at the same time the neural network-based system was designed and implemented. The feature extraction method is thoroughly discussed, the training process is described and recently obtained results are discussed.

Streszczenie Podstawowym celem referatu jest pokazanie, że jest możliwa lokalizacja kierunku nadchodzącego sygnału akustycznego w oparciu o odpowiednio wytrenowane sieci neuronowe. W tym celu sformułowano zbiór parametrów oraz zaprojektowano i zaimplementowano stosowną sieć neuronową. W referacie przedstawiono proces parametryzacji oraz przedyskutowano uzyskane wyniki eksperymentów.

Entry No. 50

Entry type conference paper

Authors A. Czyżewski, J. Jaroszuk, B. Kostek

English title Digital Waveguide Models of the Panpipes

Polish title Model fizyczny fletni Pana

Conference ISMA’2001

Preprint

Number

Volume

Pages

Conference site Perugia, Italy

Conference date 9.2001- 9.2001

Abstract The principal aim of this paper is to present a digital waveguide model of the Panpipes. For the efficient modeling of the Panpipes instrument its structure and its physics were studied and thoroughly discussed. The acquired knowledge was then used during the construction of the model. In this context principles of the digital waveguide modeling of woodwind instruments are shortly reviewed. Because of the simplicity of designing the digital waveguide as a set of delay lines and scattering junctions the model can be easily implemented to a digital signal processor. In the paper two digital waveguide models of the Panpipes instruments were presented. They differ from each other by their complexity. This was due to examining the influence of decreasing the complexity of the model on the synthetic sound quality. The performed subjective tests resulted in showing that introduced simplifications in digital waveguide models reveal no noticeable influence on the sound quality. A comparison between synthetic and real Panpipes sounds was made. The results of both subjective tests and objective analyses obtained using engineered models of Panpipes are also included in the paper. Conclusions are derived.

Streszczenie Celem referatu jest przybliżenie zagadnień związanych z modelowaniem fizycznym wybranych instrumentów dętych. W referacie przedstawiono dwa modele fizyczne fletni Pana, różniące się stopniem skomplikowania i jakością otrzymanego dźwięku syntetycznego. Dokonano wszechstronnych analiz dźwięków otrzymanych w modelach i porównano je z dźwiękiem naturalnym. Dodatkowo przeprowadzono serię testów subiektywnych, które potwierdziły, że skonstruowane modele pozwalają na otrzymanie dźwięku zbliżonego do dźwięku naturalnego fletni Pana.

Entry No. 51

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Multimedia Techniques Applied to Health Care Procedures- Hearing Aid Fitting expert System

Polish title Wykorzystanie technik multimedialnych w medycynie - System doboru aparatów słuchowych

Conference 46 Internationales Wissenschaftliches Kolloquium

Preprint

Number

Volume

Pages 85 - 87

Conference site Ilmenau, Germany

Conference date 24.9.2001- 27.9.2001

Abstract In this paper an exemplary implementation of the complex multimedia system in the domain of the health care and its integration to the user environment is shown. The engineered Multimedia Hearing Aid Fitting Expert System is an experimental software program that allows finding automatically characteristics of a hearing aid matching patients needs. The fitting of the hearing aids is based either on classical methods that use audiometric test results or on loudness scaling principles. All these methods are based on artificial test signals. However, the fitting of hearing aids should be performed on the basis of testing speech understanding in noise. A satisfying reliability of these tests may be achieved through the use of modern computer technology, properly calibrated. The principles of the engineered software application, some details of the calibration process, and results of the experiments will be presented in the paper.

Streszczenie Celem referatu jest przedstawienie multimedialnego systemu wspomagającego dobór protez słuchowych. Aplikacja ta umożliwia automatyczne określanie optymalnych dla pacjenta charakterystyk protez słuchowych. Proces dopasowywania oparty na klasycznych metodach audiometrycznych lub zasadach skalowania głośności wykorzystuje generowane sygnały testowe. Dobór protezy słuchowej powinien być jednak oparty na teście rozumienia mowy w szumie. Wierność takiego testu może zostać osiągnięta dzięki użyciu współczesnej techniki komputerowej przy odpowiedniej kalibracji interfejsu użytkownika. W referacie przedstawiono opis zaimplementowanego systemu, zasadę jego kalibracji oraz uzyskane wyniki.

Entry No. 52

Entry type conference paper

Authors A. Kornacki, B. Kostek, P. Odya, A. Czyżewski

English title Problems Related to Surround Sound Production

Polish title Problemy realizacji dźwięku w systemach dookólnych

Conference 110th AES Convention

Preprint 5374

Number

Volume

Pages

Conference site Amsterdam, Netherlands

Conference date 12.5.2001- 15.5.2001

Abstract The problem of production of recordings designated for sound surround systems becomes a vital problem in sound technology. Existing standards of surround systems allow for reproduction of spatial sound. However, there are no consistent recommendations as to which microphone and mixing technique could be used in specific situations. For the purpose of research presented in this paper several microphone techniques were used for recordings of a quartet playing classical music. The mixing results in two-channel excerpts and several multichannel ones designated for 5.1 reproduction system. Then, in order to find the most preferable recording technique these excerpts were used in subjective tests.

Streszczenie Współczesne media zapisu dźwięku pozwalają na rejestrację i odtwarzanie dźwięku w wielokanałowych formatach dookólnych, np. w formacie 5.1. Możliwości te wymagają jednak opracowania odpowiednich metod realizatorskich. Dotyczy to zarówno technik mikrofonowych, jak również sposobu tworzenia panoramy dźwiękowej. W pracy przedstawiono porównanie kilku metod realizacji nagrań dookólnych.

Entry No. 53

Entry type conference paper

Authors A. Czyżewski, S. Zieliński

English title Dereverberation Based on the Genetic Algorithm

Polish title Usuwanie pogłosu przy użyciu algorytmów genetycznych

Conference 17th International Congress on Acoustics

Preprint 6A.06.01

Number

Volume

Pages 264

Conference site Rome, Italy

Conference date 2.9.2001- 7.9.2001

Abstract In this paper, a new method of echo cancellation is proposed applicable to some telecommunication systems. This method is based on the application of the reverse model of a system causing echo. Parameters of such a model are optimized using the genetic algorithm. Some exemplary results of echo cancellation obtained with the use of the proposed method are discussed.

Streszczenie W artykule zaproponowano nową metodę usuwania echa, mającą zastosowanie w systemach telekomunikacyjnych. Metoda oparta jest na zastosowaniu systemu odwrotnego do tego, który powoduje powstawanie echa. Parametry tego modelu zostały zoptymalizowane przy użyciu algorytmów genetycznych. Przedstawiono i przedyskutowano przykładowe wyniki usuwania echa uzyskane przy uzyciu proponowanych metod.

Entry No. 54

Entry type conference paper

Authors A. Czyżewski, A. Kornacki, G. Szwoch, B. Kostek

English title Simulation of the Reverberant Space in the Multichannel Audio Using the Convolution Method

Polish title Symulacja pogłosu w technice wielokanałowej przy użyciu metody splotu

Conference 17th International Congress on Acoustics

Preprint 4D.09.04

Number

Volume

Pages 163

Conference site Rome, Italy

Conference date 2.9.2001- 7.9.2001

Abstract The convolution method is commonly used to simulate the reverberant space by convolving monophonic or stereophonic sounds with the impulse responses of the room.In this paper,application of this method to the multichannel audio is proposed. The impulse responses of the real room were recorded.Each of the audio channels was obtained using the convolution of the adequate room impulse response with monophonic source sound.The results of the convolution were then combined and encoded as the multichannel surround audio in the format 5.1. The time and spectral analyses of the resulting sounds,as well as the listening tests were performed.The results of these experiments are presented and discussed in the paper. The presented method allows one to simulate the acoustical conditions of the room where the monophonic audio was acquired. Possible applications of this method include advanced Internet teleconferencing in which the bandwidth requirements may be decreased by transmitting only monophonic sounds and the impulse responses of the room instead of the whole multichannel audio.

Streszczenie Metoda splotu jest powszechnie stosowana w celu zasymulowania warunków pogłosowych poprzez splot sygnału monofonicznego lub stereofonicznego z odpowiedzią impulsową pomieszczenia. W artykule zaproponowano zastosowanie tej metody w technice dźwięku wielokanałowego. Zarejestrowano odpowiedzi impulsowe rzeczywistych pomieszczeń. Każdy z kanałów dźwięku został otrzymany przez splot odpowiedniej odpowiedzi impulsowej pomieszczenia z monofonicznym sygnałem źródłowym. Wyniki splotu zostały następnie połaczone i zakodowane w formacie dźwięku wielokanałowego 5.1. Przeprowadzono analizy czasowe i widmowe otrzymanych dźwięków oraz testy odsłuchowe. Wyniki eksperymentów zostały przedstawione i przedyskutowane w niniejszym artykule. Przedstawiona metoda umożliwia symulację warunków akustycznych pomieszczenia, w którym zarejestrowano dźwięk monofoniczny. Możliwe zastosowania tej metody to zaawansowane techniki telekonferencyjne w Internecie, w których możliwe będzie zmniejszenie wymagań dotyczacych przepustowości łaczy, poprzez transmisję wyłącznie dźwięku monofonicznego oraz odpowiedzi impulsowych pomieszczenia zamiast pełnego dźwięku wielokanałowego.

Entry No. 55

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Automatic Identification of Sound Source Direction Based on Neural Networks

Polish title Automatyczna identyfikacja kierunku źródła dźwięku w oparciu o sieci neuronowe

Conference 142nd Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer.

Preprint 4aSP9

Number 5

Volume 110

Pages 2741

Conference site Fort Lauderdale, USA

Conference date 3.12.2001- 7.12.2001

Abstract In this paper a method for automatic detection of sound source was studied. Both standard feed-forward- and recurrent neural networks were employed in that method. Comparison of the results obtained is given. Conclusions are also derived.

Streszczenie W niniejszym referacie przeanalizowano metodę automatycznego wykrywania kierunku źródeł dźwięku. Zarówno standardowe jednokierunkowe sieci neuronowe jak i rekurencyjne zostały zastosowane w tek metodzie. W referacie podano uzyskane w eksperymentach wyniki oraz przedstawiono wnioski.

Entry No. 56

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title A method for the automatic hearing aid fitting employing speech in noise

Polish title System ekspercki do doboru protez

Conference 142nd Acoustical Soc. of America Meeting

Preprint 2pPP10

Number 5

Volume 110

Pages 2680

Conference site Fort Lauderdale, USA

Conference date 3.12.2001- 7.12.2001

Abstract Some limitations of the hearing aid fitting process are discussed. The classical procedures in this process are based on audiometric test results and/or the loudness scaling method employing artificial test signals. However, the fitting of hearing aids should be also performed on the basis of testing speech understanding in noise, because this is much closer to the real life conditions. A satisfying reliability of these tests may be achieved through the use of modern computer technology with an application of a properly calibrated sound system. A new strategy applicable to fitting prostheses was developed. It allows finding automatically characteristics of a hearing aid matching patients needs. The principles of the fitting method employing fuzzy reasoning, and some results of the experiments will be presented in the paper.

Streszczenie W referacie przedstawiono problemy związane z procesem doboru protez. Zaprojektowano multimedialny system wspomagający dobór protez słuchowych. System ten umożliwia automatyczne określenie optymalnych dla pacjenta charakterystyk protez słuchowych.

Entry No. 57

Entry type conference paper

Authors A. Czyżewski, R. Królikowski

English title Acquisition of Acoustic Signals Assisted by Recurrent Neural Networks

Polish title Pozyskiwanie sygnałów akustycznych wspomagane przez rekurencyjne sieci neuronowe

Conference 17th International Congress on Acoustics

Preprint

Number

Volume CD-ROM

Pages

Conference site Rzym, Włochy

Conference date 2.9.2001- 7.9.2001

Abstract The issue of localisation of sound sources for videoconferencing is addressed in the paper, where a new method for estimating speaker locations is introduced. It is based on exploitation of temporal relationships between signals received by an array of microphones, and thereby recurrent neural networks are employed. Additionally, a parametrisation of the time-domain audio signals prior to the neural processing is performed. Some of the results of the experiments are briefly presented in the paper.

Streszczenie W referacie poruszono zagadnienie lokalizacji źródeł dźwięku dla potrzeb wideokonferencji, wraz z propozycją nowej metody oszacowania położenia mówcy. Metoda ta oparta jest na wykorzystaniu czasowych zależności pomiędzy sygnałami odebranymi przez matrycę mikrofonów i z tego powodu zastosowano rekurencyjne sieci neuronowe. Ponadto przed przetwarzaniem neuronowym zastosowano parametryzację sygnału w dziedzinie czasu. W referacie zamieszczono niektóre wyniki przeprowadzonych eksperymentów.

Entry No. 58

Entry type book

Authors A. Czyżewski, B. Kostek, P. Odya, S. Zieliński

English title Determining Influence of Visual Cues on the Perception of Surround Sound Using Soft Computing

Polish title Badanie wpływu treści obrazu wizyjnego na percepcję dźwięku z wykorzystaniem soft computingu

Editor Series: Lecture Notes in Computer Science, vol. 2005, Springer-Verlag

Pages 545 - 552

Abstract The main challenge in the sound processing in the multichannel system is to create an appropriate basis for the relating multimodal context of visual and sound domains. Therefore, one of the purposes of experiments is to study in which way and how the surround sound interferes or is associated with the visual context. This kind of study was hitherto carried out when two-channel sound technique was associated with a stereo TV

Streszczenie Opisano przebieg i wyniki eksperymentów w dziedzinie badania wpływu treści obrazu na percepcję dźwięku w systemach stereofonii dookólnej.

Entry No. 59

Entry type book

Authors R. Królikowski, A. Czyżewski , B. Kostek

English title Localization of Sound Sources by Means of Recurrent Neural Networks

Polish title Lokalizacja źródeł dźwięku za pomocą rekurencyjnych sieci neuronowych

Editor Series: Lecture Notes in Computer Science, vol. 2005, Springer-Verlag

Pages 603 - 610

Abstract The issue of localization of sound sources for videoconferencing is discussed in the paper. A new algorithm for estimating speaker locations, based on recurrent neural networks (RNN), is introduced and described. The scheme of experiments carried out in an acoustically adopted chamber, exploiting the engineered method is detailed.

Streszczenie Przedyskutowano problematyke lokalizacji dźwięku dla potrzeb wideokonferencji. Zaprezentwano nowy algorytm estymacji położenia mówcy, oparty na wykorzystaniu rekurencyjnych sieci neuronowych. Omówiono wyniki eksperymentów, wykorzystujące materiał dźwiękowy przygotowany w komorze bezechowej.

Entry No. 60

Entry type conference paper

Authors A. Czyżewski

English title The Internet Sound Restoration Service Based on the Perceptual Denoising Method

Polish title Internetowy system rekonstrukcji nagrań oparty na perceptualnej metodzie redukcji szumów

Conference 20th Audio Eng. Soc. International Conference

Preprint

Number

Volume

Pages 162 - 167

Conference site Budapest, Hungary

Conference date 5.10.2001- 7.10.2001

Abstract The Internet service was launched intended to on-demand restoration and publishing of audio content related to world's cultural heritage. A special way of acquiring, processing and publishing archive recordings was conceived in order to ensure a proper dissemination of the proposed service and its long-term maintenance. The sound enhancement method underlying the system operation employs the extended perceptual coding of audio material allowing for simultaneous noise reduction and sound compression. Moreover, the non-linear predictor employing neural networks was applied to the detection and removal of impulse distortions. The system is still in the development phase, thus both: system features implemented already and technical assumptions related to its further development are presented in the paper.

Streszczenie Uruchomiono Internetowy system rekonstruwania nagrań archiwalnych. W tym celu opracowano kocepcję sposobu akwizycji i upowszechniania nagrań archiwalnych. System umożliwia usuwanie szumu z nagrań przy wykorzystaniu algorytmu opartego na kompresji perceptualnej oraz usuwanie zakłóceń impulsowych przy wykorzystaniu neuronowej predykcji nieliniowej.

Entry No. 61

Entry type conference paper

Authors A. Czyżewski, B. Kostek, K. Kochanek, J. Mazur, P. Odya, H. Skarżyński

English title

Polish title Masowe badania przesiewowe słuchu, wzroku, mowy i szumów usznych przy wykorzystaniu komputerów

Conference V Koszalińska Konferencja Naukowo-Techniczna

Preprint

Number

Volume

Pages 9 - 18

Conference site Kołobrzeg,

Conference date 5.12.2001- 7.12.2001

Entry No. 62

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya, T. Smoliński

English title Discovering the Influence of Visual Stimuli on The Perception of Surround Sound Using Genetic Algorithms

Polish title Badanie wpływu obrazu na percepcję dźwięku dookólnego z wykorzystaniem algorytmów genetycznych

Conference 19th International AES Conference

Preprint

Number

Volume

Pages 287 - 294

Conference site Schloss Elmau, Germany

Conference date 21.6.2001- 24.6.2001

Abstract The paper contains a description of experiments that aim to determine visual cue influence on the perception of spatial sound. Earlier stage of the carried out experiments showed that there exists a relationship between the perception of video presented in the screen and sound signals reproduced in a surround system. However, this relationship is dependent on the type of audio-visual signals. Thus a series of subjective tests has been performed on dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of the surround sound. This problem is solved with the application of genetic algorithms to the processing of subjective test results. Conclusions concerning the complexity of the investigated problem are included.

Streszczenie Niniejszy artykuł zawiera opis eksperymentów, które miały na celu wykazanie wpływu obrazu na postrzeganie dźwięku przestrzennego. Wcześniejsze eksperymenty wykazały bowiem istnienie zależności pomiędzy obrazem i dźwiękiem w systemie dźwięku dookólnego. Niemniej jednak związek ten zależy od rodzaju sygnałów audio-wizyjnych. Wykonano serię subiektywnych eksperymentów na wielu ekspertach w celu zbadania tych zależności. Podstawowym problemem tego typu eksperymentach jest analiza uzyskanych wyników. Problem ten rozwiązano z wykorzystaniem algorytmów genetycznych. Artykuł zawiera także wnioski dotyczące złożoności badanego problemu.

Entry No. 63

Entry type journal paper

Authors G. Szwoch, B. Kostek, A. Czyżewski

English title Computer Modeling of Acoustical Elements of a Hearing Aid

Polish title Komputerowe modelowanie akustycznych elementów aparatów słuchowych

Journal Archives of Acoustics

Volume 26

Number 3

Pages 203 - 213

Abstract In this paper, application of computer modeling methods to the process of hearing aid fitting is described. A computer model of the acoustical system of a hearing aid is presented. Exemplary results of the experiments are presented and compared with measurement data. The model proved to behave similarly to the physical system. Further improvements to the model are discussed.

Streszczenie W artykule opisano wykorzystanie metod modelowania komputerowego w procesie doboru aparatu słuchowego. Przedstawiono komputerowy model akustycznego układu aparatu słuchowego. Zamieszczono wyniki przykładowych eksperymentów i porównano te wyniki z danymi pomiarowymi. Wpływ zmian parametrów na charakterystyki modelu był podobny jak w przypadku rzeczywistego systemu. Przeprowadzono dyskusję koniecznych rozszerzeń modelu.

Entry No. 64

Entry type conference paper

Authors A. Czyżewski, R. Królikowski, B. Kostek

English title Encoding Spatial Information for Advanced Teleconferencing

Polish title Kodowanie informacji przestrzennej dla potrzeb zaawansowanej telekonferencji

Conference 19th International AES Conference

Preprint

Number

Volume

Pages 309 - 322

Conference site Schloss Elmau, Germany

Conference date 21.6.2001- 24.6.2001

Abstract The aim of this paper is to show a system that enables automatic identification of a sound source position in noisy acoustical conditions with a considerable accuracy. Automatic detection of sound source in such an acoustical environment is much needed in advanced teleconferencing. The approach shown in the paper is based on Artificial Neural Networks (ANNs) used for automatic sound localisation. Both standard feed-forward ANNs and Recurrent Neural Networks (RNNs) are employed for that purpose. Comparison of the results obtained, based on both types of ANNs, is also given. Conclusions are derived and shown.

Streszczenie W referacie pokazano system umożliwiający automatyczną identyfikację pozycji źródła dźwięku w zaszumionych akustycznych warunkach, co jest pożądane w przypadku telekonferencji. Opracowane rozwiązania bazują na sztucznych sieciach neuronowych, zarówno jednokierunkowych jak i rekurencyjnych. W referacie zamieszczono porównanie obu podejść oraz wnioski.

Entry No. 65

Entry type conference paper

Authors P. Odya, A. Czyżewski, B. Kostek

English title Determination of Influence of Visual Cues on Perception of Spatial Sound

Polish title Badanie wpływu obrazu na percpecję dźwięku dookólnego

Conference 110th Audio Eng. Soc. Conv.

Preprint 5311

Number

Volume

Pages

Conference site Amsterdam, Netherlands

Conference date 12.5.2001- 15.5.2001

Abstract The paper contains a description of experiments that aim to determine visual cue influence on the perception of spatial sound. Earlier stage of the carried out experiments showed that there exists a relationship between the perception of video presented in the screen and sound signals reproduced in a surround system. However, this relationship is dependent on the type of audio-visual signals. Thus a series of subjective test has been performed on dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of the surround sound. Conclusions concerning the complexity of the investigated problem are included.

Streszczenie Niniejszy artykuł zawiera opis eksperymentów, które miały na celu wykazanie wpływu obrazu na postrzeganie dźwięku przestrzennego. Wcześniejsze eksperymenty wykazały bowiem istnienie zależności pomiędzy obrazem i dźwiękiem w systemie dźwięku dookólnego. Niemniej jednak związek ten zależy od rodzaju sygnałów audio-wizyjnych. Wykonano serię subiektywnych eksperymentów na wielu ekspertach w celu zbadania tych zależności. Podstawowym problemem tego typu eksperymentach jest analiza uzyskanych wyników. Artykuł zawiera także wnioski dotyczące złożoności badanego problemu.

Entry No. 66

Entry type conference paper

Authors T. Fidecki, J. Adamczyk, A. Czyżewski, A. Kornacki, T. Lida, B. Okoń-Makowska

English title

Polish title Ad Laudes - realizacja koncertu internetowego

Conference ISSET'2001

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 18.10.2001- 20.10.2001

Entry No. 67

Entry type conference paper

Authors T. Fidecki, J. Adamczyk, A. Czyżewski, A. Kornacki, T. Lida, B. Okoń-Makowska

English title

Polish title Ad Laudes - realizacja koncertu internetowego

Conference ISSET'2001

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 18.10.2001- 20.10.2001

Entry No. 68

Entry type conference paper

Authors P. Odya, A. Czyżewski, B. Kostek, T. Smolinski

English title Determining the influence of visual stimuli on the peception of surround sound using data mining algorithms

Polish title Badanie wpływu obrazu na dźwięk w systemach dookólnych z wykorzystaniem algorytmów sztucznej inteligencji

Conference 142nd Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer.

Preprint 2pPP3

Number 5

Volume 110

Pages 2679

Conference site Fort Lauderdale, USA

Conference date 3.12.2001- 7.12.2001

Abstract A short description of experiments that aim to determine visual cues influence on the perception of spatial sound is provided in the paper. The earlier stage of the carried out experiments showed that there exists a relationship between the perception of video presented in the screen and sound signals reproduced in a surround system. However, this relationship is dependent on the type of audio–visual signals. Thus a series of subjective tests has been performed on dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of the surround sound. This problem is solved with the application of genetic algorithm and rule searching mechanism to the processing of subjective test results. Some results and conclusions concerning the complexity of the investigated problem are included.

Streszczenie Niniejszy referat zawiera krótki opis eksperymentów, które miały na celu wykazanie wpływu obrazu na postrzeganie dźwięku przestrzennego. Wcześniejsze eksperymenty wykazały bowiem istnienie zależności pomiędzy obrazem i dźwiękiem w systemie dźwięku dookólnego. Niemniej jednak związek ten zależy od rodzaju sygnałów audio-wizyjnych. Wykonano serię subiektywnych eksperymentów na wielu ekspertach w celu zbadania tych zależności. Podstawowym problemem tego typu eksperymentach jest analiza uzyskanych wyników. Problem ten rozwiązano z wykorzystaniem algorytmów genetycznych. Artykuł zawiera także wyniki eksperymentów i wnioski dotyczące złożoności badanego problemu.

Entry No. 69

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Employing Fuzzy Logic and Noisy Speech for Automatic Fitting of Hearing Aids

Polish title System doboru protez oparty o wnioskowanie rozmyte

Conference 142nd Acoustical Soc. of America Meeting, , J. Acoust. Soc. Amer.

Preprint

Number 5

Volume 110

Pages

Conference site Fort Lauderdale, USA

Conference date 3.12.2001- 7.12.2001

Abstract In this paper some limitations of the hearing-aid fitting process are discussed. In the fitting process, an audiologist performs tests on the wearer of the hearing aid, which is then adjusted based on the results of the test, with the goal of making the device work as best as it can for that individual. Traditional fitting procedures employ specialized testing devices which use artificial test signals. Ideally, however, the fitting of hearing aids should also simulate real-world conditions, such as listening to speech in the presence of background noise. Therefore, more satisfying and reliable fitting tests may be achieved through the use of multimedia computers equipped with a properly calibrated sound system. We have developed a new automatic system for fitting hearing aids. It employs fuzzy logic. In this process, a computer makes choices for adjusting the hearing aid's settings by analyzing the patient's responses and answering questions with replies that can lie somewhere between a simple "yes" or "no." This paper will describe the method and present some results of the experiments conducted to test the system.

Streszczenie Niniejszy referat przedstawia główne założenia systemu doboru protez opartego o wnioskowanie rozmyte. W systemie tym w pierwszej fazie badania wykorzystywana jest metoda LGOB (ang. Loudness Growth in 1/2-Octave Bands), pozwalająca na zbadanie zależności subiektywnego wrażenia głośności w funkcji częstotliwości. W metodzie tej badany określa odbierany sygnał według skali subiektywnej. Po określeniu zakresu dynamiki słuchu badanej osoby, określa się następnie sposób narastania wrażenia głośności w ustalonym zakresie. Kolejna faza, to badanie wykorzystujące mowę w szumie. Ten etap pozwala na uzyskanie adekwatnych charakterystyk kompresji protezy. Przetwarzanie uzyskanych wyników dokonywane jest w oparciu o logikę rozmytą.

Entry No. 70

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title In Search for Surround Sound Recording Techniques

Polish title Wielokanałowe systemy w nagraniach muzyki klasycznej

Conference ISMA'2001

Preprint

Number

Volume

Pages

Conference site Perugia, Italy

Conference date 9.2001- 9.2001

Abstract The existing and recently introduced standards of surround systems allow for reproduction of spatial sound in almost any room conditions. The vital concern of sound production for surround systems is the number of microphones, their positioning, proportion between direct sound, early reflections and the reverberation, artificially added delays, etc. The proper solution of such problems may result in creating spatial impression that is comparable to the live music perception. However this kind of a study should address some of the questions related to surround sound production. The broader aim is to establish recommendations as how to produce recordings of classical music designated for sound surround systems in specific acoustical conditions and then to reproduce it properly. This paper shows a study in which several microphone techniques were used for recordings of classical music in two auditory halls having different acoustical properties. Based on these recordings and various mixing techniques two channel stereo excerpts and some multichannel ones were produced. The latter were encoded in 5.1 multichannel format. The extensive subjective tests were performed employing a group of sound engineers and students in order to find the most preferable recording techniques. The listening tests were first performed employing excerpts obtained for each room separately, then the best production was compared for two rooms. The subjective tests were carried out in the same listening room equipped with the 5.1 surround reproduction system. In the paper results of such a comparison tests are shown. The methodology of carrying out subjective tests is presented. The discussion of obtained results and some conclusions are also included.

Streszczenie W referacie przedyskutowano problemy związane z nagraniem muzyki klasycznej przy użyciu wielokanałowych systemów mikrofonowych. Opisano kilka wybranych systemów, które posłużyły do nagrań muzyki kameralnej. Przeprowadzono szereg testów subiektywnych, których wyniki pozwoliły na wskazanie optymalnego w danych warunkach systemu mikrofonowego.

Entry No. 71

Entry type journal paper

Authors B. Kostek, A. Czyżewski

English title Representing Musical Instrument Sounds for Their Automatic Classification

Polish title Parametryzacja dźwięków muzycznych do celów automatycznej klasyfikacji instrumentów muzycznych

Journal J. Audio Eng. Soc.

Volume 49

Number 9

Pages 768 - 785

Abstract A study of the automatic classification of musical instrument sounds is presented. For this purpose a database of musical instrument sound parameters was built which consists of musical instrument recordings and their parametric representations. The parameterization process was conceived and performed in order to find significant musical instrument sound features and to remove redundancy from the musical signal. Classification experiments of musical instrument sounds were performed with neural networks allowing a discussion of the efficiency of the feature extraction process and its limitations. Conclusions and remarks concerning further development of this study and its relation to the current MPEG-7 standardi-zation process are included.

Streszczenie Artykuł dotyczy zagadnień związanych z automatycznym rozpoznawaniem instrumentów muzycznych. Zawarto w nim dyskusję na temat sposobów parametryzacji, a także przedstawiono wybrane posatci wektora cech. Do celów automatycznej klasyfikacji użyto sztucznych sieci neuronowych. Zawarto wnioski dotyczące tworzonego standardu MPEG-7.

Entry No. 72

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Automatic Recognition of Musical Instrument Sounds - Further Developments

Polish title Automatyczna klasyfikacja dźwięków instrumentów muzycznych - rozwój badań

Conference 110th Audio Eng. Soc. Convention

Preprint 5116

Number

Volume

Pages

Conference site Amsterdam, Netherlands

Conference date 12.5.2001- 15.5.2001

Abstract Discussion on the subject of retrieval of musical data from Internet or multimedia databases, which is carried out now for some time does not successfully reach its final stage of application. There are still many problems related to the subject of automatic recognition of music or musical instrument sounds that cannot be easily solved. Especially important is to find adequate parameters of musical signal based on time and frequency and/or wavelet analyses. Proposed feature vectors were derived on the basis of the constructed databases that contain recorded musical sounds. The presented study shows methods of automatic identification of musical instruments based both on classical statistical and soft computing approaches. They were used then to classify musical instruments. A set of results obtained in the carried out investigations is provided and analyzed and concluding remarks are included in the paper.

Streszczenie Referat dotyczy zagadnień związanych z automatycznym wyszukiwaniem informacji w bazach muzycznych. Głównym celem referatu było przedstawienie wektora cech zawierającego parametry dźwięków muzycznych opartych o analizę falkową. Przedstawiono przykładowe wyniki i podano wnioski.

Entry No. 73

Entry type journal paper

Authors A. Czyżewski, J. Jaroszuk, B. Kostek

English title DIGITAL WAVEGUIDE MODELS OF THE PANPIPES

Polish title Synteza falowodowa fletni Pana

Journal Archives of Acoustics

Volume 27

Number 4

Pages 357 - 371

Abstract The aim of this paper is to present a digital waveguide model of the Panpipes. For the efficient modelling of the Panpipes instrument its structure and its physics were studied and discussed. Principles of the digital waveguide modelling of woodwind instruments were also briefly reviewed. In the paper two digital waveguide models of Panpipes instruments differing from each other in their complexity were presented. Consequently it enabled studying the influence of the decreasing complexity of the model on the resulting synthetic sound quality. The subjective tests performed showed that the simplifications in digital waveguide models introduced reveal no noticeable influence on the sound quality. Comparison of synthetic and real Panpipes sounds was also made and conclusions reached.

Streszczenie W artykule przedstawiono główne cechy syntezy falowodowej. Omówiono cechy instrumentu fletni Pana. Przedyskutowano cechy zaproponowanych dwóch modeli fletni Pana różniących się złożonością obliczeniową. Pokazano szczegóły implementacyjne tych modeli, a także uzyskane wyniki symulacji dźwięków w modelach. Dokonano porównania dźwięków rzeczywistych i uzyskanych w wyniku syntezy falowodowej.

Entry No. 74

Entry type conference paper

Authors J. Czerniawski, A. Czyżewski, R. Królikowski

English title Neural Computation of Direction-Of-Arrival of Sound

Polish title Przetwarzanie neuronowe dla potrzeb wyznaczania kierunku przychodzenia dźwięku

Conference 3rd WSEAS (World Scientific and Engineering Academy and Society) Int.Conf. on Neural Network and Applications (NNA '02)

Preprint CD-ROM

Number

Volume

Pages

Conference site Interlaken, Szwjacaria

Conference date 11.2.2002- 15.2.2002

Abstract One of issues related to videoconferencing is addressed in the paper, namely - the problem of sound source localization. A new neural approach for estimation direction of arrival (DOA) is presented and discussed therein. The introduced algorithms are based on various types and structures of neural networks, including feedforward and recurrent ones. Considerations of the proposed DOA estimation are supported by some of the results of numerous experiments, carried out using audio material recorded in an anechoic chamber and acoustically adopted room. The results are briefly discussed.

Streszczenie W referacie zaprezentowano rozwiązania wykrywania kierunku źródła dźwięku. W tym celu sygnał akustyczny jest odbierany przy pomocy matrycy mikrofonów, a następnie jako sygnał wielokanałowy jest parametryzowany i jako sekwencja wektorów odpowiednich współczynników jest przetwarzany przez sztuczne sieci neuronowe. W przedstawionych w referacie eksperymentach wykorzystano zarówno jednokierunkowe, jak i rekurencyjne sieci neuronowe.

Entry No. 75

Entry type book

Authors A. Czyżewski, B. Kostek, H. Skarżyński

English title

Polish title Technika komputerowa w audiologii, foniatrii i logopedii

Editor Akademicka Oficyna Wydawnicza EXIT

Pages 1 - 441

Abstract Książka prezentuje opracowania, które są wynikiem kilkuletniej współpracy naukowaców z dziedziny informatyki, telekomunikacji, otolaryngologii, audiologii, psychologii, pedagogiki, logopedii i foniatrii. Książka prezentuje zastosowania techniki komputerowej w dziedzinach określonych w jej tytule.

Entry No. 76

Entry type conference paper

Authors A. Czyżewski, J. Czerniawski

English title Experimenting with Neural Network – Based Identification of Sound Source Position

Polish title Eksperymenty z opartą na sieciach neuronowych identyfikacji położenia źródel dźwięku

Conference Proc. 6th IASTED Intern. Conference,Artificial Intelligence and Soft Computing

Preprint

Number

Volume

Pages 385 - 390

Conference site Banff, Canada

Conference date 17.7.2002- 19.7.2002

Abstract Sound source position identification systems are often used in many telecommunication areas. Numerous approaches to this task were developed. Usually such systems are based on digital signal processing technology and are computationally intensive. This paper presents one of alternative methods implementing intelligent neural network–based decision module. The method effectiveness was tested with various types and structures of multilayer neural networks. The obtained results are discussed in the paper.

Streszczenie W referacie zaprezentowano alternatywną metodę lokalizacji położenia źródeł dźwięku opartą na zastosowaniu modularnej sieci neuronowej. Skuteczność tej metody detekcji kierunku napływania dźwięku była testowana z użyciem różnych struktur tego typu sieci, zaś uzyskane wyniki zostały przedyskutowane w referacie.

Entry No. 77

Entry type conference paper

Authors H. Skarżyński, B. Kostek, A. Czyżewski, J. Kotus, K. Kochanek

English title A COMPUTER EXAMINE OF HEARING OF SMALL CHILDREN USING BEHAVIORAL AUDIOMETRY METHOD

Polish title KOMPUTEROWE BADANIE SŁUCHU MAŁYCH DZIECI METODĄ AUDIOMETRII BEHAWIORALNEJ

Conference III KONGRES POLSKIEGO TOWARZYSTWA MEDYCYNY PERINATALNEJ

Preprint

Number

Volume

Pages 38

Conference site Łódź, Polska

Conference date 27.9.2002

Abstract The software to examine hearing of small children using behavioral audiometry method was presented in the paper. It enable children diagnostic in age from 6 to 36 months. A common features of the program were presented. The testing conditions and the hardware requirements were described. A multichannel sound system applied in the program, make possible to verify capabilities of localization sound sources. A large variety of the testing signals from a child environment cause, that the software could be used in checking a progress in rehabilitation process. Moreover, it could be used in fixing a hearing aids. At the end of the examine it is possible to print a report witch include a result and a description of test conditions and the child and his parent personal data.

Streszczenie W referacie przedstawiono program komputerowy do badania słuchu małych dzieci metodą audiometrii behawioralnej. Umożliwia on badanie dzieci w wieku od 6 do 36 miesięcy. Przedstawiono podstawowe cechy programu, opisano warunki badania oraz wymagania sprzętowe. Zastosowanie w programie wielokanałowego systemu odsłuchu dźwięku umożliwia weryfikację zdolności w zakresie lokalizowania źródła dźwięku. Duża różnorodność sygnałów testowych z otoczenia dziecka sprawia, że program może być pomocny do kontroli postępów w procesie rehabilitacji. Ponadto, może znaleźć zastosowanie podczas dopasowywania protez słuchowych. Po zakończeniu badania istnieje możliwość wydrukowania raportu zawierającego wyniki badania, opis profilu testu oraz dane osobowe dziecka i opiekuna.

Entry No. 78

Entry type conference paper

Authors B. Kostek, A. Czyżewski, M. Dziubiński

English title Decomposition of Duet Instrument Sounds

Polish title Dekompozycja duetów muzycznych

Conference ISMA'2002

Preprint

Number

Volume

Pages 292 - 301

Conference site Meksyk, Meksyk

Conference date 9.12.2002- 13.12.2002

Abstract This paper shows first a review of recent developments in the domain of separation of musical instrument sounds and then presents some methods of this kind developed at the Sound and Vision Engineering Department of the Technical University of Gdansk,Poland. The proposed technique for the decomposition of duet sounds is based on the modified Frequency Envelope Distribution analysis (FED). Recently introduced Frequency Envelope Distribution (FED)algorithm decomposes signal into linear expansion of waveforms,called EMO – Envelope Modulated Oscillations providing a combination of complex exponential signals modulated by complex amplitude envelopes.These waveforms are chosen to best match harmonic parts of the signal,however non-harmonic structures can be also represented by EMO.The first step of the engineered algorithm is the estimation of the fundamental frequency of the lower pitched instrument.Pitch estimation is carried out in block processing.The input signal is divided into short overlapping blocks,and pitch is estimated for each block separately,resulting in Pitch Contour Signal (PCS). Then harmonics of the second sound are searched in the residual signal.Therefore in this approach based on the FED algorithm the multi-pitch detection is not needed.Results of the performed experiments are shown and conclusions are derived.

Streszczenie W referacie zaprezentowany został algorytm separacji nagrań duetów muzycznych. Metoda separacji oparta została na algorytmie FED, przy pomocy którego możliwa jest ekstrakcja części harmonicznych sygnałów. Ponadto wykorzystany został algorytm estymacji częstotliwości podstawowej oparty na korelacji skrośnej, w celu estymacji częstotliwości dekomponowanych harmonicznych.

Entry No. 79

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Fitting hearing aids employing fuzzy logic

Polish title Dopasowanie protez słuchu z wykorzystaniem logiki rozmytej

Conference Proc. 6th IASTED Intern. Conference, Artificial Intelligence and Soft Computing

Preprint

Number

Volume

Pages 599 - 602

Conference site Banff, Canada

Conference date 17.7.2002- 19.7.2002

Abstract The paper describes first limitations of the clinical hearing aid fitting process. The audiological assessment in this process is based both on classical methods that use as a basis results of the audiometric test and the loudness scaling method. These methods employ artificial test signals. However, the fitting of hearing aids should be also performed on the basis of testing speech understanding in noise, because this is much closer to the real life conditions. A satisfying reliability of these tests may be achieved through the use of modern computer technology with application of a properly calibrated sound system. A new strategy applicable to fitting prostheses was developed. It allows finding automatically characteristics of a hearing aid matching patients needs. The principles of the fitting method, and results of the experiments will be also presented in the paper.

Streszczenie W referacie przedstawiono nowa metodykę dopasowania aparatów słuchowych opartą na wykorzystaniu badania zrozumiałości mowy w szumie. Metodyka ta wykorzystuje logikę rozmytą w procesie skalowania poziomu głośności bodźców słuchowych.

Entry No. 80

Entry type conference paper

Authors A. Czyżewski, M. Szczerba

English title Pitch Estimation Enhancement Employing Neural Network- Based Music Prediction

Polish title

Conference Proc. IASTED Intern. Conference, Artificial Intelligence and Soft Computing

Preprint

Number

Volume

Pages 413 - 418

Conference site Banff, Canada

Conference date 17.7.2002- 19.7.2002

Abstract In this paper a new method for pitch estimation enhancement was presented. Pitch estimation methods are widely used for extracting musical data from digital signal. A brief review of these methods is included in the paper. However, since processed signal may contain noise and distortions, the estimation results can be erroneous. The proposed method was developed in order to override disadvantages of standard pitch estimation algorithms. The new -approach is based on both pitch estimation in terms of signal processing and pitch prediction based on musical knowledge modeling. First, signal is partitioned into segments roughly analogous to consecutive notes. Thereafter, for each segment an autocorrelation function is calculated. Autocorrelation function values are then altered using pitch predictor output. A music predictor based on artificial neural networks was introduced for this task. The description of the proposed pitch estimation enhancement method is included and some details concerning music prediction are discussed in the paper.

Streszczenie W referacie przedstawiono metodę umożliwiającą poprawę skuteczności detekcji wysokości dźwięku muzycznego opartą na wykorzystaniu predykcji fraz muzycznych. Opracowany detektor wysokości dźwięku działa w oparciu o metodę autokorelacyjną, wspomaganą przez odpowiednio wytrenowaną sieć neuronowa.

Entry No. 81

Entry type conference paper

Authors G. Szwoch, A. Czyżewski, K. Kochanek, H. Skarżyński

English title

Polish title Baza danych internetowych systemów telemedycznych

Conference Infobazy 2002 - Bazy danych dla nauki

Preprint

Number

Volume

Pages 57 - 62

Conference site Gdańsk-Sobieszewo, Polska

Conference date 24.6.2002- 26.6.2002

Abstract In this paper, database of internet telemedical systems „Telezdrowie” is described. This database collects personal data of the users and results of tests performed using this system. The structure of the database, the data collected, a method of collecting and searching data, as well as implementation of the database is described.

Streszczenie W artykule opisano bazy danych współpracujące z internetowymi serwisami telemedycznymi „Telezdrowie”. Baza danych umożliwia gromadzenie danych osobowych użytkowników systemu oraz wyników badań przeprowadzonych przy użyciu omawianych systemów. Opisano strukturę bazy danych, rodzaj zbieranych informacji, sposób gromadzenia i wyszukiwania danych w bazie, a także realizację techniczną bazy.

Entry No. 82

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title Deriving Rules for Mastering Surround Sound to Accompany Video

Polish title Ekstrakcja reguł w dla potrzeb masteringu dźwięku dookólnego i video

Conference DAGA'02

Preprint

Number

Volume

Pages

Conference site Bochum, Germany

Conference date 4.3.2002- 7.3.2002

Notes referat plakatowy

Abstract The methodology of testing influence of video image on surround sound perception developed by authors led to formulation of some principles of mastering multi-channel sound accompanying video content. The literature relates mostly to the classical studies on this subject including stereo sound systems for HDTV. At present, digital video, film or multimedia presentations are often accompanied by the surround sound. However, there is still no clear answer to the question: how the video influences the localization of virtual sound sources in multichannel surround systems (e.g. DTS) and in most references one can find a list of problems only to be solved while testing relevant inter-modal relations. Therefore, authors addressed in their studies similar problems employing subjective testing procedures in which experts listened to the sound with- and without video image presence and provided answers. Results of such experiments demonstrated in which cases and in what way video may affect the localization of virtual sound sources. The so called image proximity effect confirmed some dependencies between reactions of sight and hearing senses due to perception of visual stimuli accompanied by surrounding sound. The obtained data were then analyzed by means of modern techniques of intelligent data exploration and knowledge discovery allowing finding some hidden relations between semantic descriptors of subjective impressions. Finally, basing on the results of data analysis a set of rules concerning mastering of multichannel audio to accompany various types of video content were derived. Some results of this study will be presented and discussed in the paper.

Streszczenie W referacie sformułowano przykłady reguł dotyczących tworzenia dźwięku w systemie dookólnym towarzyszącemu obrazowi video. Reguły te tworzono w opraciu o wyniki testów subiektywnych.

Entry No. 83

Entry type conference paper

Authors A. Czyżewski, B. Kostek, A. Kornacki, P. Odya, M. Dziubiński

English title Comparing some convolution-based methods for creation of surround sound

Polish title System nagrań dźwięku dookólnego z wykorzystaniem splotu odpowiedzi impulsowej sali

Conference 144th Meeting of the Acoustical Society of America (First Pan-American/Iberian Meeting on Acoustics), J. Acoust. Soc. Am.

Preprint

Number 5

Volume 112

Pages 2274

Conference site Cancun, Meksyk

Conference date 2.12.2002- 7.12.2002

Notes 1-7

Abstract Spatialization of the sound using the multichannel techniques is now getting widespread. One can derive many rules for surround sound recording and reproduction. However, there exists only few methods suitable for recording sound in large auditoria ensuring its proper subsequent reproduction in small reproduction rooms, preserving spatial properties of sound acquired in the original recording location. Some experiments presented in the paper were devoted to simulation of acoustics of the recording hall using the convolution of monophonic audio signal with the multichannel impulse response of the hall. A special microphone setup was created to that end and an original method of recording multichannel impulse response of auditory halls was conceived and implemented. In this method the acoustical signal recorded quasi-anechoically was convolved with 5 impulse responses of the simulated room measured in the room corners and at the stage position. The firecracker shots used for impulse response recording were equalized during the subsequent recorded signal processing. Surround recordings made with above mentioned convolution techniques were then compared each to others on the basis of subjective testing results. The details of the examined surround recording methods and results of their assessments will be discussed in the paper.

Streszczenie W referacie przedstawiono eksperymenty związane z symulacją dźwięku dookólnego w sali koncertowej. W tym celu wykorzystywano splot odpowiedzi impulsowej z danego wnętrza (wielokanałowe nagrania odpowiedzi impulsowej) z nagraniami z komory bezechowej. Uzyskany w ten sposób sygnał został następnie przypisany do odpowiednich kanałów w systemie dookólnym. Uzyskane w ten sposób nagrania były następnie porównywane w testach subiektywnych z nagraniami pochodzącymi z innych systemów dookólnych.

Entry No. 84

Entry type conference paper

Authors A. Czyżewski, M. Szczerba, B. Kostek

English title Pitch Estimation Assisted by the Neural Network-Based Prediction Algorithm

Polish title Estymacja częstotliwości podstawowej z wykorzystaniem predykcji neuronowej

Conference ISMA'2002

Preprint

Number

Volume

Pages 246 - 255

Conference site Meksyk, Meksyk

Conference date 9.12.2002- 13.12.2002

Abstract In this paper recent developments in pitch estimation methods enhancement were presented. This issue is well-developed within signal processing domain. However, because processed signal often contains noise and distortions, the estimation results may be erroneous. First, a brief review of such methods is shown. The developed method was introduced in order to diminish processing errors of the known pitch estimation algorithms. The proposed approach is two-fold. Both pitch estimation in terms of signal processing and pitch prediction based on neural networks are employed. First, signal is partitioned into segments roughly analogous to consecutive notes. Then, for each segment the autocorrelation function is calculated. Autocorrelation function values are then processed using pitch predictor output. A music predictor based on artificial neural networks was introduced for this task. The description of the proposed pitch estimation enhancement method is included and some details concerning music prediction are discussed in the paper.

Streszczenie W referacie zawarto przegląd metod estymacji częstotliwości podstawowej we frazach muzycznych. W celu zmniejszenia błędów oktawowych w procesie estymacji częstotliwości podstawowej dźwięków zaproponowano uwzględnienie w systemie predyktora neuronalnego. Pokazano skuteczność estymacji częstotliwości podstawowej w zaproponowanym systemie i podano wnioski.

Entry No. 85

Entry type conference paper

Authors A. Czyżewski, A. Kornacki, P. Odya

English title Some Rules and Methods for Creation of Surround Sound

Polish title Zasady realizacji dźwięku dookólnego

Conference 21st AES Conference

Preprint

Number

Volume

Pages

Conference site Petersburg, Russia

Conference date 1.6.2002- 3.6.2002

Abstract The problem of selection of an adequate surround sound life recording and reproduction methods is still open. Alternative methods of organizing this process are discussed in the paper. Some experimental recording sessions employing the 5.1 format were made with the use of various miking techniques and the convolution-based multichannel audio processing algorithm. The results were submitted to some subjective assessments and then compared. Conclusions resulting from performed experiments are derived and discussed.

Streszczenie W artykule opisano alternatywną metodę tworzenia dźwięku dookólnego wykorzystującą splot. Opisano wykorzystane techniki mikrofonowe. Zawarto wyniki subiektywnych testów odsłuchowych.

Entry No. 86

Entry type journal paper

Authors A. Czyżewski

English title Applications of Neural Networks and Perceptual Masking to Audio Restoration

Polish title Zastosowanie sieci neuronowych i maskowania perceptualnego do rekonstrukcji nagrań

Journal Journ. of New Musical Research

Volume 30

Number 4

Pages 341 - 351

Abstract Applications of learning algorithms to the restoration of recordings are presented. Attention is paid to the usage of artificial neural networks as a decision system determining which components of an input signal are valid and which ones are unwanted. It provides a basis for the parasitic impulse detection and for the interpolation of lost signal intervals. Such an approach enables also an efficient noise reduction employing the extended perceptual coding algorithm. The proposed algorithms are described briefly in the paper, obtained results are discussed and some general conclusions concerning the application of soft computing and perceptual masking to sound restoration are added.

Streszczenie Omówiono zastosowania algorytmów uczących się w dziedzinie rekonstruowania nagrań fonicznych. Szczególną uwagę zwrócono na zastosowanie sztucznych sieci neuronowych do usuwania zakłócających impulsów. Ponadto opisano zastosowanie inteligentnego algorytmu decycyzyjnego do sterowania maskowaniem perceptualnym w celu redukowania szumu.

Entry No. 87

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title

Polish title ZASTOSOWANIE WSPÓŁCZESNEJ TECHNOLOGII TELEINFORMATYCZNEJ DO POWSZECHNEJ DIAGNOSTYKI ZAGROŻEŃ HAŁASOWYCH

Conference VI Koszalińska Konferencja Naukowo-Techniczna „Hałas – Profilaktyka – Zdrowie”

Preprint

Number

Volume

Pages

Conference site Kołobrzeg, Polska

Conference date 12.11.2002- 14.11.2002

Abstract The principal objective of the project currently being realized at the Sound and Vision Engineering Department of the Gdansk University of Technology is reduction of frequency of hearing disease occurrence, whereby such diseases are caused by excessive industry, urban and traffic noise and objectionable sounds of other kind that occur in everyday life. Such an aim will be achieved as a result of implementation of solutions that will be developed within the project, whereby such solutions will be based on innovatory ideas and ways of evaluating noise and vibration nuisance and its negative impact on psychosomatic and vegetative system. The latest technological advances in information technology will be used in the course of project realization. Implementation of an all-Polish noise telemonitoring system will contribute to the increase in consciousness of society and authorities concerning impact of noise on health and can turn to be an essential factor in changing the situation for better.

Streszczenie Nadrzędnym celem projektu realizowanego aktualnie w Katedrze Inżynierii Dźwięku i Obrazu Politechniki Gdańskiej jest zmniejszenie częstości występowania chorób słuchu powodowanych nadmiernym hałasem przemysłowym, urbanistycznym, komunikacyjnym i innego rodzaju niepożądanymi dźwiękami, które występują w życiu codziennym. Cel ten będzie osiągany w wyniku wdrożenia rozwiązań opracowanych w ramach projektu, które opierają się na nowatorskich koncepcjach i sposobach szacowania stopnia dokuczliwości hałasu oraz wibracji i ich negatywnego wpływu na słuch oraz na układy psychosomatyczny i wegetatywny. W toku realizacji projektu zostaną wykorzystane najnowsze osiągnięcia technologiczne z dziedziny teleinformatyki. Wdrożenie ogólnokrajowego systemu telemonitoringu hałasu przyczyni się do wzrostu stopnia uświadomienia przez społeczeństwo i władze problematyki wpływu hałasu na zdrowie i może stać się istotnym czynnikiem przyszłej poprawy sytuacji w tym zakresie.

Entry No. 88

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya

English title Making Surround Audio Considering Image Proximity Effect

Polish title Tworzenie dźwięku przestrzennego z uwzględnieniem wpływu ściągającego obrazu na dźwięk

Conference 112th AES Convention

Preprint 5583

Number

Volume

Pages

Conference site Munich, Germany

Conference date 10.5.2002- 13.5.2002

Abstract The problem of influencing surround sound perception by video content was addressed employing subjective testing procedures in which experts listened to the sound with- and without video image presence and provided their answers. Results of experiments demonstrated in which cases and how video may affect the localization of virtual sound sources. The obtained data were then analyzed by means of modern techniques of intelligent data exploration and knowledge discovery allowing finding some hidden relations between semantic descriptors of subjective impressions. Finally, basing on the results of data analysis a set of rules concerning mastering of multichannel audio to accompany various types of video content were derived. Some results of this study will be presented and discussed in the paper.

Streszczenie Problem wpływu obrazu na dźwięk badany jest z wykorzystaniem subiektywnych testów odsłuchowych. Uzyskane wyniki analizowane są z wykorzystaniem algorytmów sztucznej inteligencji. Na podstawie uzyskanych analiz uzyskano reguły dotyczące zasad tworzenia dźwięku wielokanłowego towarzyszącego różnym typom obrazu.

Entry No. 89

Entry type journal paper

Authors A. Czyżewski, H. Skarżyński

English title

Polish title Interaktywne badania słuchu, wzroku i mowy

Journal Elektronizacja

Volume

Number 10

Pages 27 - 30

Streszczenie Telemedycyna jest jedną z najważniejszych i najszybciej rozwijających się technologii społeczeństwa informacyjnego. Pomimo dostępności wielu aplikacji, wciąż jeszcze brakuje aplikacji interaktywnych. W artykule zaprezentowano kilka przykładowych rozwiązań interaktywnych aplikacji telemedycznych, opartych na opracowaniach zrealizowanych w Katedrze Inżynierii Dźwięku i Obrazu PG.

Entry No. 90

Entry type journal paper

Authors A. Czyżewski, B. Kostek

English title Expert Media Approach to Hearing Aids Fitting

Polish title System ekspercki dopasowania protez słuchu

Journal Int. Journ. of Intelligent Systems

Volume 17

Number

Pages 277 - 294

Abstract The engineered Multimedia Hearing Aid Fitting Expert System is the experimental software that allows to find the characteristics of a hearing aid matching patients needs and to choose automatically a suitable hearing device characteristics. The key issues related to the engineered application are based on the expert system implementation. This expert system uses both fuzzy logic and rough set processing of analytical data. The principles of the engineered expert media application, some details of the rough set and fuzzy logic implementation will be presented in the paper.

Streszczenie W artykule zaprezentowano problematyke dopasowania protez słuchu. Przedstawiono system ekspercki, ktory pozwala na znalezienie charakterystyk aparatu słuchowego adekwatnego do uszkodzenia słuchu. System został oparty o metodę zbiorów przybliżonych i logikę rozmytą.

Entry No. 91

Entry type journal paper

Authors A. Czyżewski, H. Skarżyński

English title

Polish title Telemedycyna - czy może być interaktywna ?

Journal Ekspert Medyczny

Volume 6

Number 4

Pages 39 - 41

Streszczenie Obecne aplikacje telemedyczne stają się jednym ze zwrotnych motorów napędowych przyśpieszających rozwój elektroniki i teleinformatyki. W artykule opisano wdrożenia z tej dziedziny dokonane przez Katedrę Inżynierii Dźwięku i Obrazu PG we współpracy z Instytutem Fizjologii i Patologii Słuchu w Warszawie, dotyczące systemów telemedycznych do badania słuchu, wzroku i mowy.

Entry No. 92

Entry type conference paper

Authors A. Wieczorkowska, A. Czyżewski

English title Rough Set Based Automatic Classification of Musical Instrument Sounds

Polish title Automatyczna klasyfikacja instrumentów muzycznych z zastosowaniem metody zbiorów przybliżonych

Conference RSKD - International Workshop on Rough Sets in Knowledge Discovery and Soft Computing

Preprint

Number

Volume

Pages 297 - 308

Conference site Warsaw,

Conference date 5.4.2003- 13.4.2003

Abstract This paper addresses the problem of automatic recognition of musical instrument sounds, applying rough set based techniques as a tool of classification. Instruments representing wind and string families were used in the experiments. Since the main problem in case of audio data is the proper parameterization, we also investigated issues regarding various parameterization methods. Fourier transform and wavelet analysis were applied as parameterization tools. The obtained feature vectors were tested using rough set tools. The analyzed data represent singular sounds of full musical range of 11 musical instruments, played with various articulation techniques. Results of experiments are presented and discussed in this paper. We summarize our paper with conclusions on musical signal representation for timbre classification purposes.

Streszczenie Referat dotyczy problemu automatycznego rozpoznawania indtrumentów muzycznych, rozwiązywanego z zastosowaniem inteligentnych algorytmów decyzyjnych. Instrumenty muzyczne należące do rodziny dętych i strunowych zostały poddane procesowi parametryzacji z zastosowaniem analizy falkowej. Otrzymane dane stanowią reprezentację pełnej skali muzycznej 11 instrumentów, zarejestrowanych z uwzględnieniem artykulacji muzycznej. Wnioski zawarte w referacie dotyczą reprezentacji sygnałów muzycznych, która jest przydatna w procesie automatycznej klasyfikacji barwy dźwięku.

Entry No. 93

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title WAVEGUIDE MODELLING OF THE PANPIPES

Polish title Falowodowy model Fletni Pana

Conference Music Acoustics Conference

Preprint

Number

Volume

Pages

Conference site Stockholm, Szwecja

Conference date 6.8.2003- 9.8.2003

Abstract The principal aim of this paper is to present a digital waveguide model of the Panpipes. For the efficient modeling of the Panpipes instrument its structure and its physics were studied and thoroughly discussed. The acquired knowledge was then used during the construction of the model. In this context principles of the digital waveguide modeling of woodwind instruments are shortly reviewed. Because of the simplicity of designing the digital waveguide as a set of delay lines and scattering junctions the model can be easily implemented to a digital signal processor. In the paper two digital waveguide models of the Panpipes instruments were presented. They differ from each other by their complexity. This was due to examining the influence of decreasing the complexity of the model on the synthetic sound quality. The performed subjective tests resulted in showing that introduced simplifications in digital waveguide models reveal no noticeable influence on the sound quality. A comparison between synthetic and real Panpipes sounds was made. The results of both subjective tests and objective analyses obtained using engineered models of Panpipes are also included in the paper. Conclusions are derived.

Streszczenie Zaprezentowano falowodowy model Fletni Pana. W procesie opracowywania tego modelu przestudiowano zjawiska fizyczne zwiazane z powstawaniem dźwięku w tym instrumencie. W referacie przedstawiono dwa modele tego isntrumentu muzycznego, różniące się złożonością obliczeniową. Na podstawie wyników testów wykazano, że model uproszczony generuje dźwięki o podobnej jakości, jak model złożony. Wyniki i wnioski zawarto w treści referatu.

Entry No. 94

Entry type conference paper

Authors P. Suchomski, B. Kostek, A. Czyżewski

English title The Multimedia Hearing Training System for Hearing Impaired People

Polish title Multimedialny system treningu słuchowego dla osób niedosłyszących

Conference I Międzynarodowa Konferencja Telemedycyny i Telekomunikacji Multimedialnej

Preprint

Number

Volume

Pages

Conference site

Conference date 10.10.2003- 12.10.2003

Abstract A large majority of hearing aid fitting systems are focused on improving the speech understanding, because the speech is the base of people communication. A hearing aid fitting problem could be simply describe as a problem with fitting wide dynamic of speech signal to narrow dynamic of impaired hearing. To solve the problem the majority hearing aids use dynamic processors like compressor and exspander. The aim of the experiments was to design multimedia computer system which could be helpful for: - measurement of impaired hearing dynamic characteristic; - making approximate hearing impairment simulation; - obtaining dynamic characteristic of desired hearing aid; - making approximate hearing aid simulation; In the system was implemented LGOB loudness scaling test. The results of the LGOB test are the base for the hearing dynamic characteristic calculating. Base of the dynamic characteristic the system can make approximate hearing impairment simulation. The desired hearing aid dynamic characteristic is obtained as the compensation of impaired hearing dynamic characteristic. Using the hearing aid dynamic characteristic the system makes approximate simulation. For the both kind of simulation the system uses numerous recorded speech signal (logatoms, Polish words and Polish sentences). The hearing trainings are carried out based on the implemented hearing aid simulation algorithm and stored speech signal database. The details of the elaborated system will be presented in the paper.

Streszczenie Większość systemów dopasowania protez słuchu skupia się na poprawie zrozumienia mowy, ponieważ sygnał mowy jest podstawowym sposobem komunikowania się ludzi. W uproszczeniu problem dopasowania protezy słuchu może byc przedstawiony jako problem dopasowania szerokiej dynamiki sygnału do wąskiej dynamiki uszkodzonego słuchu. Do rozwiązania tego problemu większość protez słuchu wykorzystuje procesory dynamiki takie jak: kompresor i ekspander. Celem eksperymentó było stworzenie multimedialnego systemu, który byłby pomocny w: - pomiarze charakterystyki dynamiki słuchu; - tworzeniu przybliżonych symulacji ubytku słuchu; - obliczaniu poszukiwanej charakterystyki dynamiki protezy słuchu; - przeprowadzaniu przybliżonych symulacji według wyznaczonych charakterytyk protez słuchu; W systemie został zaimplementowany algorytm skalowania głośności w pasmach oktawowych. Jego wyniki są podstawą do wyznaczenia charakterystyki dynamiki uszkodzonego słuchu. Dzięki kompensacji charakterystyki dynamiki słuchu w systemie wyznaczana jest charakterystyka dynamiki protezy słuchu. W oparciu o tą wyznaczoną charakterystykę w systemie przeprowadzane są przyblizone symulacje, w których wykorzystywany jest sygnał mowy w postaci nagrań logatomów, słów i zdań w języku polskim. Szczególy opracowanego systemu są przedmiotem niniejszego plakatu.

Entry No. 95

Entry type book

Authors A. Czyżewski

English title Intelligent Acquisition of Audio Signal Employing Neural Network and Rough Set Algorithms

Polish title Inteligentna akwizycja sygnału fonicznego przy pomocy sieci neuronowych i zbiorów przybliżonych

Editor ROUGH-NEURO COMPUTING: A WAY TO COMPUTING WITH WORDS, Springer Verlag, Series on Artificial Intelligence

Pages

Entry No. 96

Entry type conference paper

Authors J. Kotus, A. Czyżewski

English title Telemedia System For Diagnosing Environmental Noise

Polish title Multimedialny system teleinformatyczny do diagnostyki hałasu.

Conference I Międzynarodowa Konferencja Telemedycyny i Telekomunikacji Multimedialnej

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 10.10.2003- 12.10.2003

Entry No. 97

Entry type conference paper

Authors P. Suchomski, B. Kostek, A. Czyżewski

English title The Multimedia Hearing Training System for Hearing Impaired People

Polish title Multimedialny system treningu słuchowego dla osób niedosłyszących

Conference Prezentacja plakatowa na VII Międzynarodowej konferencji implantów ślimakowych i medycyny audiologicznej

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 22.5.2003- 22.5.2003

Abstract A large majority of hearing aid fitting systems are focused on improving the speech understanding, because the speech is the base of people communication. A hearing aid fitting problem could be simply describe as a problem with fitting wide dynamic of speech signal to narrow dynamic of impaired hearing. To solve the problem the majority hearing aids use dynamic processors like compressor and exspander. The aim of the experiments was to design multimedia computer system which could be helpful for: - measurement of impaired hearing dynamic characteristic; - making approximate hearing impairment simulation; - obtaining dynamic characteristic of desired hearing aid; - making approximate hearing aid simulation; In the system was implemented LGOB loudness scaling test. The results of the LGOB test are the base for the hearing dynamic characteristic calculating. Base of the dynamic characteristic the system can make approximate hearing impairment simulation. The desired hearing aid dynamic characteristic is obtained as the compensation of impaired hearing dynamic characteristic. Using the hearing aid dynamic characteristic the system makes approximate simulation. For the both kind of simulation the system uses numerous recorded speech signal (logatoms, Polish words and Polish sentences). The hearing trainings are carried out based on the implemented hearing aid simulation algorithm and stored speech signal database. The details of the elaborated system will be presented in the paper.

Streszczenie Większość systemów dopasowania protez słuchu skupia się na poprawie zrozumienia mowy, ponieważ sygnał mowy jest podstawowym sposobem komunikowania się ludzi. W uproszczeniu problem dopasowania protezy słuchu może byc przedstawiony jako problem dopasowania szerokiej dynamiki sygnału do wąskiej dynamiki uszkodzonego słuchu. Do rozwiązania tego problemu większość protez słuchu wykorzystuje procesory dynamiki takie jak: kompresor i ekspander. Celem eksperymentó było stworzenie multimedialnego systemu, który byłby pomocny w: - pomiarze charakterystyki dynamiki słuchu; - tworzeniu przybliżonych symulacji ubytku słuchu; - obliczaniu poszukiwanej charakterystyki dynamiki protezy słuchu; - przeprowadzaniu przybliżonych symulacji według wyznaczonych charakterytyk protez słuchu; W systemie został zaimplementowany algorytm skalowania głośności w pasmach oktawowych. Jego wyniki są podstawą do wyznaczenia charakterystyki dynamiki uszkodzonego słuchu. Dzięki kompensacji charakterystyki dynamiki słuchu w systemie wyznaczana jest charakterystyka dynamiki protezy słuchu. W oparciu o tą wyznaczoną charakterystykę w systemie przeprowadzane są przyblizone symulacje, w których wykorzystywany jest sygnał mowy w postaci nagrań logatomów, słów i zdań w języku polskim. Szczególy opracowanego systemu są przedmiotem niniejszego plakatu.

Entry No. 98

Entry type conference paper

Authors G. Szwoch, B. Kostek, A. Czyżewski

English title Computer Modeling As a Useful Tool For Designing Acoustical Elements of Hearing Aid

Polish title

Conference VII Międzynarodowa Konferencja Implantów Ślimakowych i Medycyny Audiologicznej

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 22.5.2003- 24.5.2003

Abstract One of the most difficult aspects of hearing aid fitting is the proper choice of acoustical elements, including earmold. The aim of the research was to propose a computer tool that may be useful in this process. The computer model of the acoustical system of hearing aid is based on the physical waveguide modeling method. The algorithm computes and plots frequency response of the acoustical system. It is possible to examine the relationship between modification of the model parameters and respective changes in frequency response. Some chosen acoustical properties of the ear are implemented in the model. Although the model is simplified at this stage of research, comparison of model responses with the measurement data of real acoustical elements proved that the model behaves similarly to the physical system. In order to provide more accurate simulation of sound processing in hearing aid, dynamic signal compression was also included in the model. Signal processing procedures are performed independently in four frequency bands, compatible to LGOB examination. The complete model is implemented in Matlab as a computer program with graphical user interface. The ongoing listening tests with hearing-impaired people will be useful in validation of the model. The fully developed computer system is intended to be a helpful tool assisting the person who chooses the acoustical elements of hearing aid. It will enable one to design the acoustical system with acoustical properties optimal for the hearing aid user’s needs.

Streszczenie Jednym z najtrudniejszych aspektów dopasowywania aparatów słuchowych jest odpowiedni dobór elementów akustycznych aparatu, w tym wkładki usznej. Celem badań było zaproponowanie komputerowego narzędzia pomocnego w tym procesie. Model komputerowy akustycznego systemu aparatu słuchowego oparty jest na metodzie modelowania falowodowego. Algorytm oblicza i wykreśla charakterystykę częstotliwościową układu akustycznego. Możliwe jest zbadanie zależności pomiędzy zmianami wartości parametrów a odpowiadającym im zmianom charakterystyki częstotliwościowej. Pomimo tego, że na tym etapie badań zastosowano uproszczony model, porówanie charakterystyk modelu z charakterystykami pomiarowymi rzeczywistych elementów akustycznych aparatu słuchowego wykazało, że model odwzorowuje działanie rzeczywistego układu. W celu bardziej dokładnego odtworzenia procesu przetwarzania sygnału w aparacie słuchowym, do modelu dołączono blok dynamicznej regulacji poziomu sygnału. Przetwarzanie sygnału dokonywane jest niezależnie w czterech pasmach częstotliwości. Model komputerowy został zaimplementowany w systemie Matlab w postaci programu z graficznym interfejsem użytkownika. W celu weryfikacji poprawności działania modelu zostaną przeprowadzone testy odsłuchowe z udziałem osób z upośledzonym słuchem. Celem badań jest opracowanie pełnego komputerowego systemu służącego do projektowania akustycznych elementów aparatu słuchowego. Model ten będzie pomocny w procesie doboru aparatu słuchowego, pozwalając na dobranie elementów akustycznych aparatu najlepiej dopasowanych do potrzeb użytkownika aparatu.

Entry No. 99

Entry type journal paper

Authors A. Czyżewski

English title Automatic Identification of Sound Source Position Employing Neural Networks and Rough Sets

Polish title Automatyczna identyfikacja położenia źródła dźwięku z zastosowaniem sztucznych sieci neuronowych i zbiorów przybliżonych

Journal Pattern Recognition Letters

Volume 24

Number

Pages 921 - 933

Abstract Methods for the identification of direction of the incoming acoustical signal in the presence of noise and reverberation were investigated. Since the problem is a non-deterministic one, thus applications of two learning algorithms, namely neural networks and rough sets were developed to solve it. Consequently, two sets of parameters were formulated in order to discern target source from unwanted sound source position and then processed by learning algorithms. The applied feature extraction methods are discussed, training processes are described and obtained sound source localizing results are demonstrated and compared.

Streszczenie Przebadano metody identyfikacji kierunku napływania sygnałów akustycznych w obecności szumu i pogłosu. Ponieważ w ten sposób postawiony problem jest zagadnieniem niedeterministycznym, dlatego zastosowano algorytmy inteligentne, w celu jego rozwiązania. Przdyskutowano i porównano metody ekstrakcji cech dystynktywnych zastosowane na etapie parametryzacji oraz uzyskane wyniki automatycznej klasyfikacji.

Entry No. 100

Entry type journal paper

Authors A. Czyżewski, A. Kaczmarek, B. Kostek

English title Intelligent Processing of Stuttered Speech

Polish title Inteligentne przetwarzanie mowy osob jąkających się

Journal Journal of Intelligent Information Systems

Volume 21:2

Number

Pages 143 - 171

Abstract The process of counting stuttering events could be carried out more objectively through the automatic detection of stop-gaps, syllable repetitions and vowel prolongations. The alternative would be based on the subjective evaluations of speech fluency and may be dependent on a subjective evaluation method. Meanwhile, the automatic detection of intervocalic intervals, stop-gaps, voice onset time and vowel durations may depend on the speaker and the rules derived for a single speaker might be unreliable when trying to consider them as universal ones. This implies that learning algorithms having strong generalization capabilities could be applied to solve the problem. Nevertheless, such a system requires vectors of parameters, which characterize the distinctive features in a subject’s speech patterns. In addition, an appropriate selection of the parameters and feature vectors while learning may augment the performance of an automatic detection system. The paper reports on automatic recognition of stuttered speech in normal and frequency altered feedback speech. It presents several methods of analyzing stuttered speech and describes attempts to establish those parameters that represent stuttering event. It also reports results of some experiments on automatic detection of speech disorder events that were based on both rough sets and artificial neural networks.

Streszczenie Proces zliczania nieprawidłowo artykułowanych elementów mowy osób jakających się może być znacząco ułatwiony i zobiektywizowany poprzez zastosowanie automatycznej detekcji przerw, powtórzeń i przedłużeń. W artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się - sztucznych sieci neuronowych i zbiorów przybliżonych.

Entry No. 101

Entry type conference paper

Authors A. Lorens, A. Czyżewski, H. Skarżyński

English title

Polish title Obiektywna metoda wyznaczania skuteczności systemów implantów ślimakowych

Conference XIII Konferencja Biocybernetyki i Inżynierii Biomedycznej

Preprint

Number

Volume

Pages

Conference site Gdańsk,

Conference date 11.9.2003- 13.9.2003

Streszczenie Przedmiotem pracy było opracowanie nowej metody wyznaczania skuteczności systemów implantów ślimakowych opartej o komputerowe symulacje słyszenia elektrycznego. W celu wykonania symulacji stworzono matematyczny model percepcji słuchowej pacjenta implantowanego. Opracowany model uwzględnia wybrane zjawiska psychofizyczne, zachodzące w elektrycznie stymulowanym układzie słuchowym odpowiedzialne za percepcję mowy, takie jak: zmiany w progu słyszenia, przebieg funkcji narastania wrażenia głośności oraz rozdzielczość częstotliwościową. Opracowana metoda została sprawdzona w oparciu o eksperymenty przeprowadzone z udziałem czterech implantowanych pacjentów.

Entry No. 102

Entry type journal paper

Authors A. Czyżewski

English title Introduction to the Special Issue on Intelligent Systems to Aid the Handicapped

Polish title Wprowadzenie do specjalnego wydania na temat zastosowań systemów inteligentnych do wspomagania osób niepełnosprawnych

Journal Journal of Intelligent Information Systems

Volume 21:2

Number

Pages 103 - 104

Abstract A number of artificial intelligence and database technologies applications include those that aid handicapped people. In fact, for every new computer technology, the academic community is quick to use it to improve communication of people with disabilities or to aid them directly by e.g. mechanic controls using assistive technologies. Artificial intelligence, soft computing, domotics and intelligent agents follow similar trends. Multimedia technology combined with artificial intelligence and database technologies yields some very interesting results. A token of this is automatic interpretation of sign language for deaf people and more and more complex systems to aid the visually impaired where visual information is converted into acoustic information; examples are numerous. Until now there have been very few publications on the subject and the ones available usually came as conference proceedings, some of which were published by e.g. Springer-Verlag as part of the Lecture Notes in Computer Science series or IEEE Computer Society Workshops and various robotics conference proceedings. Moreover, there is a Special Interest Group on Computers and the Physically Handicapped (ACM SIGCAPH) that organizes international Conferences on Assistive Technologies. The subject in question has had extensive coverage as part of European Framework Programs projects and national research grants in many countries and in numerous patent applications. Few journals, however, have used their special issues to address the application of artificial intelligence and database technologies to aid the handicapped which has consequently led to the publication of this paper.

Streszczenie Wśród wielu zastosowań sztucznej inteligencji i technologii bazodanowych pojawiają się zastosowania wspierające osoby niepełnosprawne. Właściwie, z chwilą pojawienia się każdej nowej technologii komputerowej świat naukowy niezwłocznie proponuje zastosowania tej technologii do celu poprawy komunikacji z osobami niepełnosprawnymi bądź technologie wspierające te osoby bezpośrednio, np. poprzez sterowanie mechanicznymi urządzeniami dla osób niepełnosprawnych. Znane są wydawnictwa pokonferencyjne, które ukazywały się przykładowo w wydawnictwie Springer-Verlag, w jego serii z serii Lecture Notes in Computer Science lub jako proceedings of IEEE Computer Society Workshops. Istnieje także Special Interest Group on Computers and the Physically Handicapped (ACM SIGCAPH) organizująca międzynarodowe Conferences on Assistive Technologies i in. Tematyka, o której mowa jest ponadto przedmiotem licznych projektów w ramach European Framework Programs oraz grantów w wielu krajach świata i licznych aplikacji patentowych. Niniejsze wydanie specjalne czasopisma Journal of Intelligent Information Systems uzupełnia wydawnictwa, które ukazywały się do tej pory w tej dziedzinie.

Entry No. 103

Entry type conference paper

Authors P. Odya, A. Czyżewski

English title

Polish title System ekspercki do oceny stopnia nasilenia wady jąkania

Conference XII Konferencja Naukowo-Szkoleniowa Sekcji Foniatrycznej Polskiego Towarzystwa Otolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 6.6.2003- 8.6.2003

Streszczenie W związku z rozwojem metod terapii mowy osób jąkających się, w tym metod elektronicznej korekcji mowy i opracowaniem urządzeń do tego celu, rosną potrzeby w zakresie dokonywania obiektywnej oceny wyników terapii. W praktyce zliczanie błędnie artykułowanych elementów mowy jest dokonywane na podstawie testu sylabowego, jednak prowadzenie tego testu wymaga od pacjenta wysoce nienaturalnego sposobu artykułowania mowy. Z kolei zliczanie przedłużeń samogłosek, raptownych przerw a fonacji oraz powtórzeń jest procesem żmudnym a ponadto nie w pełni obiektywnym. Próby takiego zliczania dokonywanego jednocześnie przez kilka osób, ujawniają najczęściej dość znaczny rozrzut wyników. Z tego względu proponowane jest zastosowanie do tego celu komputerowej analizy mowy z zaburzeniami. Nawet jeśli program komputerowy nie sklasyfikuje potknięć artykulacyjnych w pełni poprawnie, to jednak będzie on działał za każdym razem jednakowo, umożliwiając szybkie ujęcie ilościowe postępów terapii. W procesie realizacji badań zostaną wykorzystane możliwości wynikające z udostępnienia cyfrowych korektorów mowy poradniom psychologiczno-pedagogicznym na terenie kraju, w zamian za nadsyłanie wyników badań, w tym nagrań magnetofonowych osób jąkających się. Na tej podstawie zostanie utworzona baza nagrań wypowiedzi obarczonych różnego typu zaburzeniami artykulacyjnymi, a następnie jej zawartość zostanie przeanalizowana przy wykorzystaniu opracowanych narzędzi komputerowych. W referacie zostaną omówione uzyskane wyniki i sformułowane wnioski na temat możliwości wynikających z zastosowania narzędzi komputerowych do badania mowy osób jąkająych się.

Entry No. 104

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title Employing Modern Teleinformation Technology for General Diagnostics of Noise Hazards

Polish title Technologie informacyjne w zastosowaniu do powszechnej diagnostyki hałasu

Conference First International ICSC Symposium „Information Technologies in Environmental Engineering” (ITEE’2003)

Preprint

Number

Volume

Pages 77

Conference site Gdansk, Poland

Conference date 24.6.2003- 27.6.2003

Abstract The principal objective of the project currently being realized at the Sound and Vision Engineering Department of the Gdansk University of Technology is reduction of frequency of hearing disease occurrence, whereby such diseases are caused by excessive industry, urban and traffic noise and objectionable sounds of other kind that occur in everyday life. Such an aim will be achieved as a result of implementation of solutions that will be developed within the project, whereby such solutions will be based on innovatory ideas and ways of evaluating noise and vibration nuisance and its negative impact on psychosomatic and vegetative system. The latest technological advances of teleinformatics will be used in the course of project realization. Implementation of an all-Polish noise telemonitoring system will contribute to the increase in consciousness of society and authorities concerning impact of noise on health and can turn to be an essential factor in changing the situation for better.

Streszczenie Nadrzędnym celem projektu realizowanego aktualnie w Katedrze Inżynierii Dźwięku i Obrazu Politechniki Gdańskiej jest zmniejszenie częstości występowania chorób słuchu powodowanych nadmiernym hałasem przemysłowym, urbanistycznym, komunikacyjnym i innego rodzaju niepożądanymi dźwiękami, które występują w życiu codziennym. Cel ten będzie osiągany w wyniku wdrożenia rozwiązań opracowanych w ramach projektu, które opierają się na nowatorskich koncepcjach i sposobach szacowania stopnia dokuczliwości hałasu oraz wibracji i ich negatywnego wpływu na słuch oraz na układy psychosomatyczny i wegetatywny. W toku realizacji projektu zostaną wykorzystane najnowsze osiągnięcia technologiczne z dziedziny teleinformatyki. Wdrożenie ogólnokrajowego systemu telemonitoringu hałasu przyczyni się do wzrostu stopnia uświadomienia przez społeczeństwo i władze problematyki wpływu hałasu na zdrowie i może stać się istotnym czynnikiem przyszłej poprawy sytuacji w tym zakresie.

Entry No. 105

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title Computer Diagnostics of Hearing and of Environmental Noise

Polish title Diagnostyka słuchu i zagrożeń hałasowych

Conference VI Krajowa Konferencja Naukowo-Techniczna „Diagnostyka Procesów Przemysłowych”

Preprint

Number

Volume

Pages 43 - 52

Conference site Władysławowo, Polska

Conference date 15.9.2003- 17.9.2003

Abstract The implemented system for screening testing of hearing is described. The noise telemonitoring system, developed at the Sound and Vision Engineering Department is discussed, aimed at environmental noise level monitoring. Apart from the global system characteristic, detailed system presentation was provided, consisting of descriptions of a mobile measurement unit, a computer noise measuring software, a USB sound interface with a measurement microphone and an Internet application. The results of audiometric and noise measurements were compared to those obtained with a professional noise measuring devices. The pair of engineered applications may help to diminish hearing diseases occurrence caused by environmental & industrial noise.

Streszczenie W pierwszej części referatu przedstawiono opracowaną w Katedrze Inżynierii Dźwięku i Obrazu przesiewową metodę diagnostyki słuchu. Szczegółowo opisano różne rodzaje stosowanych testów przesiewowych. Zamieszczono dyskusję zastosowanej audiometrii mowy w szumie. Podano charakterystykę opracowanej metody diagnostyki słuchu. W drugiej części przedstawiono aktualnie opracowywany system zdalnego monitorowania zagrożeń hałasowych. Oprócz ogólnej charakterystyki systemu, przedstawiono szczegółowy opis jego poszczególnych elementów składowych, którymi są: komputerowy program do analizy hałasu, przystawka dźwiękowa USB wraz z mikrofonem pomiarowym, serwis informacyjny poświęcony hałasowi oraz mobilne urządzenie pomiarowe, dokonujące analizy hałasu w dowolnie wybranych punktach pomiarowych. Przykładowe wyniki pomiarów porównano z wynikami uzyskanymi za pomocą profesjonalnego miernika poziomu dźwięku. W zamierzeniu autorów oba uzupełniające się systemy powinny przyczynić się do poprawy sytuacji w zakresie profilaktyki chorób słuchu powodowanych hałasem, przede wszystkim przemysłowym i komunikacyjnym.

Entry No. 106

Entry type

Authors A. Czyżewski

English title System and solution for speech correction

Polish title System do korekcji mowy i sposób korekcji mowy

Notes scedowany na PG decyzją U.Pat. RP w dniu 6 grudnia 2012 r.

Abstract The patent describes a solution for correcting stuttered speech and specifications of the technical means for implementing the solution.

Streszczenie W patencie opisano rozwiązanie do korygowania niepłynnej mowy, szczególnie przydatne osobom jąkającym się oraz środki techniczne techniczne przeznaczone do realizacji tego rozwiązania.

Entry No. 107

Entry type book

Authors A. Czyżewski

English title Chapter 20. Intelligent Acquisition of Audio Signal Employing Neural Network and Rough Set Algorithms

Polish title Rozdział 20. Inteligentna akwizycja sygnałów fonicznych z zastosowaniem sieci neuronowych i zbiorów przybliżonych

Editor Rough-Neural Computing. Techniques for Computing with Words. Springer.

Pages 521 - 541

Abstract The algorithms stemming from the neuro-rough computing approach were applied to digital acquisition of audio signals with regard to automatic localization of sound sources with the presence of noise and parasite echo. The application of neural networks to the automatic detection of sound arrival direction was tested first, then it was followed by some experiments employing rough sets and finally the neuro-rough approach to this problem solving was examined. The output of each tested algorithm was supposed to provide information about the direction of arriving sound. In the case of the neuro-rough algorithm the result of its action can be also available in the form of words defining the direction of arriving sound. Some details of the engineered systems and results of their experimental verification are compared and discussed.

Streszczenie Algorytmy oparte na sztucznych sieciach neuronowych i metodzie zbiorów przybliżonych zostały zastosowane do lokalizacji sygnałów fonicznych obarczonych pasożytniczym szumem i rewerberacjami. Informacja o kierunku napływania dźwięku była uzyskiwana na wyjściach tyhc algorytmów na podstawie reprezentacji parametrycznej. Przedstawiono wyniki eksperymentalne i przeprowadzono ich dyskusję.

Entry No. 108

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title Application of teleinformation technology to universal diagnostic of noise threats

Polish title Zastosowanie technologii teleinformatycznych do powszechnej diagnostyki zagrożeń hałasem

Conference I Krajowa Konferencja „Technologie Informacyjne”

Preprint

Number 1

Volume

Pages 175 - 182

Conference site Gdańsk, Polska

Conference date 18.5.2003- 21.5.2003

Streszczenie Nadrzędnym celem projektu realizowanego aktualnie w Katedrze Inżynierii Dźwięku i Obrazu Poli-techniki Gdańskiej jest zmniejszenie częstości występowania chorób słuchu powodowanych nad-miernym hałasem przemysłowym, urbanistycznym, komunikacyjnym i innego rodzaju niepożądany-mi dźwiękami, które występują w życiu codziennym. Cel ten będzie osiągany w wyniku wdrożenia rozwiązań opracowanych w ramach projektu, które opierają się na nowatorskich koncepcjach i spo-sobach szacowania stopnia dokuczliwości hałasu oraz wibracji i ich negatywnego wpływu na słuch oraz na układy psychosomatyczny i wegetatywny. W toku realizacji projektu zostaną wykorzystane najnowsze osiągnięcia technologiczne z dziedziny teleinformatyki. Wdrożenie ogólnokrajowego systemu telemonitoringu hałasu przyczyni się do wzrostu stopnia uświadomienia przez społeczeństwo i władze problematyki wpływu hałasu na zdrowie i może stać się istotnym czynnikiem przyszłej poprawy sytuacji w tym zakresie.

Entry No. 109

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Suchomski

English title Automatic Assesment of the Hearing Aid Dynamics Based on Fuzzy Logic

Polish title Automatyczna ocena dynamiki aparatu słuchowego z zastosowaniem logiki rozmytej

Conference 3rd IASTED International Conference Artificial Intelligence and Applications

Preprint

Number

Volume

Pages

Conference site Benalmadena, Hiszpania

Conference date 8.9.2003- 10.9.2003

Abstract Some principles of the fuzzy logic-based hearing fitting system are shown. A discussion on how to process loudness scaling results is presented. Then, details related to approximation of the membership functions corresponding to hearing sensation are discussed. Conclusions are also drawn.

Streszczenie Przedstawiono podstawy koncepcyjne systemu dopasowania protez słuchu opartego na logice rozmytej. Przeprowadzono dyskusje na temat metody skalowania głośności. Następnie podano szczegóły procesu aproksymacji funkcji przynależności odzwierciedlających słuchowe wrażenia głośności. Załączono wnioski.

Entry No. 110

Entry type journal paper

Authors A. Wieczorkowska, A. Czyżewski

English title Rough Set Based Automatic Classification of Musical Instrument Sounds

Polish title Automatyczna klasyfikacja instrumentów muzycznych z zastosowaniem metody zbiorów przybliżonych

Journal Electronic Notes in Theoretical Computer Science

Volume 82

Number 4

Pages 1 - 12

Abstract This paper addresses the problem of automatic recognition of musical instrument sounds, applying rough set based techniques as a tool of classification. Instruments representing wind and string families were used in the experiments. Since the main problem in case of audio data is the proper parameterization, we also investigated issues regarding various parameterization methods. Fourier transform and wavelet analysis were applied as parameterization tools. The obtained feature vectors were tested using rough set tools. The analyzed data represent singular sounds of full musical range of 11 musical instruments, played with various articulation techniques. Results of experiments are presented and discussed in this paper. We summarize our paper with conclusions on musical signal representation for timbre classification purposes.

Streszczenie Referat dotyczy problemu automatycznego rozpoznawania indtrumentów muzycznych, rozwiązywanego z zastosowaniem inteligentnych algorytmów decyzyjnych. Instrumenty muzyczne należące do rodziny dętych i strunowych zostały poddane procesowi parametryzacji z zastosowaniem analizy falkowej. Otrzymane dane stanowią reprezentację pełnej skali muzycznej 11 instrumentów, zarejestrowanych z uwzględnieniem artykulacji muzycznej. Wnioski zawarte w referacie dotyczą reprezentacji sygnałów muzycznych, która jest przydatna w procesie automatycznej klasyfikacji barwy dźwięku.

Entry No. 111

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Zastosowania inżynierii dźwięku i obrazu w biomedycynie

Conference XIII Konferencja Biocybernetyki i Inżynierii Biomedycznej

Preprint

Number

Volume

Pages

Conference site Gdańsk,

Conference date 11.9.2003- 13.9.2003

Streszczenie Wynikiem opracowań, które powstały w Katedrze Inżynierii Dźwięku i Obrazu Politechniki Gdańskiej (obecnie: Katedra Systemów Multimedialnych) we współpracy z Instytutem Fizjologii i Patologii Słuchu w Warszawie jest zestaw narzędzi komputerowych do badania słuchu oraz rozwiązanie systemowe masowych badań przesiewowych słuchu, mowy i wzroku oparte na zastosowaniu współczesnych technologii teleinformatycznych. Ponadto, celem usprawnienia procesu dopasowywania aparatów słuchowych i wszczepów ślimakowych dla potrzeb pacjentów, opracowano nowe metody wykorzystujące m. in. wnioskowanie rozmyte i komputerowy model słuchu elektrycznego. Niektóre wdrożone aplikacje i systemy z dziedziny diagnostyki słuchu, mające w znacznej mierze charakter oryginalny, zostały pokrótce przedstawione w niniejszym referacie wraz z ich podstawami koncepcyjnymi.

Entry No. 112

Entry type journal paper

Authors B. Kostek, A. Czyżewski

English title Processing of Musical Metadata Employing Pawlak’s Flow Graphs

Polish title Przetwarzanie meta danych w oparciu o metodę grafów przepływowych

Journal Transactions on Rough Sets I

Volume LNCS 3100

Number I

Pages 279 - 298

Abstract The objective of the presented research is enabling music retrieval based on intelligent analysis of metadata contained in musical databases. A database was constructed for the purpose of this study including textual data related to approximately 500 compact discs representing various categories of music. The description format of musical recordings stored in the database is compatible to the format of the widely-used CDDB database available in the Internet. An advanced query algorithm was prepared employing the concept of inference rule derivation from flow graphs introduced recently by Pawlak. The created database searching engine utilizes knowledge acquired in advance and stored in flow graphs in order to enable searching CD records.

Streszczenie W artykule przedstawiono problemy wyszukiwania informacji muzycznej. W eksperymentach posłużono się meta opisem oraz wykorzystano metodę grafów przepływowych Pawlaka. Opisano skonstruowaną bazę nagrań muzycznych. Słowa kluczowe: meta opis, wyszukiwanie informacji muzycznej, baza danych muzycznych

Entry No. 113

Entry type conference paper

Authors A. Walkowiak, A. Czyżewski, A. Lorens, B. Kostek

English title New Techniques Assisting Cochlear Implants Fitting

Polish title Techniki wspomagania procesu dopasowania implantów ślimakowych

Conference 117 Audio Eng. Society Convention

Preprint 6287

Number

Volume

Pages 1 - 4

Conference site San Francisco, USA

Conference date 28.10.2004- 31.10.2004

Abstract Measurement of Spread of Excitation (SoE) provides a potential method of assessment of cochlear implant users' benefit. To provide maxiumum benefit for the cochlear implant useres the speech processor should be fitted to the patients' need. One objective method that could deliver important information for fitting is Neural Response Telemetry (NRT). This method helps to estimate an amplitude of electrical current that is required to elicit hearing sensation via cochlear implant.

Streszczenie W referacie przedstawiono metodę telemetrii odpowiedzi neuronalnej (SoE), którą zastosowano w celu oceny amplitudy sygnału, która będzie wywoływała percepcję słuchową poprzez pobudzenie elektrod implantu ślimakowego. Jest to obiektywna metoda wyznaczania skuteczności systemów implantów ślimakowych.

Entry No. 114

Entry type journal paper

Authors A. Czyżewski, A. Kaczmarek, P. Maziewski, P. Odya, P. Szczuko, H. Krzyżniewski

English title

Polish title Badanie jakości transmisji mowy w sieciach IP

Journal Zeszyty Naukowe Wydziału ETI PG

Volume

Number 4

Pages 439 - 446

Abstract The paper contains a description of an experiment investigating the relation between a subjective speech rating and an objective quality of a telephone channel VoIP. An Internet packet transmission conditions were simulated. Listening tests were carried out based on non-sense syllable lists and a set of common speech sentences. The statistical processing of results allowed to draw-out and to present some general conclusions concerning the influence of transmission channel parameters on the subjective assessment of speech quality.

Streszczenie Praca zawiera opis eksperymentu mającego na celu zbadanie relacji pomiędzy oceną subiektywną sygnału mowy a jakością transmisji tego sygnału w kanale telefonicznym VoIP. Wykorzystano symulacje transmisji pakietowej sygnału w sieci IP. Wykonano serie testów odsłuchowych opartych na listach logatomowych i odpowiednio dobranych zdaniach. Do interpretacji wyników zastosowano analizę statystyczną.

Entry No. 115

Entry type journal paper

Authors A. Czyżewski, J. Kotus

English title UNIVERSAL SYSTEM FOR DIAGNOSING ENVIRONMENTAL NOISE

Polish title UNIWERSALNY SYSTEM TELEMONITORINGU HAŁASU ŚRODOWISKOWEGO

Journal JOURNAL of MANAGEMENT OF ENVIRONMENTAL QUALITY

Volume 15

Number 3

Pages 294 - 305

Abstract A concept of a multimedia computer system for monitoring of environmental noise is presented in the paper. The system is connected with a vital part of the project of reduction of hearing disease occurrence frequency, realized at the Sound and Vision Engineering Department, Gdansk University of Technology. Such diseases are caused by excessive industry, urban-and traffic noise and objectionable sounds of other kind that occur in everyday life. Reduction of hearing disease occurrence will be achieved as a result of implementation of solutions that will be developed within the project. It is emphasized that these solutions will be based on innovatory ideas and ways of evaluating noise and vibration nuisance and its negative influence on a psychosomatic and vegetative system. The latest technological advances in information technology will be used in the course of the project realization. Implementation of an all-Polish noise telemonitoring system will contribute to the increase in consciousness of society and authorities concerning influence of noise on health. Furthermore, it can turn to be an essential factor in changing the situation for better.

Streszczenie W publikacji przedstawiono projekt multimedialnego systemu komputerowego do monitoringu hałasu środowiskowego. Opracowywany system jest powiązany z aktualnie realizowanym w Katedrze Inżynierii Dźwięku i Obrazu Politechniki Gdańskiej projektem, którego celem jest redukcja częstości występowania niedosłuchu. Niedosłuch często jest wywołany przez nadmierny hałas przemysłowy, komunikacyjny lub innego rodzaju niepożądane dźwięki, towarzyszące człowiekowi. Wdrożenie systemu telemonitoringu hałasu przyczyni się do wzrostu stopnia świadomości społeczeństwa i władz wzakresie problematyki wpływu hałasu na zdrowie.

Entry No. 116

Entry type conference paper

Authors A. Czyżewski, J. Kotus, B. Kostek

English title Comparing Noise Levels and Audiometric Testing Results Employing IT Based Diagnostic Systems

Polish title Porównanie poziomów hałasu z wynikami przesiewowych testów audiometrycznych w oparciu o Internetowe systemy diagnostyczne

Conference The 33rd International Congress and Exposition on Noise Control Engineering INTERNOISE 2004

Preprint

Number

Volume

Pages

Conference site Prague, Czech Republic

Conference date 22.8.2004- 24.8.2004

Abstract The implemented Internet system for screening testing of hearing is described. The noise environmental noise telemonitoring system is briefly discussed. The pair of engineered applications may help to diminish hearing diseases occurrence caused by environmental & industrial noise. The results of audiometric tests and of noise measurements were compared on the basis of both systems database contents. The analysis of data captured by both systems allows a discussion included in the paper concerning the influence of excessive noise levels on the status of hearing sensitivity of large populations living or working in the areas endangered by noise.

Streszczenie W referacie przedstawiono Internetowy system przeznaczony do przeprowadzania przesiewowych testów słuchu. Zaprezentowano również system informacyjny przeznaczony do monitorowania hałasu środowiskowego. Obie Internetowe aplikacje mogą być pomocne w zmniejszaniu częstości występowania chorób słuchu powodowanych przez hałas środowiskowy i przemysłowy. Porównano wyniki testów audiometrycznych z pomiarami hałasu na podstawie zawartości baz danych obu systemów. Na podstawie analizy danych zgromadzonych przez oba systemy omówiono również problem dotyczący wpływu nadmiernych poziomów hałasu na wrażliwość słuchową u dużych grup osób żyjących lub pracujących na obszarach zagrożonych hałasem.

Entry No. 117

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title Web Based Acoustic Noise Measurement System

Polish title Internetowy System do Pomiarów Hałasu Środowiskowego

Conference 116th AES Convention

Preprint

Number

Volume

Pages

Conference site Berlin, Germany

Conference date 8.5.2004- 11.5.2004

Abstract A concept and an implementation of the multimedia computer system for the monitoring of environmental noise threats is presented. The principal aim of the project is to improve the effectiveness of prophylaxis of hearing diseases. It allows to receive, store, analyze and visualize a noise data coming from noise measurement equipments and from electronic questionnaires accessible through the Internet. A new concept of the USB noise meter with GPS is also presented.

Streszczenie W referacie przedstawiono projekt multimedialnego systemu przeznaczonego do monitorowania zagrożeń hałasem środowiskowym. Nadrzędnym celem realizowanego projektu jest zwiększenie efektywności profilaktyki chorób słuchu. Opracowywany system umożliwia odbiór, przechowywanie danych, analizę i wizualizację wyników pomiarów hałasu pozyskanych od urządzeń pomiarowych za pośrednictwem Internetu. Przedstawiono również opracowanie miernika hałasu zintegrowanego z odbiornikiem sygnału GPS, działającego w oparciu o interfejs USB.

Entry No. 118

Entry type journal paper

Authors A. Czyżewski, J. Kotus, B. Kostek, K. Kochanek, H. Skarżyński

English title IT- Enabled Comparison of Environmental Noise Levels and Noise-Evoked Hearing Impairments

Polish title Porównanie poziomów hałasu środowiskowego z uszkodzeniami słuchu wywołanych hałasem z wykorzystaniem systemów teleinformatycznych

Journal Mechanika

Volume 23

Number 2

Pages 143 - 154

Abstract The noise telemonitoring system, developed at the Multimedia Systems Department of the Gdansk University of Technology is discussed, aimed at environmental noise level monitoring. Apart from the global system characteristic, a more detailed system presentation was provided. The presentation covers descriptions of a mobile measurement unit, a computer noise measuring software, a USB sound interface with a measurement microphone, and an Internet application allowing for the automatic creation of noise maps on the basis of data received from computers or mobile phones employed to noise data acquisition. The implemented Internet system for screening testing of hearing is also described. The pair of engineered applications may help to diminish hearing diseases occurrence caused by environmental & industrial noise. The results of audiometric tests and of noise measurements will be compared systematically, on the basis of both systems database contents. The analysis of data captured by both systems underlies a discussion included in the paper, concerning the influence of excessive noise levels on the status of hearing sensitivity of large populations living or working in the areas endangered by noise.

Streszczenie Tematem pracy jest telemetryczny system monitorowania hałasu, opracowany w katedrze Systemów Multimedialnych Politechniki Gdańskiej, przeznaczony do zdalnego monitorowania poziomów hałasu środowiskowego. Oprócz ogólnej charakterystyki systemu zaprezentowano również szereg szczegółów implementacyjnych. Przedstawiono m.in. mobilne urządzenie pomiarowe, oprogramowanie do pomiarów hałasu, dźwiękowy interfejs USB wyposażony w mikrofon pomiarowy oraz Internetową aplikację, umożliwiającą automatyczne tworzenie prostych map hałasu w oparciu o dane pozyskane z urządzeń pomiarowych za pośrednictwem komunikacji bezprzewodowej. W artykule opisano również Internetowy system do przesiewowego badania słuchu. Oba prezentowane systemy mogą przyczynić się do zmniejszenia częstości występowania chorób słuchu, powodowanych prze nadmierny hałas, głównie środowiskowy i przemysłowy. Wyniki uzyskane przez poszczególne systemy będą systematycznie porównywane na podstawie danych zawartych w dedykowanych baz danych. Na podstawie analizy wyników pozyskanych przez oba systemy przeprowadzono dyskusję dotyczącą wpływu nadmiernego hałasu na wrażliwość na hałas dla dużej populacji osób zamieszkującej lub pracującej na obszarach zagrożonych hałasem.

Entry No. 119

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title

Polish title MULTIMEDIALNY SYSTEM MONITOROWANIA HAŁASU

Conference III Kongres Technologiczny

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 5.10.2004

Entry No. 120

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński, P. Odya, B. Kostek

English title A Model of Digital Hearing Aid for the Implanted Patients Revealing Residual Acoustic Hearing

Polish title Projekt i realizacja modelu subminiaturowego cyfrowego aparatu słuchowego dla osób implantowanych zachowujących resztki słuchowe

Conference II Międzynarodowa Konferecja Telemedycyny i Telekomunikacji Multimedialnej

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 8.10.2004- 9.10.2004

Notes plakat

Abstract Some recently published papers by Skarzynski et al., present a compelling demonstration of efficacy for a new treatment of severe hearing loss. The treatment combines electrical stimulation of the basal part of one cochlea, paired with acoustical stimulation of residual, low-frequency hearing on either the implanted side or both sides. Consequently, a need occurred to amplify and shift downwards frequency scale high-frequency acoustic signals, including speech consonants. Taking advantage of the recent progress in Digital Signal Processor (DSP) developments, a portable and reprogrammable digital hearing aid has been designed and implemented for shifting the speech frequency down to a lower frequency range with an algorithm based on a special resampling technique. Consequently, owing to hybrid signal processors it was possible to perform complex algorithms of signal processing at the same time providing a chance to make use of what is left of the hearing ability which has been out of reach of the hearing aids being used so far. The audio signal is proportionally transposed downwards the frequency scale by dividing each frequency component by a factor. This process compresses the speech spectrum in order to introduce as much information as possible into the limited audible frequency range of the hearing impaired listener. The developed digital hearing aid to be used by the implanted patients revealing residual hearing will be demonstrated in the poster paper.

Entry No. 121

Entry type journal paper

Authors A. Czyżewski, B. Kostek, J. Kotus

English title

Polish title Zastosowanie środków teleinformatycznych do diagnostyki zagrożeń hałasowych i chorób słuchu Część II – Multimedialny system monitorowania hałasu

Journal Przegląd Telekomunikacyjny i Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages 330 - 337

Streszczenie Opisano koncepcję i implementację multimedialnego systemu powszechnego monitorowania hałasu. Celem opisywanego projektu jest przede wszystkim poprawa profilaktyki zdrowotnej w zakresie szumów usznych i dolegliwości psychosomatycznych, które dotyczą populacji przebywającej na obszarach zagrożonych hałasem. Opracowany system pozwala na pomiar hałasu i transmisję danych do serwera, który umożliwia ich analizę i wizualizację, m.in. w postaci map akustycznych. Ponadto, opracowany system pozwala na zbieranie subiektywnych opinii na temat hałasu i na analizę danych uzyskiwanych tym sposobem.

Entry No. 122

Entry type conference paper

Authors A. Czyżewski, A. Kaczmarek, J. Kotus, A. Pawlik, A. Rypulak, P. Żwan

English title A System for Multitask Noisy Speech Enhancement

Polish title System poprawy zrozumiałości mowy w szumie

Conference 116th AES Convention

Preprint

Number

Volume

Pages

Conference site Berlin, Germany

Conference date 8.5.2004- 11.5.2004

Abstract A general characteristic of the engineered speech signal registration and restoration system is presented in the paper. It contains a concise description of specific components of the system, the system being a set of advanced tools for registration, analysis and reconstruction of speech, existing in the form of computer software. The tools included allow for prompt search of desired fragments of recordings and for the improvement of their quality through noise, distortion and interference reduction. A brief information concerning selected speech reconstruction algorithms is presented also, the use of which allowed for an especially significant increase of processed speech comprehension.

Streszczenie W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy poprawy zrozumiałości mowy w szumie wraz z oceną ich skuteczności.

Entry No. 123

Entry type conference paper

Authors A. Czyżewski, M. Dziubiński

English title Noise Reduction in Audio Employing Spectral Unpredictability Measure and Neural Net

Polish title

Conference Knowledge-Based Intelligent Information and Engineering Systems: 8th International Conference, KES 2004

Preprint

Number

Volume

Pages 743 - 750

Conference site Wellington, New Zealand

Conference date 20.9.2004- 25.9.2004

Abstract Improvements of the recently presented noise reduction algorithm, based on perceptual coding of audio are revealed. Enhancements of the spectral Unpredictability Measure parameter calculation, which is one of the significant elements in the applied psychoacoustic model are discussed. A learning decision algorithm based on a neural network is employed for determining input signal useful components acting as maskers of the spectral components classified as noise. A new iterative algorithm for calculating the masking pattern is presented. The results of experiments carried out employing the modified algorithm are discussed and conclusions are added.

Streszczenie W pracy przedstawione zostało rozwinięcie ostatnio przedstawionem metodody redukcji szumu, działającej w oparciu o kodowanie perceptualne dźwięku. Ulepszenia parametru widmowej miary nieprzewidywalności, który stanowi ważny element w wykorzystanym modelu psychoakustycznym zostały przedyskutowane. Uczący się algorytm decyzjny, działający w opraciu o sztuczną sieć neuronową wykorzystany został w klasyfikacji składowych na pasożytnicze i użyteczne. Przedstawiona została również nowa iteracyjna procedura obliczania progu maskowania. W pracy zawarte zostały wyniki eksperymentów, oraz konkluzje odnoszące się do przedstawionych algorytmów.

Entry No. 124

Entry type journal paper

Authors A. Czyżewski, A. Kaczmarek, J. Kotus, A. Pawlik, A. Rypulak, P. Żwan

English title Speech Archiving & Restoration System for Military Aviation Applications

Polish title Cyfrowy System Rejestracji I Rekonstrukcji Sygnału Mowy Dla Potrzeb Lotnictwa Wojskowego

Journal Zeszyty Naukowe Wydziału ETI PG

Volume

Number 3

Pages 135 - 142

Abstract The speech received by radio communication from jet pilots can be severely degraded by noise and various distortions. A system was developed for multi-channel recording of voice communication with jet pilots extended with a toolbox containing some advanced DSP algorithms for speech enhancement. Moreover, some innovative solutions were adopted, including the method for synchronizing transmission received from many radio stations in order to produce surround sound enabling additional perceptual filtration of speech. Some selected components of the engineered multi-task speech enhancement system are presented in the paper.

Streszczenie W referacie przedstawiono ogólną charakterystykę opracowanego systemu rejestracji i rekonstrukcji sygnału mowy. Zamieszczono opis poszczególnych składników systemu, które stanowi zestaw zaawansowanych narzędzi do rejestracji, analizy i rekonstruowania mowy, zrealizowany w formie oprogramowania komputerowego. Narzędzia te pozwalają na szybkie wyszukiwanie pożądanych fragmentów nagrań oraz poprawę ich jakości na drodze redukcji szumów, zniekształceń i zakłóceń. W skrajnie trudnych lub szczególnie istotnych przypadkach analizy zapisu, możliwe jest wykorzystanie większej liczby (do pięciu) wersji tego samego nagrania zarejestrowanego w poszczególnych naziemnych stacjach kontroli lotu, dzięki specjalnie opracowanej metodzie synchronizacji zapisu i jego odtwarzania w fonicznym zestawie wielokanałowym. Przedstawiono również opis wybranych algorytmów rekonstrukcji mowy w odniesieniu do wyników ich działania.

Entry No. 125

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title

Polish title W jaki sposób hałas i głośna muzyka wpływają na słuch ? Eksperymenty i pomiary

Conference II Bałtycki Festiwal Nauki na WETI

Preprint

Number

Volume

Pages

Conference site

Conference date 29.5.2004

Streszczenie W ramach prezentacji uczestnicy zostaną zapoznani ze sposobami pomiaru hałasu zarówno przy pomocy specjalistycznego sprzętu pomiarowego, jak i przy wykorzystaniu do tego celu komputera rozszerzonego o przystawkę pomiarową i opracowany w Politechnice Gdańskiej multimedialny system tworzenia map akustycznych. Na podstawie prowadzonych pomiarów będą mogli zorientować się jaki hałas wytwarzają różnego typu źródła i w jakim stopniu jest on uciążliwy dla organizmu. Ponadto, uczestnicy pokazu będą mogli pomierzyć poziomy głośności muzyki i dowiedzieć się, przy jakich poziomach głośności słuchanie muzyki może wpływać negatywnie na słuch.

Entry No. 126

Entry type conference paper

Authors J. Kotus, A. Czyżewski, H. Skarżyński

English title Comparing Noise Levels and Audiometric Testing Results Employing IT Based Diagnostic Systems

Polish title Zestawienie wyników pomiarów hałasu z wynikami testów audiometrycznych oparte o diagnostyczne systemy teleinformatyczne

Conference 2nd International Conference on Telemedicine and Multimedia Communication

Preprint

Number

Volume

Pages

Conference site Kajetany, Poland

Conference date 8.10.2004- 9.10.2004

Abstract A concept and an implementation of the multimedia computer system for the monitoring of environmental noise is presented in the paper. The system provides an extension to the Internet-based system for testing hearing which was engineered and launched in 1999. A considerable portion of hearing diseases is caused by excessive industry, urban and traffic noise or any unwanted sounds occurring in everyday life. Consequently, it is expected that a reduction of hearing diseases occurrence will be achieved as a result of implementation of the solutions that have been developed within the project scope. The latest technological advances in information technology were used in the course of the project realization. Consequently, it is shown in the paper that the presented solutions are based on some innovative ideas and inexpensive technical means for measuring noise and vibration allowing fast evaluation of its influence on the psychosomatic and the vegetative system. It is expected that implementation of the noise telemonitoring system covering whole country will contribute to rising awareness of society and authorities with regard of the influence of noise on health. Furthermore, it turns out to be an essential factor in the future improvement of the environmental noise conditions.

Streszczenie W referacie przedstawiono projekt i realizację multimedialnego, komputerowego systemu monitorowania hałasu. Prezentowany system stanowi uzupełnienie do Internetowego systemu przeznaczonego do przesiewowych badań słuchu, opracowanego i wdrożonego w 1999 r. Znaczna część chorób słuchu jest powodowana przez hałas przemysłowy, nadmierny hałas w miastach oraz różnego rodzaju uciążliwe dźwięki występujące w życiu codziennym. W związku z tym oczekuje się, że wprowadzane rozwiązania przyczynią się do zmniejszenia częstości występowania chorób słuchu. W toku realizacji prezentowanego projektu wykorzystano najnowsze rozwiązania technologiczne z dziedziny informatyki i telekomunikacji. Prezentowane rozwiązania są oparte o nowatorskie pomysły umożliwiające prowadzenie diagnostyki hałasu przy niskich nakładach, ponadto umożliwiają szybką ocenę negatywnego wpływu hałasu na układ psychosomatyczny i wegetatywny. Oczekuje się, że wdrożenie systemu telemonitoringu hałasu obejmie obszar całego kraju, przez co przyczyni się do zwiększenia świadomości społeczeństwa odnośnie negatywnego wpływu hałasu na zdrowie człowieka. Ponadto, upowszechnienie prezentowanych rozwiązań może okazać się istotnym czynnikiem przyszłej poprawy warunków w zakresie hałasu środowiskowego.

Entry No. 127

Entry type conference paper

Authors P. Suchomski, B. Kostek, A. Czyżewski

English title A System for Fast & Precise Hearing Aids Fitting

Polish title Komputerowy system szybkiego i dokładnego dopasowania protez słuchu

Conference Prezentacja plakatowa na II Międzynarodowej Konferecji Telemedycyny i TeleKomunikacji Multimedialnej

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 8.10.2004- 9.10.2004

Abstract Most prosthetics practitioners determine the preliminary characteristics of the hearing aid using simple calculation procedures. Experience shows that similar methods allow assessing the characteristics of the searched hearing aid in a relatively straightforward and intuitive way, but they do not guarantee finding optimum settings of the compression algorithms. Moreover, it requires some additional tuning of the determined characteristics using other methods of adjusting hearing aids. The difficulty in determining hearing characteristics on the basis of known LGOB test results lies primarily in converting the subjective scale of loudness sensation into the objective scale of sound level expressed in dB. The widely used method of determining hearing characteristics (the standard method) implicitly projects the subjective scale of categories onto the space of real numbers from the closed range from 0 to 6, and subsequently calculates the difference between the results of loudness scaling for regular hearing and those for the tested case. This problem was solved alternatively by the authors using a method that converts the results of the loudness-scaling test to the category domain in a natural way, i.e. determines the difference between regular and impaired loudness scaling in a way similar to that of a human expert, using a set of categories like e.g. very small difference, small difference, medium difference, big difference and very big difference. Subsequently, a proper interpretation of these categories is required to determine the correct sound level in the dB SPL scale. These requirements are met by the engineered and implemented method employing fuzzy logic-based processing. The poster will demonstrate the method of fast and accurate hearing aids fitting and the developed computer program making possible to use this method by audiologists.

Streszczenie Większość protetyków słuchu wyznacza wstępną charakterystykę aparatu słuchowego na podstawie prostych obliczeń, których zasadniczym parametrem jest zmierzony próg słyszenia pacjenta.. Doświadczenie pokazuje, że takie metody pozwalają w stosunkowo krótkim czasie wyznaczyć charakterystykę poszukiwanej protezy słuchu, jednak nie gwarantują one optymalnego ustawienia algorytmu kompresji dynamiki.. Ponadto metody te wymagają dodatkowych procedur optymalizujących ustawienie protezy słuchu. Trudność wyznaczenia charakterystyki protezy słuchu w oparciu o wyniki znanego testu LGOB polega na konieczności konwersji subiektywnej skali oceny wrażenia głośności do postaci obiektywnej skali poziomu dźwięku, wyrażonej w dB. Standardowa metoda wyznaczania charakterystyki protezy słuchu konwertuje skalę kategorii oceny wrażenia głośności do postaci liczb rzeczywistych z przedziału od 0 do 6, a następnie oblicza różnicę między uśrednionymi wynikami skalowania głośności dla słuchu prawidłowego i wynikami skalowania głośności danego pacjenta.. Alternatywnym, zaproponowanym przez autorów, sposobem wyznaczenia charakterystyki dynamiki uszkodzonego słuchu jest wykorzystanie logiki rozmytej do określenia różnicy w skalowaniu głośności danego pacjenta i osób o słuchu prawidłowym. Różnice te również wyrażone są w skali kategorii np. brak, bardzo mała, mała, średnia, duża, bardzo duża. Następnie za pomocą przetwarzania rozmytego następuje interpretacja otrzymanych różnic, a w konsekwencji wyznaczenie charakterystyki dynamiki uszkodzoengo słuchu wyrażonej w skali dB SPL. Ten plakat ilustruje opracowaną metodę szybkiego I precyzyjnego dopasowania protez słuchu oraz prezentuje oprogramowanie, które jest implementacją przedstawionej metody.

Entry No. 128

Entry type journal paper

Authors A. Czyżewski, M. Szczerba, B. Kostek

English title Musical Phrase Representation and Recognition by Means of Neural Networks and Rough Sets

Polish title Rozpoznawanie fraz muzycznych w oparciu o sztuczne sieci neuronowe i metodę zbiorów przybliżonych

Journal Transactions on Rough Sets I

Volume LNCS 3100

Number I

Pages 254 - 278

Abstract This paper discusses various musical phrase representations. Musical phrase analysis plays an important role in music information retrieval domain. In the paper various representations of a musical phrase are described and analyzed. Also the experiments were designed to facilitate pitch prediction within a musical phrase by means of entropy-coding of music. We used the concept of predictive data coding introduced by Shannon. Encoded music representations, stored in the database, are then used for automatic recognition of musical phrases by means of Artificial Neural Networks (ANN) and rough sets (RS). A discussion on obtained results is carried out and conclusions are included.

Streszczenie W artykule przedstawiono podstawowe definicje dotyczące frazy muzycznej. W eksperymentach posłużono się zapisem parametrycznym. W celu wzmocnienia procesu rozpoznawania wykorzystano kodowanie entropijne muzyki. W eksperymentach klasyfikacji oparto się o sztuczne sieci neuronowe i metodę zbiorów przybliżonych. Słowa kluczowe: fraza muzyczna, klasyfikacja, sztuczne sieci neuronowe, metoda zbiorów przybliżonych

Entry No. 129

Entry type journal paper

Authors A. Czyżewski

English title

Polish title Portal www.etelwelfare.com finalistą konkursu eEurope Awards for eHealth

Journal

Volume

Number 3/77

Pages 12 - 13

Streszczenie Artykuł przedstawia wybrane projekty telemedyczne prezentowane w finale konkursu eHealth w Irlandii. Wśród nominowanych znalazł się nominowany projekt Katedry Systemów Multimedialnych Politechniki Gdańskiej i Instytutu Fizjologii i Patologii Słuchu w Warszawie. System ten został pokrótce omówiony. UKD:elektronika w medycynie audiologia

Entry No. 130

Entry type conference paper

Authors H. Skarżyński, A. Czyżewski

English title 2nd International Conference on Telemedicine and Multimedia Communication

Polish title Cele 2 Międzynarodowej Konferencji Telemedycyny i Telekomunikacji Multimedialnej

Conference 2nd International Conference on Telemedicine and Multimedia Communication

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 8.10.2004- 9.10.2004

Abstract Telemedicine is one of the most important and fastest developing technologies of knowledge-based society. Although there are thousands of telemedical systems the number of the ones which offer more than just the ability of reaching the information on physiology, pathology and therapeutic methods is insufficient.

Streszczenie Autorzy zestawili główne cele konferencji, wskazując na fakt, że telemedycyna jest jedną z najszybciej rozwijających się technologii społeczeństwa informacyjnego. Pomimo iż istnieje wiele aplikacji telemedycznych, większość z nich ogranicza się prezentacji informacji. Dlatego istnieje potrzeba tworzenia i prezentacji serwisów telemedycznych oferujących np. badania przesiewowe czy informacje z zakresu terapii medycznej. UKD:elektronika w medycynie audiologia

Entry No. 131

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Music Archive Metadata Processing Based on Flow Graphs

Polish title Przetwarzanie meta opisu plików muzycznych z zastosowaniem grafów przepływowych

Conference 116th Audio Engineerig Society

Preprint 6138

Number

Volume

Pages 1 - 7

Conference site Berlin, Niemcy

Conference date 8.5.2004- 11.5.2004

Abstract The paper addresses the capabilities that should be expected from intelligent Web search tools in order to respond properly to user's music information retrieval needs. An advanced query algorithm was engineered employing a concept of inference rule derivation from flow graphs with regard to semantic data processing. This concept, introduced recently by Pawlak, is used for mining knowledge in databases. The created database searching engine utilizes knowledge acquired in advance and stored in flow graphs in order to enable searching in musical repositories. Results obtained show that employing the implemented method the resulting search matches are ranked optimally, thus metada related to recorded sound can be retrieved efficiently with the use of this algorithm.

Streszczenie W referacie zaproponowano metodykę wyszukiwania informacji muzycznej w bazach internetowych w oparciu o meta opis. Skonstruowany algorytm wykorzystuje grafy przepływowe Pawlaka. Słowa kluczowe: wyszukiwanie informacji, grafy przepływowe

Entry No. 132

Entry type conference paper

Authors A. Czyżewski, P. Maziewski, M. Dziubińki, A. Kaczmarek, B. Kostek

English title Wow detection and compensation employing spectral processing of audio

Polish title Detekcja i kompensacja zniekształceń drżenia dźwięku

Conference 117 Konferencja AES

Preprint 6212

Number 1/2

Volume 53

Pages 91

Conference site San Francisco, CA, USA

Conference date 28.10.2004- 31.10.2004

Notes abstrakt dostępy w JAES

Abstract The engineered algorithms are presented for the detection of parasitic frequency modulation in audio originating from irregularities of sound carrier velocity. The algorithms were developed with special regard to non-periodic frequency modulation effects found in old movie sound tracks. The proposed algorithms consider the influence of the wow disturbance on the location of formants in time-frequency representation. The dynamic analysis of formant structures behavior underlies discriminating between parasitic frequency changes and natural frequency fluctuations. The compensation of the detected wow-related frequency modulation is accomplished basing on the non-uniform resampling algorithm, driven by the discerned parasite modulation patterns. The details of the proposed wow detection and compensation techniques are presented and achieved results are discussed.

Streszczenie Praca zawiera opis opracowanych algorytmów detekcji i kompensacji pasożytniczych modulacji częstotliwości wynikających z nierównomiernego przesuwu nośnika dźwięku. Proponowane metody opracowano ze szczególnym uwzględnieniem przypadkowych zniekształceń drżenia obecnych w archiwalnych filmowych ścieżkach dźwiękowych. Dodatkowo algorytmy badają wpływ zniekształceń na strukturę formantową sygnałów. Analiza zmian położenia formantów umożliwia rozróżnienie naturalnych i pasożytniczych wahań częstotliwości. Kompensacja zniekształceń następuje dzięki metodzie nierównomiernego próbkowania sygnału. W kolejnych paragrafach pracy przedstawiono szczegółowy opis poszczególnych metod oraz uzyskane wyniki.

Entry No. 133

Entry type journal paper

Authors A. Czyżewski

English title

Polish title Interdyscyplinarne ujęcie problemu szumów usznych i wynikające z niego technologie elektronicznego wspomagania diagnostyki i terapii

Journal Audiofonologia

Volume

Number 25

Pages 27 - 34

Notes wydanie ukazało się w 2005 r.

Abstract Tinnitus generating process explanation based on signal quantization theory was proposed. Current work was presented concerning an ultrasound Tinnitus device employing dither noise as a masker. The device is engineered at the Gdansk University of Technology in a close co-operation with the Institute of Physiology and Pathology of Hearing.

Streszczenie Zaproponowano interpretację zjawiska powstawania szumów usznych w oparciu o teorię kwantowania sygnału fonicznego. Wskazano na realizowane aktualnie prace nad skonstruowaniem urządzenia opartego na generowaniu szumu maskującego typu dither, pracującego na częstotliwościach ponadsłyszalnych (ultradźwiękowych). Urządzenie tego typu jest aktualnie projektowane w Politechnice Gdańskiej we współpracy z Instytutem Fizjologii i Patologii Słuchu.

Entry No. 134

Entry type conference paper

Authors A. Czyżewski, J. Kotus, G. Szwoch, M. Dziubiński, A. Rypulak, A. Pawlik

English title Multitask Noisy Speech Enhancement System

Polish title Wielozadaniowy system poprawy zrozumiałości mowy

Conference AES 26th International Conference

Preprint 4-1

Number

Volume

Pages 95 - 103

Conference site Denver, Colorado, USA

Conference date 7.7.2005- 9.7.2005

Abstract The paper includes a general characteristic of the designed and implemented Multitask Noisy Speech Enhancement System providing specialized software suite designed for recording speech and for improving quality and intelligibility of recorded speech signal basing on some advanced digital signal processing algorithm applications. The software suite consists of the following applications: Restorer, Recorder and Browser. The software may be used in all cases when speech intelligibility is important, but it is not possible to obtain high quality speech recordings.

Streszczenie W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość mowy ma istotne znaczenie, a nie jest możliwe zarejestrowanie sygnału o odpowiedniej jakości.

Entry No. 135

Entry type conference paper

Authors A. Czyżewski, M. Dziubiński, A. Ciarkowski, M. Kulesza, P. Maziewski, J. Kotus

English title New Algorithms for Wow and Flutter Detection and Compensation in Audio

Polish title Nowe algorytmy detekcji i redukcji drżenia dźwięku

Conference 118 Konferencja AES

Preprint 6353

Number 7/8

Volume 53

Pages 669

Conference site Barcelona, Hiszpania

Conference date 28.5.2005- 31.5.2005

Notes abstrakt dostępy w JAES

Abstract New algorithms were developed for discriminating wow from natural musical effects, such as: periodicity detection by means of autocorrelation signal, algorithm employing AR model for power line hum frequency detection and algorithm for estimating pitch variation curve employing wow tracking based on recording bias detection in magnetic recordings. Moreover, non-uniform resampling routine was implemented and applied to wow compensation. The developed algorithms were studied using real audio examples allowing a comparison of their effectiveness.

Streszczenie W referacie przedstawiono nowe metody dyskryminacji naturalnych efektów muzycznych i pasożytniczych zniekształceń drżenia dźwięku. Dodatkowo, opisano w nim metody wyznaczania przebiegu zniekształceń drżenia. Wśród nich znajdują się: detekcja okresowości sygnału w poszczególnych ramkach czasowych, śledzenie zmian przydźwięku sieciowego wykorzystujące modelowane AR widma sygnału, śledzenie zmian wysokoczęstotliwościowego prądu podkładu. W referacie zawarto również opis metod nierównomiernego przepróbkowywania sygnału, wykonywanego w celu redukcji zniekształceń. Opisane algorytmy testowano na autentycznych archiwalnych próbkach dźwiękowych.

Entry No. 136

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Kulesza

English title PROJECT AND DEVELOPMENT OF THE AUTOMATIC STATION FOR ENVIRONMENTAL NOISE MONITORING

Polish title PROJEKT I REALIZACJA AUTOMATYCZNEJ STACJI MONITOROWANIA HAŁASU ŚRODOWISKOWEGO

Conference XI Międzynarodowe Sympozjum Reżyserii i Inżynierii Dźwięku i Obrazu

Preprint

Number

Volume

Pages 53 - 60

Conference site Kraków, Polska

Conference date 23.6.2005- 25.6.2005

Abstract The engineered device is a part of the Universal System for Diagnosing Environmental Noise which was developed in the Multimedia System Department, Gdansk University of Technology. The overall structure of automatic station for environmental noise measurements is described, as well as its basic functionality. Furthermore, features related to GPRS wireless communication bringing an opportunity to transmit measurement data and also to configure the device remotely are characterized. The main drawback of the classic approach to noise measurements is that the noise maps which are created as static ones, and do not depict the changes of noise levels. The proposed measuring station is integrated with a GPS receiver allowing linking the information about noise level with the information on the geographic position. Utilizing the multimedia system with proposed device, it is possible to decrease the time necessary for map creation and to increase reliability of noise threads representation. Principles of operation of the software providing full functionality to the user are also discussed. Short summary and electric parameter tests related to the EU regulations are also included.

Streszczenie W referacie przedstawiono projekt i realizację automatycznej stacji monitorowania hałasu środowiskowego. Stanowi ona jeden z elementów tworzonego w Katedrze Systemów Multimedialnych Politechniki Gdańskiej Multimedialnego Systemu Monitorowania Hałasu. Przedstawiono ogólną budowę stacji pomiarowej oraz omówiono jej podstawową funkcjonalność. Obszerniej opisano dodatkowe możliwości stacji, do których należą: komunikacja z wykorzystaniem transmisji GPRS oraz możliwość określania pozycji geograficznej dzięki zastosowaniu odbiornika sygnału GPS. Bezprzewodowa transmisja danych, umożliwią zarówno natychmiastowe wysłanie aktualnych wyników pomiarów jak również zdalny nadzór nad działaniem stacji. Informacja o lokalizacji pomiarów hałasu może znacząco przyspieszyć i uprościć sposób tworzenia map zagrożeń hałasem na analizowanym obszarze. Przedstawiono ponadto strukturę oprogramowania zastosowanego w omawianej stacji pomiarowej. Podano również wyniki przeprowadzonych badań opracowanej stacji pomiarowej w odniesieniu do obowiązujących norm.

Entry No. 137

Entry type conference paper

Authors A. Czyżewski

English title Applications of Perceptual Masking and Dither Noise in Audiology

Polish title Zastosowanie zjawiska maskowania i ditheringu w audiologii

Conference 7th International Workshop on Mathematical Methods in Scattering Theory and Biomedical Engineering, Nymphaio, Grecja,

Preprint

Number

Volume

Pages 12

Conference site Nymphaio, Grecja

Conference date 8.9.2005- 11.9.2005

Notes 8.9.2005- 11.9.2005.

Abstract Perceptual properties of human hearing have already been investigate in the field of noise reduction, but the author's proposal is different. Another problem concerning hearing impaired people is the narrow dynamic range of hearing sensitivity. The treat Tinntus, therapists recommend behind-the-ear generators (maskers). The purpose of using maskers is not to provide an extra gain on the signal which is the task of the hearing aid, but to generate noise. The idea is to continuously excite the ear. meanwhile, dithering is a process that adds broadband noise to the acoustic signal. As demonstrated in the paper, understanding the existing links between dithering technique and Tinnitus masking may lead to some vital improvements in Tinnitus therapy.

Streszczenie Praca przedstawia zagadnienia związane ze zjawiskiem szumów usznych i propozycja interpretacji tego zjawiska w oparciu o teorię kwantowania sygnału fonicznego. Szumy uszne często pojawiają się w sytuacji podniesienia progu słyszenia, związanego z ubytkiem słuchu, powodowanym przez choroby ucha wewnętrznego. Przyczyną tego stanu rzeczy może być degeneracja zewnętrznych komórek rzęsatych, która powoduje, że aktywacja neuronów ma miejsce dla sygnałów o wyższym poziomie, niż normalny. W sytuacji takiej mamy, zatem, do czynienia z pojawieniem się układu progowego o podwyższonym progu zadziałania. W związku z tym, dochodzi do wtrącenia dodatkowego mechanizmu kwantyzacji progowej słabych bodźców akustycznych, co jest powodowane podniesieniem progu aktywacji słuchowych komórek nerwowych. Istniejące w audiologii teorie, mające na celu wyjaśnienie tego zjawiska, nie uwzględniają bezpośrednio mechanizmów kwantyzacji sygnału, która ma miejsce w związku z istnieniem charakterystyki progowej w układzie transmisyjnym. Interpretacja taka staje się możliwa dopiero wówczas, jeżeli skorzysta się z wiedzy związanej z dziedziną przetwarzania sygnałów elektrycznych, rozwiniętej w innych dyscyplinach nauki, np. na gruncie cyfrowego przetwarzania sygnałów.

Entry No. 138

Entry type book

Authors A. Czyżewski, B. Kostek, H. Skarżyński

English title Intelligent System for Environmental Noise Monitoring

Polish title Inteligentny System Monitorowania Środowiska

Editor Advances in Soft Computing, Springer Verlag

Pages 397 - 410

Notes rozdział w książce zagranicznej

Abstract The telemonitoring system, developed at the Multimedia Systems Department of the Gdansk University of Technology is discussed, aimed at environmental noise levels monitoring. Apart from the global system characteristic, a detailed system presentation was provided, consisting of descriptions of the following elements: mobile measurement unit, computer noise measuring software, USB sound interface with a measurement microphone, Internet multimedia application and a soft computing algorithm applied to the analysis of the system database content. The results of noise measurements were compared to those obtained with professional noise measuring devices. The engineered intelligent application may help in diminishing hearing diseases occurrence caused by environmental & industrial noise.

Streszczenie W rozdziale przedstawiono projekt i realizację automatycznej stacji monitorowania hałasu środowiskowego. Stanowi ona jeden z elementów tworzonego w Katedrze Systemów Multimedialnych Politechniki Gdańskiej Multimedialnego Systemu Monitorowania Hałasu. Przedstawiono ogólną budowę stacji pomiarowej oraz omówiono jej podstawową funkcjonalność. Obszerniej opisano dodatkowe możliwości stacji, do których należą: komunikacja z wykorzystaniem transmisji GPRS oraz możliwość określania pozycji geograficznej dzięki zastosowaniu odbiornika sygnału GPS. Bezprzewodowa transmisja danych, umożliwią zarówno natychmiastowe wysłanie aktualnych wyników pomiarów jak również zdalny nadzór nad działaniem stacji. Informacja o lokalizacji pomiarów hałasu może znacząco przyspieszyć i uprościć sposób tworzenia map zagrożeń hałasem na analizowanym obszarze. Przedstawiono ponadto strukturę oprogramowania zastosowanego w omawianej stacji pomiarowej. Podano również wyniki przeprowadzonych badań opracowanej stacji pomiarowej w odniesieniu do obowiązujących norm. Słowa kluczowe: monitoring środowiska, hałas, metody inteligentne, system GPS, stacje pomiarowe

Entry No. 139

Entry type conference paper

Authors A. Czyżewski, G. Szwoch, P. Żwan

English title COPSIMO - new technologies for multimedia distribution

Polish title Projekt COPSIMO - nowe technologie dystrybucji multimediów

Conference Krajowe Sympozjum Telekomunikacji 2005

Preprint A-TM.01

Number

Volume

Pages 233 - 236

Conference site Bydgoszcz, Polska

Conference date 7.9.2005- 9.9.2005

Streszczenie W komunikacie przedstawiono założenia, realizowanego z udziałem Katedry Systemów Multimedialnych, projektu europejskiego COPSIMO, którego celem jest opracowanie sieci typu peer-to-peer, umożliwiającej wymianę nagrań multimedialnych na terenie krajów Unii Europejskiej. Sieć ta będzie zbudowana z wykorzystaniem architektury niewykorzystującej serwerów centralnych i wyposażona w mechanizmy zabezpieczające prawa autorskie twórców.

Entry No. 140

Entry type conference paper

Authors A. Czyżewski

English title Perceptual Masking and Dithering to Ease Communication with Audiology Patients

Polish title Zastosowanie zjawiska maskowania i metody ditheringu w przypadku pacjentów z szumami usznymi

Conference 3rd International Conference on Telemedcicine and Multimedia Communication

Preprint

Number

Volume

Pages 30

Conference site Warszaw, Kajetany, Polska

Conference date 21.10.2005- 22.10.2005

Abstract Perceptual properties of human hearing have already been investigate in the field of noise reduction, but the author's proposal is different. Another problem concerning hearing impaired people is the narrow dynamic range of hearing sensitivity. The treat Tinntus, therapists recommend behind-the-ear generators (maskers). The purpose of using maskers is not to provide an extra gain on the signal which is the task of the hearing aid, but to generate noise. The idea is to continuously excite the ear. meanwhile, dithering is a process that adds broadband noise to the acoustic signal. As demonstrated in the paper, understanding the existing links between dithering technique and Tinnitus masking may lead to some vital improvements in Tinnitus therapy.

Streszczenie W referacie przedstawiono zagadnienia związane ze zjawiskiem szumów usznych i propozycję interpretacji tego zjawiska w oparciu o teorię kwantowania sygnału fonicznego. Podano też rozwiązania technologiczne wspierające pacjentów z szumami usznymi.

Entry No. 141

Entry type conference paper

Authors A. Czyżewski, J. Klejsa

English title Tinnitus Diagnosis and Therapy Method Employing Ultrasound Dithering

Polish title Diagnozowanie i terapia szumów usznych z zastosowaniem ultradźwiękowego szumu typu dither

Conference Mathematical Methods in Scattering Theory and Biomedical Engineering

Preprint

Number

Volume

Pages 277 - 296

Conference site Nymphaio, Greece

Conference date 8.9.2005- 11.9.2005

Notes Wydawnictwo książkowe typu Proceedings

Abstract Tinnitus generating process explanation based on signal quantization theory was proposed. Current work was presented concerning an ultrasound Tinnitus device employing dither noise as a masker. The device is being engineered at the Gdansk University of Technology in a close co-operation with the Institute of Physiology and Pathology of Hearing.

Streszczenie Zaproponowano interpretację sposobu powstawania szumów usznych w oparciu o teorię kwantyzacji sygnału. Przedstawiono stan prac nad urządzeniem ultradźwiękowym do maskowania szumów usznych, opartym na tej teorii.

Entry No. 142

Entry type conference paper

Authors B. Kostek, P. Dalka, A. Czyżewski

English title Audiovisual speech recognition for training hearing impaired patients

Polish title Audiowizualne rozpoznawanie mowy na potrzeby treningu osób z wadami słuchu

Conference 7th International Workshop on Mathematical Methods in Scattering Theory and Biomedical Engineering

Preprint

Number

Volume

Pages 335 - 347

Conference site Nymphaio, Grecja

Conference date 8.9.2005- 11.9.2005

Abstract This study presents isolated phoneme recognition system combining both visual and acoustical data. The Active Shape Model method is used for extracting visual speech features from the shape and movement of the lips. This method consists in a model-based approach for extracting speech information from image sequences. Its advantage over the image-based approach stems from the fact that important features are represented in a low-dimensional space and are normally invariant to translation, rotation, scaling and illumination. The Mel Frequency Cepstral Coefficients (MFCCs) are used as the acoustic speech features in the speech recognition system. MFCCs are based on the short-term spectrum. The power spectrum bins are grouped and smoothed according to the perceptually motivated Mel frequency scaling. Then the spectrum is segmented into critical bands. Finally, a discrete cosine transform is applied to the logarithm of the filter bank output signal resulting in vectors of decorrelated MFCCs features. A three-layer feed-forward artificial neural network (ANN) is used in the experiments related to speech recognition. Feature vectors extracted combine both modalities of the human speech. A matrix, containing feature vectors calculated during the utterance, forms an input to the ANN. To make the results of speech classification robust against the changes in the utterance duration, an interpolation is used to compute feature vectors. Additional experiments with the degraded acoustical information are carried out in order to test the system robustness against various distortions affecting the signals. The system engineered utilizing only the visual information correctly classifies properly nearly 80% of the speech utterances. This result is very satisfying taking into account a huge similarity between lip movements during articulation of vowels and a great diversity of lip shapes originating from the anatomical features and the way of speaking. Results of classification based on the acoustical information are much better than the ones based on the visual information. However, utilizing both modalities in the speech recognition system further improves the effectiveness. Moreover this makes the system much more robust against distortions in the audio signal. A software is prepared employing above mentioned algorithms to be used by cochlear implanted patients in the process of speech training. An interactive application was conceived making possible organizing the interactive speech training sessions without any assistance from speech therapists.

Streszczenie Praca przedstawia system rozpoznawania izolowanych głosek mowy wykorzystujący dane wizualne i akustyczne. Modele Active Shape Models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na współczynnikach melcepstralnych. Sieć neuronowa została użyta do rozpoznawania wymawianych głosek na podstawie wektora cech zawierającego oba typy parametrów. Dodatkowo zbadano odporność systemu na zakłócenia w sygnale dźwiękowym.

Entry No. 143

Entry type conference paper

Authors A. Czyżewski, A. Ciarkowski, R. Kaliakine, R. Kamiński

English title The Project DESYME – Development System for Mobile Services

Polish title Projekt DESYME - środowisko do tworzenia usług mobilnych

Conference Krajowe Sympozjum Telekomunikacji KST'2005

Preprint

Number

Volume

Pages

Conference site Bydgoszcz, 2005

Conference date 7.9.2005- 9.9.2005

Abstract The note depicts goals, technical aspects and the structure of European DESYME Project. In order to emphasize the usefulness of future project’s results, a short analysis of current state of the art is also presented, together with description of common problems faced by enterprises attempting to make use of mobile technologies. Next, potential profits coming from implementation of the DESYME project’s results are introduced. Short description of the architecture of proposed solution is also included and potentially useful existing technologies are inspected. The structure of the project is presented and the role of Multimedia Systems Department is explained. An example of mobile service, whose realization will confirm the usefulness of the project’s results summarizes the paper.

Streszczenie Przedstawiono cele, aspekty techniczne i strukturę programu europejskiego DESYME. Przeanalizowano aktualny stan techniki i trudności, na jakie napotykają podmioty usiłujące wykorzystać techniki mobilne w swych przedsięwzięciach. Przedstawiono potencjalne korzyści płynące z implementacji w tych warunkach wyników projektu DESYME. Skrótowo przedstawiono architekturę proponowanych rozwiązań oraz wykorzystywane techniki. Przedstawiono strukturę projektu oraz wyjaśniono rolę w jego realizacji Katedry Systemów Multimedialnych Politechniki Gdańskiej. Przytoczono przykład usługi mobilnej, której skuteczna implementacja będzie potwierdzeniem przydatności wyników opisywanego projektu.

Entry No. 144

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Modeling of Perceptual Masking and its Applications to Hearing Aids

Polish title Modelowanie maskowania perceptualnego i jego zastosowania w protetyce słuchu

Conference XVII Krajowe Sympozjum Audiologiczne

Preprint

Number

Volume

Pages

Conference site Cetniewo, PL

Conference date 29.9.2005- 1.10.2005

Streszczenie Modelowanie zjawisk perceptualnych obejmuje badanie zależności między odchyleniem błony podstawnej narządu Cortiego a amplitudą i częstotliwością bodźca akustycznego. Zjawiska te dotyczą makromechaniki ślimaka widzianej jako proces przyporządkowania grupom częstotliwości sygnału akustycznego różnych miejsc na błonie podstawnej, w których zachodzi jej maksymalne odchylenie. Pasma filtrów słuchowych nazwano pasmami krytycznymi słuchu. Parametry filtrów zwanych filtrami słuchowymi można wyznaczyć w oparciu o różne badania psychoakustyczne. W celu dokładniejszego określenia parametrów filtrów słuchowych, a zwłaszcza ich kształtu, można posłużyć się modelem zjawiska maskowania. Zjawisko maskowania występuje w przypadku jednoczesnego istnienia dźwięku w obecności innego dźwięku lub szumu, przy założeniu, że dźwięk zagłuszany i zagłuszający leżą w pobliskich pasmach krytycznych. Wskutek pobudzania przez dźwięk nie tylko odpowiadającego mu odcinka błony podstawnej, lecz również obszaru obejmującego częstotliwości większe, inny dźwięk pobudzający ten obszar staje się słabiej słyszalny, a nawet może przestać być słyszalny. Aby dźwięk maskowany stał się ponownie słyszalny, należy zwiększyć jego natężenie. W ten sposób jego dolna granica słyszalności została przesunięta i wartość tego przesunięcia określa miarę zagłuszania. Stwierdzono także, że: zagłuszanie jest największe w sąsiedztwie tonu zagłuszającego, zmniejszenie zagłuszania przy częstotliwościach odpowiadających harmonicznym tonu zagłuszającego związane jest z istnieniem tonów subiektywnych, tony o dużych natężeniach zagłuszają wszystkie dźwięki o częstotliwościach większych, natomiast dźwięki o częstotliwościach mniejszych - tylko w bezpośrednim swoim sąsiedztwie, łatwiej są maskowane tony wyższe przez silny ton niski. Wykorzystując powyższe założenia, dokonano implementacji modelu słyszenia w postaci algorytmicznej. Następnie model ten, zaimplementowany w formie oprogramowania wykorzystano w torze eksperymentalnej protezy słuchu, uzyskując znaczącą poprawę separacji użytecznych składowych dźwięku od niepożądanych szumów i zakłóceń.

Entry No. 145

Entry type conference paper

Authors A. Czyżewski, M. Dziubiński, Ł. Litwic, P. Maziewski

English title Intelligent Algorithms for Optical Track Audio Restoration

Polish title Inteligentne algorytmy redukcji zniekształceń dźwięku w optycznych ścieżkach fonicznych

Conference 10th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing

Preprint

Number

Volume 3642

Pages 283 - 293

Conference site Regina, Kanada

Conference date 31.8.2005- 3.9.2005

Notes dostępne w Lecture Notes in Artificial Intelligence

Abstract The Unpredictability Measure computation algorithm applied to psychoacoustic model-based broadband noise attenuation is discussed. A learning decision algorithm based on a neural network is employed for deter-mining audio signal useful components acting as maskers of the spectral com-ponents classified as noise. An iterative algorithm for calculating the sound masking pattern is presented. The routines for precise extraction of sinusoidal components from sound spectrum were examined, such as estimation of pitch variations in the optical track audio affected by parasitic frequency modula-tion. The results obtained employing proposed intelligent signal processing al-gorithms will be presented and discussed in the paper.

Streszczenie W referacie przedstawiono dwa algorytmy dedykowane redukcji pasożytniczych zniekształceń dźwięku spotykanych w optycznych ścieżkach dźwiękowych. Pierwszy algorytm umożliwia redukcję szerokopasmowego szumu w nagraniach fonicznych. Wykorzystano w nim psycho-akustyczny model słuchu oparty o miarę nieprzewidywalność sygnału (ang. Unpredictability Measure). Ocena jakości redukcji szumu została wykonana z wykorzystaniem metod inteligentnych. Drugi algorytm dedykowany jest zniekształceniom kołysania dźwięku. Wykorzystano w nim informacje o zmianach w czasie składowych tonalnych dźwięku. W referacie umieszczono wyniki obrazujące sposób działania obu algorytmów.

Entry No. 146

Entry type conference paper

Authors P. Szczuko, Ł. Kosikowski, A. Czyżewski

English title Virtual Hearing Aid – the Computer Application for Education Purposes

Polish title Wirtualny Aparat Słuchowy - aplikacja komputerowa do celów edukacyjnych

Conference 8th International Conference Advances in Diagnosis and Treatmentof Auditory Disorders

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 19.5.2005- 21.5.2005

Abstract The purpose of the elaborated application is to demonstrate the principles of hearing aid operation. It enables a normal hearing person familiarizing with hearing problems and hearing aid operation principles. The application is provided with an intuitive interface. The interactive hearing aid is presented on a diagram extended with some selectable options related to sound source, noise type, and hearing aid settings. The sound examples are included making possible listening tests performed by the application user. The application is also accompanied by the education material concerning anatomy of human ear and typical hearing impairment cases. The application details will be discussed during the paper presentation and a demonstration of its principles of operation will be included.

Streszczenie Zrealizowano aplikację komputerową demonstrującą zasady działania aparatów słuchowych. Osoba z dobrym słuchem ma możliwość zapoznania się z problemem niedosłuchu i działaniem aparatu. Aplikacja wyposażona jest w intuicyjny interfejs, który przedstawia elementy aparatu słuchowego na schemacie blokowym z opcjami dotyczącymi źródła dźwięku, typu szumu oraz ustawień aparatu. Aplikacja wzbogacona jest o materiały dydaktyczne, opisujące anatomię ludzkiego ucha oraz typowe zaburzenia słuchu.

Entry No. 147

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński

English title Theory of Tinnitus Generation and its Therapeutic Application

Polish title Interpretacja sposobu powstawania szumów usznych i jej wykorzystanie terapeutyczne

Conference XVII Krajowe Sympozjum Audiologiczne

Preprint

Number

Volume

Pages

Conference site Cetniewo, PL

Conference date 29.9.2005- 1.10.2005

Streszczenie Szumy uszne często pojawiają się w sytuacji podniesienia progu słyszenia, związanego z ubytkiem słuchu, powodowanym przez choroby ucha wewnętrznego. Przyczyną tego stanu rzeczy może być degeneracja zewnętrznych komórek rzęsatych, która powoduje, że aktywacja neuronów ma miejsce dla sygnałów o wyższym poziomie, niż normalny. W sytuacji takiej mamy, zatem, do czynienia z pojawieniem się układu progowego o podwyższonym progu zadziałania. W związku z tym, dochodzi do wtrącenia dodatkowego mechanizmu kwantyzacji progowej słabych bodźców akustycznych, co jest powodowane podniesieniem progu aktywacji słuchowych komórek nerwowych. Istniejące w audiologii teorie, mające na celu wyjaśnienie tego zjawiska, nie uwzględniają bezpośrednio mechanizmów kwantyzacji sygnału, która ma miejsce w związku z istnieniem charakterystyki progowej w układzie transmisyjnym. Interpretacja taka staje się możliwa dopiero wówczas, jeżeli skorzysta się z wiedzy związanej z dziedziną przetwarzania sygnałów elektrycznych, rozwiniętej w innych dyscyplinach nauki, np. na gruncie cyfrowego przetwarzania sygnałów. W związku z tym, została zaproponowana interpretacja zjawiska powstawania szumów usznych w oparciu o teorię kwantowania sygnału fonicznego. Ponadto, na gruncie cyfrowego przetwarzania sygnałów wypracowano metodykę eliminacji szumu powstającego w procesie kwantyzacji progowej, zwaną techniką ditheringu. Technika ta polega na dodawaniu do sygnałów użytecznych o niskim poziomie pewnej porcji szumu, co w efekcie zatrzymuje proces samorzutnej generacji szumu w torze, powodowany przez istnienie charakterystyki progowej. W tym kontekście łatwo można dostrzec, że w podobny sposób zwalczane są również szumy uszne w audiologii, tzn. stosuje się szum maskujący, który dostarczany jest przez specjalne urządzenia, zwane maskerami. Powszechnie znana skuteczność tego rodzaju technik eliminacji zarówno szumów usznych, jak i szumów kwantyzacji, powstających samorzutnie w układach elektronicznych, wskazuje na zasadność interpretacji szumów usznych jako bezpośredniej konsekwencji kwantowania słabych sygnałów fonicznych w układach progowych. Aktualnie w Instytucie Fizjologii i Patologii Słuchu we współpracy z Politechniką Gdańską trwają prace nad skonstruowaniem urządzenia opartego na generowaniu szumu maskującego typu dither, pracującego na częstotliwościach ponadsłyszalnych (ultradźwiękowych). W konstrukcji tego urządzenia wykorzystuje się bezpośrednio założenia teoretyczne dotyczące zjawisk powstających na etapie kwantyzacji sygnałów akustycznych.

Entry No. 148

Entry type conference paper

Authors A. Czyżewski

English title Computer modeling of perceptual masking and its audiology applications

Polish title

Conference 8th International Conference on Advances in Diagnosis and Treatment of Auditory Disorders

Preprint

Number

Volume 26

Pages

Conference site Kajetany, PL

Conference date 19.5.2005- 21.5.2005

Notes Audiofonologia-Supl., str. 5

Abstract A common exploitation of perceptual models in contemporary audio coding standards, their efficiency and robustness allow for compressing audio data without a noticeable loss in the subjective quality.

Streszczenie W referacie zaprezentowano podstawy perceptualne słyszenia pozwalające na stworzenie nowych modeli kodowania dźwięku, w szczególności do zastosowania w protezach słuchu.

Entry No. 149

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski, M. Kulesza, P. Maziewski

English title DSP Techniques in Wow Defect Evaluation

Polish title Metody DSP wykorzystywane do określenia przepisu drżenia dźwięku

Conference IEEE Signal Processing’2005 Workshops, Chapter Circuits and Systems, Polish Section

Preprint

Number

Volume

Pages 103 - 108

Conference site Poznań, Polska

Conference date 30.9.2005

Notes dostepne w Proc. Signal Processing‘2005 Workshop

Abstract Two methods for evaluation of the wow distortion characteristics are presented in this paper. Both approaches are based on the analysis of parasite artefacts. Chosen artefacts are the recorded power line hum and the high frequency bias. Precise frequency variations are computed using standard STFT and AR mode-based spectral representation of audio. Experiments demonstrating the results of both method applications are included.

Streszczenie Referat przedstawia dwie metody określania przebiegu pasożytniczego drżenia dźwięku. Obie wykorzystują analizę pasożytniczych artefaktów. Wybrane artefakty to: przydźwięk sieciowy i wysokoczęstotliwościowy prąd podkładu. Precyzyjny przebieg zniekształcenia kołysania oblicza się wykorzystując informacje zawarte w widmie sygnałów, a uzyskane poprzez STFT lub modelowanie AR. Referat zawiera opis eksperymentów i uzyskane wyniki.

Entry No. 150

Entry type journal paper

Authors A. Czyżewski, A. Ciarkowski, R. Kaliakine, R. Kamiński

English title The Project DESYME – Development System for Mobile Services

Polish title Projekt DESYME - środowisko do tworzenia usług mobilnych

Journal Przegląd Telekomunikacyjny i Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages

Abstract The note depicts goals, technical aspects and the structure of European DESYME Project. In order to emphasize the usefulness of future project’s results, a short analysis of current state of the art is also presented, together with description of common problems faced by enterprises attempting to make use of mobile technologies. Next, potential profits coming from implementation of the DESYME project’s results are introduced. Short description of the architecture of proposed solution is also included and potentially useful existing technologies are inspected. The structure of the project is presented and the role of Multimedia Systems Department is explained. An example of mobile service, whose realization will confirm the usefulness of the project’s results summarizes the paper.

Streszczenie Przedstawiono cele, aspekty techniczne i strukturę programu europejskiego DESYME. Przeanalizowano aktualny stan techniki i trudności, na jakie napotykają podmioty usiłujące wykorzystać techniki mobilne w swych przedsięwzięciach. Przedstawiono potencjalne korzyści płynące z implementacji w tych warunkach wyników projektu DESYME. Skrótowo przedstawiono architekturę proponowanych rozwiązań oraz wykorzystywane techniki. Przedstawiono strukturę projektu oraz wyjaśniono rolę w jego realizacji Katedry Systemów Multimedialnych Politechniki Gdańskiej. Przytoczono przykład usługi mobilnej, której skuteczna implementacja będzie potwierdzeniem przydatności wyników opisywanego projektu.

Entry No. 151

Entry type conference paper

Authors A. Czyżewski, J. Kotus, A. Kaczmarek, A. Rypulak, A. Pawlik

English title Advanced Speech Archiving And Restoration System For Aviation Applications

Polish title Zaawansowany System Rejestracji i Rekonstrukcji Mowy dla potrzeb lotnictwa

Conference First IEEE International Conference on Technologies for Homeland Security and Safety TEHOSS 2005

Preprint

Number

Volume

Pages 333 - 342

Conference site Gdańsk, Polska

Conference date 28.9.2005- 30.9.2005

Abstract The engineered Speech Archiving and Restoration System for Aviation Applications is presented in the paper. The system provides a multichannel and a multitask system for recording, storing, transmitting and for intelligibility improving of speech radio communication content. The principal aim of the system is recording and restoring of radio communication between aircraft pilots and ground control stations - extremely important in case of plane hijacking or crash accident investigating. The elaborated system is based on some innovative solutions presented in the paper. It concerns among others synchronous recording of the same speech signal versions received from different radio stations. Another feature is restoration process based on multi-channel analysis of the same recordings obtained from different ground control stations. Moreover, some selected digital speech processing algorithms implemented to the engineered multi-task speech enhancement system are also discussed in the paper and results of their application are demonstrated.

Streszczenie W referacie przedstawiono opracowany System Rejestracji I Rekonstrukcji Mowy dla potrzeb lotnictwa. System ten umożliwia jednoczesny zapis, archiwizację i poprawę zrozumiałości sygnału mowy pochodzącego z wielu różnych kanałów komunikacji radiowej. Głównym celem systemu jest rejestracja i rekonstrukcja komunikatów słownych wymienianych drogą radiową pomiędzy pilotem samolotu a stacją kontroli lotów – jest to niezwykle istotne w przypadku porwania samolotu lub przy prowadzeniu ekspertyzy powypadkowej. Opisany system opiera się na innowacyjnych rozwiązaniach, przedstawionych dokładniej w niniejszym referacie. Pierwsze z nich dotyczy zapewnienia synchronizacji pomiędzy tą samą wypowiedzią zarejestrowaną na różnych stacjach radiowych. Następne obejmuje wykonywanie procesu rekonstrukcji sygnału w oparciu o wielokanałową analizę tej samej wypowiedzi pochodzącej z różnych naziemnych stacji radiowych. Zaprezentowano ponadto wybrane algorytmy cyfrowego przetwarzania sygnału, zaimplementowane w wielozadaniowym systemie poprawy zrozumiałości mowy. Zaprezentowano ich zastosowanie na wybranych przykładach.

Entry No. 152

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title Speech recognition system for hearing impaired people

Polish title System rozpoznawania mowy dla osób niedosłyszących

Conference 8th International Conference Advances in Diagnosis and Treatmentof Auditory Disorders

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 19.5.2005- 21.5.2005

Abstract The paper presents research results in the domain of speech recognition providing assistance to hearing impaired people. The system being engineered combines both visual and acoustic data to recognize speech in order to facilitate speech training of cochlear implant patients and other patients revealing serious hearing impairments. The Active Shape Model method is used for extracting visual speech features from the shape and movement of lips. The acoustic features extraction involves mel-cepstral analysis. Both modalities of speech are combined in the feature vectors extracted. An artificial neural network is employed as a classifier allowing recognition of speech utterances.

Streszczenie Praca przedstawia wyniki badań z zakresu rozpoznawania mowy. Tworzony system wykorzystujący dane wizualne i akustyczne będzie ułatwiał trening poprawnego mówienia dla osób po operacji transplantacji ślimaka i innych osób wykazujących poważne uszkodzenia słuchu. Active Shape models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na współczynnikach melcepstralnych. Sieć neuronowa została wykorzystana do rozpoznawania wymawianych głosek na podstawie wektora cech zawierającego oba typy parametrów.

Entry No. 153

Entry type conference paper

Authors A. Walkowiak, A. Czyżewski

English title

Polish title Telemetria odpowiedzi neuronalnych jako metoda wspomagająca dobór parametrów stymulacji przez implant ślimakowy

Conference Krajowe Sympozjum Telekomunikacji

Preprint

Number

Volume

Pages 248 - 253

Conference site Bydgoszcz, PL

Conference date 7.9.2005- 9.9.2005

Streszczenie Dzięki pomiarom odpowiedzi neuronalnych i możliwości wyznaczenia na tej podstawie tendencji rozkładu progów stymulacji można lepiej zaprogramować procesor mowy. Jest to niezwykle cenne zwłaszcza u pacjentów nie współpracujących podczas badań (na przykład u małych dzieci). W przypadku takich pacjentów tradycyjne, psychoakustyczne metody doboru parametrów stymulacji przez implant często zawodzą. Natomiast gdy audiolog dysponuje prawdopodobną mapą progów stymulacji wyznaczoną z pomiaru odpowiedzi neuronalnych, może szybciej i łatwiej określić parametry pobudzeń dla każdej z elektrod.

Entry No. 154

Entry type conference paper

Authors A. Czyżewski, B. Kostek, H. Skarżyński

English title Internet-Based Automatic Hearing Assessment System

Polish title System badania słuchu w Internecie

Conference 119 Audio Engineering Society Convention

Preprint 6626

Number

Volume

Pages

Conference site New York, USA

Conference date 7.10.2005- 10.10.2005

Abstract In the paper the Internet-based system that allows for automatic testing of hearing is described. Hearing impairment is one of the fastest growing diseases of modern society. Therefore it is very important to organize mass screening tests to identify people suffering from this kind of impairment. The described application provides a test that uses automatic questionnaire analysis, standardized audiometric tone test procedures, and assessment of speech intelligibility in noise. When all the testing is completed, the system automatically analyzes the results for each person examined. Based on the number of incorrect answers, the decision is made automatically by the expert system. Persons whose hearing impairment is confirmed are referred to treatment in rehabilitation centers. All these centers are connected via the Internet and are provided with special distributed database access allowing them to automatically register and track the patient discovered during the remote screening.

Streszczenie Celem referatu jest prezentacja systemu przesiewowego badania słuchu w oparciu o Internet. Wady słuchu stanowią jedną z najszybciej postępujących chorób we współczesnym społeczeństwie. W tym kontekście ważne staje się umożliwienie przeprowadzania masowych testów wykrywających ubytki słuchu. Przedstawiona aplikacja zawiera audiometryczny test tonalny, test ilustrowany dla dzieci oraz test rozumienia mowy w szumie. Po zakończeniu testów system automatycznie analizuje wyniki dla każdej badanej osoby. Osoby z wykrytą wadą słuchu kierowane są do specjalistycznych centrów rehabilitacyjnych na dalsze badania.

Entry No. 155

Entry type conference paper

Authors A. Czyżewski, P. Maziewski, M. Dziubiński, A. Kaczmarek, M. Kulesza, A. Ciarkowski

English title Methods for Detection and Removal of Parasitic Frequency Modulation in Audio Recordings

Polish title Metody detekcji i usuwania pasożytniczej modulacji częstotliwości w nagraniach fonicznych

Conference AES 26th International Conference

Preprint

Number 5

Volume 53

Pages 440

Conference site Denver, USA

Conference date 7.7.2005- 9.7.2005

Notes abstrakt dostępy w JAES

Abstract Several methods devoted to wow defect evaluation and compensation are discussed in the paper. The presented algorithms utilize time- and spectrally-based audio processing routines. The newly proposed time-domain algorithm takes an advantage of an autocorrelation analysis of short-term pitch variations. The spectrally based methods employ: spectral peak picking techniques, analysis of spectral center of gravity, and some routines for tracking high-frequency bias in magnetic recordings. Additionally, an algorithm utilizing AR-modeling of the signal was employed to track residual hum recorded on some archive media. Several interpolation methods for wow defect reduction were studied and compared. The researched algorithms were tested employing some archival audio samples allowing their effectiveness evaluation.

Streszczenie W referacie przedstawiono metody ewaluacji i redukcji pasożytniczego kołysania dźwięku. Prezentowane algorytmy do określenia charakterystyki zniekształcenia działają zarówno w dziedzinie widma jak i w dziedzinie czasu. Nowatorski algorytm przetwarzania sygnału w dziedzinie czasu bazuje na analizie funkcji autokorelacji. W skład opisanych metod widmowych wchodzą: przetwarzanie tonalnych komponentów dźwięku, badanie środka ciężkości widma, śledzenie wysokoczęstotliwościowego prądu podkładu. Dodatkową metodą jest algorytm śledzenia zmian przydźwięku sieciowego wykorzystujący model AR widma sygnału. Do redukcji zniekształcenia wykorzystano różne metody interpolacji. Działanie opisanych algorytmów zbadano na autentycznych archiwalnych próbkach dźwiękowych.

Entry No. 156

Entry type conference paper

Authors P. Odya, A. Czyżewski

English title Application of hybrid signals processors to speech and hearing aids

Polish title Zastosowanie hybrydowych procesorów sygnałowych w protezach mowy i słuchu

Conference VIII Międzynarodowa Konferencja "Postępy w diagnostyce i leczeniu zaburzeń słuchu"

Preprint

Number

Volume

Pages 41

Conference site Kajetany, Polska

Conference date 19.5.2005- 21.5.2005

Notes Abstrakt: Audiofonologia, 2005 - Suplement

Abstract Owing to recent progress in Digital Signal Processor (DSP) developments it has been possible to build subminiature speech and hearing aids. Furthermore, despite the small dimensions, these devices can execute complex algorithms and can be easily reprogrammed. The paper placed emphasis on issues related to the design and implementation of algorithms applicable to speech and hearing aids. For example, frequency shifting or delaying the audio signal are often used for speech fluency improvement. In turn, the speech spectrum compression process can help the hearing impaired person. Some additional algorithms which can improve the quality of the audio signal are described. The basic information about the developed device is also presented in the paper.

Streszczenie Dzięki postępowi w technice Cyfrowych Procesorów Sygnałowych (ang. DSP) stało się możliwe budowanie miniaturowych protez słuchu i mowy. Mimo niewielkich wymiarów procesory te są w stanie wykonywać złożone algorytmy. Ich dodatkową zaletą jest łatwość zmiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. W pracy skupiono się na zagadnieniach związanych z projektowanie i implementacją algorytmów mających zastosowanie w protezach słuchu i mowy. Przykładowo opóźnienie sygnału mowy bądź jego przesunięcie na skali częstotliwości często powoduje wzrost płynności mowy u osób jąkających się. Widmowa kompresja widma może zaś pomóc osobom korzystającym z implantów ślimakowych. W pracy zawarto także opisy dodatkowych algorytmów zwiększających jakość przetwarzanych sygnałów dźwiękowych oraz informacje na temat stworzonej protezy.

Entry No. 157

Entry type journal paper

Authors A. Czyżewski, G. Szwoch, P. Żwan

English title COPSIMO - new technologies for multimedia distribution

Polish title Projekt COPSIMO - nowe techniki dystrybucji multimediów

Journal Przegląd Telekomunikacyjny

Volume

Number 8-9

Pages 306 - 308

Streszczenie Przedstawiono założenia, realizowanego z udziałem Katedry Systemów Multimedialnych, projektu europejskiego COPSIMO, którego celem jest opracowanie sieci typu peer-to-peer, umożliwiającej wymianę nagrań multimedialnych na terenie krajów Unii Europejskiej. Sieć ta będzie zbudowana z wykorzystaniem architektury niewykorzystującej serwerów centralnych i wyposażona w mechanizmy zabezpieczające prawa autorskie twórców.

Entry No. 158

Entry type conference paper

Authors A. Czyżewski, B. Kostek, K. Kochanek, M. Kulesza, P. Dalka, P. Suchomski

English title Hearing Aid Operating in Acoustical Free Field

Polish title Aparat słuchowy działający w polu swobodnym

Conference XLII Krajowy Zjazd Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 7.6.2006- 10.6.2006

Notes plakat

Abstract It is well known that language development through home intervention for hearing-impaired infant should start in the early months of a newborn baby. In the poster, a concept of a contactless digital hearing aid designed for infants is presented. In contrast to the typical wearable hearing aid solutions (ITC, ITE, BTE) the device proposed is mounted in the infant's bed. Any part of the hearing aid set-up contacts the infant's body. Processed speech signal is emitted by the low-power loudspeakers placed near the infant's head. The hearing aid architecture employs a digital signal processor based on Texas Instruments technology. Since one of the main problems is related to acoustic feedback between microphone and loudspeakers, therefore methods for acoustic feedback elimination are implemented in the hearing aid.

Streszczenie Jest wiadome, że korekcję wady słuchu nowo narodzonego dziecka należy rozpocząć jak najwcześniej w celu umożliwienia prawidłowego rozwoju ośrodka mowy. Plakat ten prezentuje koncepcję bezkontaktowego aparatu słuchowego przeznaczonego dla niemowląt. W przeciwieństwie do typowych rozwiązań montowanych za uchem bądź w kanale usznym dziecka (ITC, ITE, BTE) prezentowane urządzenie jest montowane w łóżeczku dziecka i żadna jego część nie styka się z jego ciałem. Przetworzony sygnał mowy jest emitowany przez miniaturowe głośniki umieszczone w pobliżu główki dziecka. W konstrukcji aparatu wykorzystano cyfrowy procesor sygnałowy firmy Texas Instruments. Ponieważ jednym z głównych problemów związanych z aparatami słuchowymi jest występowanie sprzężenia zwrotnego głośnikami i mikrofonem, aparat wykorzystuje również algorytmy eliminacji pasożytniczych sprzężeń akustycznych

Entry No. 159

Entry type conference paper

Authors G. Szwoch, M. Kulesza, A. Czyżewski

English title Transient detection algorithms for speech coding applications

Polish title Algorytmy detekcji transjentów do zastosowań w kodowaniu mowy

Conference 4th Joint Meeting of the Acoustical Society of America

Preprint 3pSP37

Number

Volume

Pages

Conference site Honolulu, HI, USA

Conference date 28.12.2006- 2.12.2006

Abstract We proposed a new speech codec architecture for Voice over IP applications that enhances subjective signal quality. This poster presents method for extraction of transients from the speech signal.

Streszczenie Zaproponowano nową architekturę kodeka do zastosowań w telefonii internetowej. W plakacie zaprezentowano metodę ekstrakcji transjentów w sygnale mowy. Słowa kluczowe: kodeki mowy; ekstrakcja transjentów; telefonia internetowa

Entry No. 160

Entry type book

Authors A. Czyżewski, B. Kostek, H. Skarżyński

English title IT Applications for the Remote Testing of Hearing

Polish title Aplikacje technologii informacyjnych w badaniu słuchu przez Internet

Editor Springer Verlag

Pages 225 - 247

Notes rozdział w książce zagranicznej w Information Technology Soultions for Healthcare; K. Zielinski, M. Duplaga, D. Ingram, Eds.

Abstract Telemedicine can play an important role in diagnosing and treating hearing losses. This fact is associated, among others, with the methodology of audiometric measurements and with supporting hearing through hearing aids and cochlear implants. Current problems related to treating hearing impairments and total deafness pose a distinct challenge for science, which must provide ever more effective methods for application in diagnostics and audiology as well as otolaryngology practice. Advances in teleinformatics as well as its wide employment in recent years have opened new possibilities for conducting mass screening of hearing, tinnitus (ear noises), speech and vision. Diagnostic and recovery systems associated with the interactive medical portal Telezdrowie (www.telewalfare.com) designed by the institutions mentioned in the header of this paper serve as an example of how simple diagnostic methods employed in screening tests can be mass-deployed thanks to teleinformatics, this defining a new diagnostic of communication senses.

Streszczenie Telemedycyna odgrywa coraz wiekszą rolę w diagnostyce i leczeniu osób z ubytkami słuchu. Jest to związane m.in. ze specyfiką badań audiometrycznych. Postęp technologiczny w dziedzinie aparatów słuchowych i implantów ślimakowych wymusza nowe metody diagnozy w audiologii, jak również w praktyce otolaryngologicznej. Serwis "Telezdrowie", w którym zaimplementowano liczne testy przesiewowe jest przykładem prowadzenia diagnostyki w zakresie zmysłów komunikacji na odległość. Słowa kluczowe: telemedycyna, portal medyczny, implanty ślimakowe, aparaty słuchowe, dopasowanie protez słuchu

Entry No. 161

Entry type conference paper

Authors A. Czyżewski, P. Odya, B. Kostek

English title New generation aids for laryngectomy patients

Polish title Pomoce nowej generacji dla pacjentów po laryngektomii

Conference 4th Joint Meeting of the Acoustical Society of America

Preprint 5aSCa26

Number

Volume

Pages

Conference site Honolulu, HI, USA

Conference date 28.11.2006- 2.12.2006

Notes wyd. Journal Acoust. Soc. Am., 120 (5), Pt. 2, Nov. 2006, 3351

Abstract The aim of this project is to help laryngectomees. There are two different approaches to solve this task. The first one focuses on the artificial larynx. Some major improvements in the construction of the device might be easily introduced. First of all, digital signal processing should result in decreasing unwanted noise. The artificial larynx engineered is equipped with digital processor and amplifier. The spectral subtraction algorithm of noise reduction is used. The second approach uses PDA to generate speech.

Streszczenie Celem prezentowanego projektu było opracowanie elektronicznych pomocy dla osób po laryngektomii. Zastosowano wiele usprawnień, które wykorzytują cyfrowy procesor sygnałowy wbudowany w urządzenie. Usprawnienia dotyczą tłumienia niepożądanych zakłoceń i eliminacji pasożytniczych akustycznych sprzężeń zwrotnych. Kolejne opracowanie obejmuje syntetyzer mowy oparty na komputerze klasy PDA. Słowa kluczowe: laryngektomia; bezgłos; protezy mowy; seynetza mowy; sztuczna krtań

Entry No. 162

Entry type journal paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski

English title Improving signal quality of speech codecs using perceptual coding

Polish title Poprawa jakości sygnału w kodekach mowy za pomocą kodowania perceptualnego

Journal Zeszyty Naukowe Wydziału ETI PG

Volume

Number 10

Pages 399 - 406

Abstract Speech coding algorithm which aiming at better subjective signal quality that is provided by currently used speech codecs, was described in the paper. A higher signal quality may be achieved by discerning transient states, voiced and unvoiced components of a speech signal and encoding the signal using different approach for each component type. Unvoiced signal components are encoded using standard parametric coding algorithm, while for voiced parts of the signal, a perceptual coding algorithm is applied. Subjective quality of the signal encoded using the proposed algorithm was compared to signal quality achieved by standard speech codecs.

Streszczenie W komunikacie opisano algorytm kodowania sygnału mowy, którego celem jest uzyskanie wyższej oceny jakości zakodowanego sygnału niż w przypadku algorytmów stosowanych do tej pory. W tym celu wyodrębniane są stany transjentowe oraz fragmenty dźwięczne i bezdźwięczne sygnału. Fragmenty te są następnie kodowane w odmienny sposób: składowe bezdźwięczne są kodowane tradycyjną metoda parametryczną, natomiast do składowych dźwięcznych wykorzystano algorytm kodowania perceptualnego. Jakość sygnału mowy kodowanego zgodnie z proponowaną metodą porównano z jakością możliwą do uzyskania w przypadku powszechnie stosowanych obecnie kodeków mowy.

Entry No. 163

Entry type conference paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski

English title High quality speech coding using combined parametric and perceptual modules

Polish title Kodowanie sygnału mowy z zachowaniem wysokiej jakości przy wykorzystaniu modułu parametrycznego i perceptualnego

Conference 13th World Enformatika Conference

Preprint

Number

Volume 13

Pages 244 - 249

Conference site Budapeszt, Węgry

Conference date 26.5.2006- 28.5.2006

Abstract A novel approach to speech coding using the hybrid architecture is presented. Advantages of parametric and perceptual coding methods are utilized together in order to create a speech coding algorithm assuring better signal quality than in traditional CELP parametric codec. Two approaches are discussed. One is based on selection of voiced signal components that are encoded using parametric algorithm, unvoiced components that are encoded perceptually and transients that remain unencoded. The second approach uses perceptual encoding of the residual signal in CELP codec. The algorithm applied for precise transient selection is described. Signal quality achieved using the proposed hybrid codec is compared to quality of some standard speech codecs.

Streszczenie W komunikacie zaprezentowano nową metodę hybrydowego kodowania sygnału mowy. Techniki kodowania parametrycznego oraz perceptualnego zostały wykorzystane w celu zapewnienia wysokiej jakości kodowania sygnału mowy. Przedstawiono wyniki badań dla dwóch architektur kodeka. Jedna z nich bazuje na algorytmie pozwalajacym wyodrębnić składowe dźwięczne, bezdźwięczne oraz transjenty. Składowe dźwięczne kodowane są metodą perceptualną, bezdźwięczne metodą parametryczną, zaś transjenty przesyłane są do dekodera w formacie bezstratnym. W drugim z proponowanych algorytmów część dźwięczna sygnału rezydualnego kodeka CELP poddawana jest dodatkowemu kodowaniu perceptualnemu. Jakość sygnału mowy uzyskana dzięki dwom zaproponowanym metodą jest porównana z jakością uzyskiwaną z wykorzystaniem standardowych metod kodowania sygnału mowy.

Entry No. 164

Entry type conference paper

Authors A. Ciarkowski, P. Żwan, G. Szwoch, A. Czyżewski

English title Mutlimedia Mobile Services for the Semantic Web

Polish title Multimedialne usługi mobilne dla sieci semantycznej

Conference Technologie Informacyjne 2006

Preprint

Number

Volume 10

Pages 389 - 398

Conference site Gdańsk, Polska

Conference date 21.5.2006- 24.5.2006

Abstract This document describes the methodology of creating semantically-enriched multimedia mobile services using tools and service enablers provided by the DeSyME project. A brief introduction to the Semantic Web is presented along with the explanation of its relation to the subject of Web Services. Next, the description of the DeSyME Framework is included. Finally, examples of multimedia mobile services developed at Gdańsk University of Technology are presented to illustrate possible utilities of described technologies.

Streszczenie Dokument przedstawia metodologię tworzenia semantycznie-rozszerzonych multimedialnych usług mobilnych z wykorzystaniem narzędzi i ułatwień oferowanych przez projekt DESYME. Zaprezentowano zwięzły wstęp do tematyki Sieci Semantycznej wraz z wyjaśnieniem jej związku z zagadnieniami Web Services. Następnie przedstawiono opis projektu DESYME. Przedstawiono również przykładowe usługi multimedialne, które są opracowywane w Katedrze Systemów Multimedialnych WETI PG, jako ilustrację możliwych zastosowań opisywanych technologii.

Entry No. 165

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya

English title Digital Hearing Aid with Time and Spectral Transposition

Polish title Cyfrowy aparat słuchowy z transpozycją czasową i widmową dźwięku

Conference XLII Krajowy Zjazd Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 7.6.2006- 10.6.2006

Notes plakat

Abstract Recent screening hearing tests, which have been carried out in Poland, showed that many people suffer from hearing loss. Worse still, typical hearing aids are not able to help some particular groups of patients, e.g. newborn infants, people working in a noisy environment, aircraft pilots or patients with cochlear implants. Taking advantage of the recent progress in Digital Signal Processor (DSP) developments, a portable and reprogrammable digital hearing aid can be easily designed. Furthermore, taking into account the-state-of-the-art in the research in the digital signal processing domain, it is possible to produce in Poland a sophisticated hearing aid in which complex algorithms of signal processing will be implemented. Owing to hybrid signal processors, it was possible to implement algorithms of spectral and time transposition. The first method is designed for persons with corner-audiograms, who may retain residual hearing in a low frequency band. The latter method may be used in case of patients with time resolution problem. The aim of this poster is to present information about the hearing aid developed, algorithms engineered and to comment preliminary experiment results.

Streszczenie Następstwem uruchomienia w Polsce, prowadzonych na szeroką skalę, badań przesiewowych słuchu jest konieczność zaoferowania pomocy osobom cierpiącym na niedosłuch poprzez leczenie i protetykę słuchu. Tymczasem, aktualnie oferowane rozwiązania aparatów słuchowych nie są w stanie sprostać niektórym specjalistycznym potrzebom aparatowania, m. in.: najmłodszych dzieci, osób pracujących w hałasie, pilotów wojskowych oraz osób korzystających z implantów ślimakowych, u których dzięki zastosowaniu odpowiedniej techniki mikrochirurgicznej, zachowane zostały resztki słuchowe, dające możliwość dodatkowego wykorzystania stymulacji akustycznej i in.. Likwidacja barier importowych w dziedzinie technologii mikroelektronicznej umożliwiła nielimitowany dostęp do tej technologii w naszym kraju, co stwarza techniczną możliwość opracowywania rodzimej konstrukcji cyfrowych aparatów słuchowych o wysokim stopniu nowoczesności i miniaturyzacji. Stan rozwoju krajowych badań naukowych z dziedziny cyfrowego przetwarzania sygnałów akustycznych jest na tyle zaawansowany, że praktycznie nie istnieją bariery, które w ograniczałyby od strony technicznej możliwości opracowywania i wdrażania rodzimych konstrukcji cyfrowych aparatów słuchowych. Dostęp do tej technologii zminiaturyzowanych procesorów cyfrowych gwarantuje realizację podstawowego zadania, jakim jest opracowanie eksperymentalnego modelu wewnątrzusznego aparatu cyfrowego wraz z systemem jego dopasowania do potrzeb pacjenta przy wykorzystaniu oprogramowania komputerowego. Przedmiotem prezentacji jest dokonane opracowanie i wstępne próby kliniczne algorytmów cyfrowego przetwarzania sygnałów fonicznych do zastosowań w specjalnych protezach słuchu, takich jak: transpozycja widmowa i transpozycja czasowa dźwięku. Pierwszy rodzaj transpozycji jest przydatny dla pacjentów zachowujących resztkową czułość słuchu w postaci audiogramu narożnego, zaś transpozycja skali czasu jest pomocna pacjentom o pogorszonej rozdzielczości czasowej słuchu.

Entry No. 166

Entry type conference paper

Authors A. Czyżewski, J. Kozłowski, M. Kulesza, P. Odya, A. Szkiełkowska

English title New Digital Aids for Patients after Laryngectomy

Polish title Nowe pomoce elektroniczne dla osób po laryngektomii

Conference XLII Krajowy Zjazd Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 7.6.2006- 10.6.2006

Notes plakat

Abstract The laryngectomy is a standard therapy for patiens with advanced laryngeal cancer. The operation consist in surgical removal of the larynx. As a result, patients lose ability to produce their own voice. Some laryngectomees use so-called esophageal or tracheosophageal speech to communicate with other people. Both methods require an arduous training and, furthermore, only a minority of laryngectomees are able to learn these kind of speech production. The rest of the patients have to use an electronic device, known as the artificial larynx. The artificial larynx has many disadvantages. The produced speech is monotonous and very artificial. In addition, intelligibility is poor. The major problem is a background noise, which is caused by the device. In fact, the artificial larynx is only a simple vibrator, a construction of which has been almost unchanged since 1950s. The aim of this project is to help laryngectomees. There are two different approaches to solve this task. The first one focuses on the artificial larynx. Some major improvements in the construction of the device might be easily introduced. First of all, digital signal processing should result in decreasing unwanted noise. The artificial larynx engineered will be equipped with digital processor and amplifier. The spectral subtraction algorithm of noise reduction will be used. The second approach uses PDA to generate speech.

Streszczenie Powrót do prawidłowej komunikacji z otoczeniem pacjentów po laryngektomii jest możliwy poprzez wykształcenie zastępczej mowy przełykowej lub gardłowej a w pozostałych przypadkach, kiedy się to nie udaje, poprzez zastosowanie elektronicznych protez (wibratorów szyjnych) wprowadzających w drgania tkanki dna jamy ustnej i szyi. Nawet, gdy dochodzi do wykształcenia mowy zastępczej, jakość mowy artykułowanej przełykowo jak i artykułowanej z zastosowaniem wibratorów elektromechanicznych na ogół znacznie odbiega od oczekiwań. Dostępne na rynku wibratory wykonane są według przestarzałych nieaktualnych koncepcji, z zastosowaniem technologii. Wynikiem tego są duże rozmiary sztucznych krtani i zła jakość tworzonego głosu. Realizowany projekt zakłada skonstruowanie aparatu z zastosowaniem najnowszych komponentów i technik komputerowych, co spowoduje miniaturyzację urządzenia a ponadto poprawi w sposób zasadniczy jakość wytwarzanego głosu. Założono dwa sposoby tworzenia głosu – bezpośredni i pośredni. Pierwszy wykorzystuje małe urządzenie wibracyjne do wzbudzania głosu oraz specjalnie skonstruowany zminiaturyzowany wzmacniacz z cyfrowym przetwarzaniem mowy w jego torze. Zastosowanie cyfrowego przetwarzania mowy umożliwi znaczącą poprawę jej jakości poprzez odfiltrowanie niepożądanych zakłóceń pochodzących od wibracji, szumów i świstów oraz chrypienia. Pomocniczy sposób tworzenia głosu będzie wykorzystywał komputery typu palmtop, dla których przygotowano program syntezy mowy. Poprawę diagnostyki przed i pooperacyjnej oraz procesu rehabilitacji pacjentów, umożliwia opracowane oprogramowania do komputerowej analizy mowy. W konsekwencji opracowywane pomoce elektroniczne umożliwią chorym szybszą rehabilitację i naturalną komunikację werbalną z otoczeniem. Projekt jest realizowany przez d zespół specjalistów z Politechniki Gdańskiej we współpracy z Instytutem Fizjologii i Patologii Słuchu oraz Akademią Medyczną w Gdańsku.

Entry No. 167

Entry type conference paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski, Ł. Litwic

English title Transient detection algorithms for hybrid speech codec

Polish title Metody detekcji transjentów do zastosowań w hybrydowym kodeku mowy

Conference XII Krajowe Sympozjum Telekomunikacji i Teleinformatyki (KSTiT)

Preprint

Number

Volume

Pages

Conference site Bydgoszcz, Polska

Conference date 13.9.2006- 14.9.2006

Abstract The report presents the architecture of the proposed speech codec for VoIP applications. This codec uses a hybrid approach: both parametric and perceptual coding methods were used for encoding unvoiced and voiced parts of speech signal, respectively. Transient are processed separately. Several simple transient detection algorithms for speech signal are assessed for their application in speech codec. The aim of the proposed codec is to improve subjective signal quality in comparison with currently used codecs.

Streszczenie W pracy przedstawiono architekturę proponowanego kodeka mowy do zastosowań np. w telefonii VoIP. Kodek ten charakteryzuje się hybrydowym podejściem do kodowania mowy – zastosowane zostało zarówno kodowanie parametryczne (do bezdźwięcznych fragmentów sygnału), jak i perceptualne (do fragmentów bezdźwięcznych). Ponadto osobno traktowane są transjenty w sygnale. Przebadano kilka prostych algorytmów detekcji transjentów w sygnale mowy i oceniono możliwość ich zastosowania w kodeku mowy. Celem opisywanego kodeka hybrydowego jest poprawa jakości zakodowanego sygnału mowy w porównaniu do stosowanych obecnie kodeków parametrycznych.

Entry No. 168

Entry type conference paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski

English title Improving signal quality in speech codec using hybrid perceptual-parametric algorithm

Polish title Poprawa jakości sygnału w kodekach mowy przy użyciu hybrydowego, parametryczno-perceptualnego algorytmu kodowania

Conference 5th International Conference on Multimedia & Network Information Systems (MISSI 2006)

Preprint

Number

Volume

Pages 181 - 192

Conference site Wrocław, Polska

Conference date 21.9.2006- 22.9.2006

Notes język publikacji: angielski

Abstract A hybrid parametric-perceptual speech codec architecture is presented. The basic CELP parametric codec structure is enhanced using the perceptual coding method. The aim of the codec hybridization is obtaining significant improvement in perceived signal quality. Two hybrid architectures are proposed. The first one encodes perceptually voiced parts in the CELP residual signal. The second one divides the signal into voiced signal components that are encoded using the perceptual algorithm, unvoiced components that are encoded parametrically and transients that remain unencoded. Signal quality achieved using the hybrid codec is compared to the quality of some standard speech codecs.

Streszczenie Przedstawiono hybrydową, parametryczno-perceptualną architekturę kodeka. Podstawowa struktura kodeka parametrycznego CELP została wzbogacona o kodowanie perceptualne. Celem hybrydyzacji kodeka jest uzyskanie znaczącej poprawy subiektywnej jakości zdekodowanego sygnału. Zaproponowano dwie hybrydowe struktury. Pierwsza polega na perceptualnym kodowaniu dźwięcznych elementów sygnału rezydualnego kodeka CELP. Druga metoda dzieli sygnał mowy na części dźwięczne (kodowane perceptualnie), bezdźwięczne (kodowane parametrycznie) i transjenty (obecnie nie kodowane). Jakość sygnału uzyskanego przy pomocy hybrydowego kodeka jest porównana z jakością uzyskiwaną w standardowych kodekach mowy.

Entry No. 169

Entry type journal paper

Authors A. Czyżewski, B. Kostek, P. Maziewski, Ł. Litwic

English title Accidental Wow Defect Evaluation Using Sinusoidal Analysis Enhanced by Artificial Neural Networks

Polish title Wyznaczanie przebiegu przypadkowych zniekształcenia kołysania przy wykorzystaniu analizy sinusoidalnej i sztucznych sieci neuronowych

Journal Lecture Notes in computer Science: Rough Set and Knowledge Technology

Volume 4062/2006

Number

Pages 389 - 395

Abstract A method for evaluation of parasitic frequency modulation (wow) in archival audio is presented. The proposed approach utilizes sinusoidal components tracking as their variations correspond with the wow defect. The sinusoidal modeling procedures are used to extract the tonal components from severely distorted and significantly modulated audio signals. A prediction module based on neural networks is proposed to improve the tonal components tracking.

Streszczenie Artykuł przedstawia metodę do wyznaczania charakterystyki pasożytniczych modulacji częstotliwości (kołysanie) obecnych w archiwalnych nagraniach dźwiękowych. Prezentowane podejście wykorzystuje śledzenie zmian sinusoidalnych komponentów dźwięku które odzwierciedlają przebieg kołysania. Analiza sinusoidalna wykorzystana jest do ekstrakcji składowych tonalnych ze zniekształconych nagrań dźwiękowych. Dodatkowo, w celu zwiększenia skuteczności śledzenia, wykorzystano predykator działający na bazie sieci neuronowej.

Entry No. 170

Entry type conference paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski

English title A Hybrid Speech Codec Employing Parametric and Perceptual Coding Techniques

Polish title Hybrydowy kodek sygnału mowy wykorzystujący kodowanie parametryczne i perceptualne

Conference 121st Audio Engineering Society Convention

Preprint 6956

Number

Volume

Pages 1 - 12

Conference site San Francisco, USA

Conference date 5.10.2006- 8.10.2006

Abstract A hybrid speech codec for VoIP telephony applications is presented employing combined parametric and perceptual coding techniques. The signal is divided into voiced signal components that are encoded using the perceptual algorithm, unvoiced components that are encoded parametrically and transients that are not encoded with a lossy method. The codec architecture where voiced part of the CELP residual signal is perceptually encoded and transmitted to the decoder along with the CELP main bit stream is also examined. Various methods for transient detection in the speech signal are discussed. The results of experiments revealing the improved subjective quality of the transmitted speech are also presented.

Streszczenie W referacie przedstawiono hybrydowy kodek mowy dla zastosowan w komunikacji VoIP wykorzystujący kodowanie parametryczne i percetualne. Sygnał mowy jest dzielony na składowe dźwięczne, które podlegają kodowania perceptualnemu, składowe bezdźwięczne, które kodowane są metodą parametryczną oraz transjenty, które nie są kodowane żadną stratną metodą. Dodatkowo przedstawiono architekturę kodeka, w której perceptualnie kodowana i przesyłana do dekodera jest dźwięczna część sygnału rezydualnego kodeka CELP. Przedstawiono kilka algorytmów detekcji transjentów w sygnale mowy. Wyniki ekspermentów wskazują, iż obie zaproponowane metody pozwalają na kodowanie sygnału mowy z wysoką jakością.

Entry No. 171

Entry type conference paper

Authors A. Czyżewski, J. Kozłowski, M. Kulesza, P. Odya, A. Szkiełkowska

English title New digital aids for pateints after laryngectomy

Polish title Nowe pomoce elektroniczne dla osób po laryngektomii

Conference I Konferencja Audiologiczno-Foniatryczna

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 10.9.2006- 12.9.2006

Notes Abstrakt w Audiofonologii, Suplement, 2006, str. 12

Abstract The laryngectomy is a standard therapy for patiens with advanced laryngeal cancer. The operation consist in surgical removal of the larynx. As a result, patients lose ability to produce their own voice. The aim of this project is to help laryngectomees. There are two different approaches to solve this task. The first one focuses on the artificial larynx. Some major improvements in the construction of the device might be easily introduced. First of all, digital signal processing should result in decreasing unwanted noise. The artificial larynx engineered will be equipped with digital processor and amplifier. The spectral subtraction algorithm of noise reduction will be used. The second approach uses PDA to generate speech.

Streszczenie Powrót do prawidłowej komunikacji z otoczeniem pacjentów po laryngektomii jest możliwy poprzez wykształcenie zastępczej mowy przełykowej lub gardłowej a w pozostałych przypadkach, kiedy się to nie udaje, poprzez zastosowanie elektronicznych protez (wibratorów szyjnych) wprowadzających w drgania tkanki dna jamy ustnej i szyi. Nawet, gdy dochodzi do wykształcenia mowy zastępczej, jakość mowy artykułowanej przełykowo jak i artykułowanej z zastosowaniem wibratorów elektromechanicznych na ogół znacznie odbiega od oczekiwań. Dostępne na rynku wibratory wykonane są według przestarzałych nieaktualnych koncepcji, z zastosowaniem technologii. Wynikiem tego są duże rozmiary sztucznych krtani i zła jakość tworzonego głosu. Realizowany projekt zakłada skonstruowanie aparatu z zastosowaniem najnowszych komponentów i technik komputerowych, co spowoduje miniaturyzację urządzenia a ponadto poprawi w sposób zasadniczy jakość wytwarzanego głosu. Założono dwa sposoby tworzenia głosu – bezpośredni i pośredni. Pierwszy wykorzystuje małe urządzenie wibracyjne do wzbudzania głosu oraz specjalnie skonstruowany zminiaturyzowany wzmacniacz z cyfrowym przetwarzaniem mowy w jego torze. Zastosowanie cyfrowego przetwarzania mowy umożliwi znaczącą poprawę jej jakości poprzez odfiltrowanie niepożądanych zakłóceń pochodzących od wibracji, szumów i świstów oraz chrypienia. Pomocniczy sposób tworzenia głosu będzie wykorzystywał komputery typu palmtop, dla których przygotowano program syntezy mowy. Poprawę diagnostyki przed i pooperacyjnej oraz procesu rehabilitacji pacjentów, umożliwia opracowane oprogramowania do komputerowej analizy mowy. W konsekwencji opracowywane pomoce elektroniczne umożliwią chorym szybszą rehabilitację i naturalną komunikację werbalną z otoczeniem. Projekt jest realizowany przez zespół specjalistów z Politechniki Gdańskiej we współpracy z Instytutem Fizjologii i Patologii Słuchu oraz Akademią Medyczną w Gdańsku.

Entry No. 172

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya

English title Digital Hearing Aid with time and spectral transposition

Polish title Cyfrowy aparat słuchowy z transpozycją czasową i widmową

Conference I Konferencja Audiologiczno-Foniatryczna

Preprint

Number

Volume

Pages

Conference site

Conference date 10.9.2006- 12.9.2006

Notes Abstrakt w Audiofonologii, Suplement, 2006, str. 11

Abstract Recent screening hearing tests, which have been carried out in Poland, showed that many people suffer from hearing loss. Worse still, typical hearing aids are not able to help some particular groups of patients, e.g. newborn infants, people working in a noisy environment, aircraft pilots or patients with cochlear implants. Taking advantage of the recent progress in Digital Signal Processor (DSP) developments, a portable and reprogrammable digital hearing aid can be easily designed. Furthermore, taking into account the-state-of-the-art in the research in the digital signal processing domain, it is possible to produce in Poland a sophisticated hearing aid in which complex algorithms of signal processing will be implemented. Owing to hybrid signal processors, it was possible to implement algorithms of spectral and time transposition. The first method is designed for persons with corner-audiograms, who may retain residual hearing in a low frequency band. The latter method may be used in case of patients with time resolution problem. The aim of this poster is to present information about the hearing aid developed, algorithms engineered and to comment preliminary experiment results.

Streszczenie Następstwem uruchomienia w Polsce, prowadzonych na szeroką skalę, badań przesiewowych słuchu jest konieczność zaoferowania pomocy osobom cierpiącym na niedosłuch poprzez leczenie i protetykę słuchu. Tymczasem, aktualnie oferowane rozwiązania aparatów słuchowych nie są w stanie sprostać niektórym specjalistycznym potrzebom aparatowania, m. in.: najmłodszych dzieci, osób pracujących w hałasie, pilotów wojskowych oraz osób korzystających z implantów ślimakowych, u których dzięki zastosowaniu odpowiedniej techniki mikrochirurgicznej, zachowane zostały resztki słuchowe, dające możliwość dodatkowego wykorzystania stymulacji akustycznej i in.. Likwidacja barier importowych w dziedzinie technologii mikroelektronicznej umożliwiła nielimitowany dostęp do tej technologii w naszym kraju, co stwarza techniczną możliwość opracowywania rodzimej konstrukcji cyfrowych aparatów słuchowych o wysokim stopniu nowoczesności i miniaturyzacji. Stan rozwoju krajowych badań naukowych z dziedziny cyfrowego przetwarzania sygnałów akustycznych jest na tyle zaawansowany, że praktycznie nie istnieją bariery, które w ograniczałyby od strony technicznej możliwości opracowywania i wdrażania rodzimych konstrukcji cyfrowych aparatów słuchowych. Dostęp do tej technologii zminiaturyzowanych procesorów cyfrowych gwarantuje realizację podstawowego zadania, jakim jest opracowanie eksperymentalnego modelu wewnątrzusznego aparatu cyfrowego wraz z systemem jego dopasowania do potrzeb pacjenta przy wykorzystaniu oprogramowania komputerowego. Przedmiotem prezentacji jest dokonane opracowanie i wstępne próby kliniczne algorytmów cyfrowego przetwarzania sygnałów fonicznych do zastosowań w specjalnych protezach słuchu, takich jak: transpozycja widmowa i transpozycja czasowa dźwięku. Pierwszy rodzaj transpozycji jest przydatny dla pacjentów zachowujących resztkową czułość słuchu w postaci audiogramu narożnego, zaś transpozycja skali czasu jest pomocna pacjentom o pogorszonej rozdzielczości czasowej słuchu.

Entry No. 173

Entry type journal paper

Authors P. Dalka, B. Kostek, A. Czyżewski

English title Vowel Recognition Based On Acoustic And Visual Features

Polish title Rozpoznawanie samogłosek bazujące na parametrach akustycznych i wizualnych

Journal Archives of Acoustics

Volume 31

Number 3

Pages 275 - 288

Abstract The aim of the research work presented is to show a system that may facilitate speech training for hearing impaired people. The system engineered combines both acoustic and visual vowel data acquisition and analysis modules. The acoustic feature extraction involves mel-cepstral analysis. The Active Shape Model method is used for extracting visual speech features from the shape and movement of the lips. Artificial Neural Networks (ANNs) are utilized as the classifier, feature vectors extracted combine both modalities of the human speech. The system is validated with the recordings of speakers that were not used for the lip model creating and for the ANN training. Additional experiments with the degraded acoustic information are carried out in order to test the system robustness against various distortions affecting speech utterances.

Streszczenie W artykule zaprezentowano metodę, która może ułatwić naukę mowy dla osób z wadami słuchu. Opracowany system rozpoznawania samogłosek wykorzystuje łączną analizę parametrów akustycznych i wizualnych sygnału mowy. Parametry akustyczne bazują na współczynnikach mel-cepstralnych. Do wyznaczenia parametrów wizualnych z kształtu i ruchu ust zastosowano Active Shape Models. Jako klasyfikator użyto sztuczną sieć neuronową. Działanie systemu zostało przetestowane z wykorzystaniem nagrań mówców, które nie były wykorzystane ani do tworzenia modelu ust, ani do treningu sieci neuronowej. Dodatkowo zbadano wpływ zakłócania informacji akustycznej na uzyskiwane wyniki.

Entry No. 174

Entry type conference paper

Authors A. Czyżewski, B. Kostek, K. Kochanek, M. Kulesza, P. Suchomski

English title Hearing aid operating in acoustical free field

Polish title Aparat słuchowy działający w wolnym polu akustycznym

Conference I Konferencja Audilogiczno-Foniatryczna

Preprint

Number

Volume

Pages 11

Conference site Warszawa, Polska

Conference date 10.9.2006- 12.9.2006

Notes Audiofonologia-Suplement, str. 11

Streszczenie Aparatowanie bardzo małych dzieci (od 5 miesiąca życia) za pomocą standardowych protez słuchu natrafia na wiele trudności natury praktycznej. Dotyczy to procesu dopasowania aparatu słuchowego, czyli doboru jego ustawień stosownie do aktualnych charakterystyk ubytku słuchu dzieci. Tymczasem wczesne aparatowanie jest zagadnieniem o ogromnym zanczeniu dla rozwoju słuchu, mowy i ogólnej inteligencji dziecka. Referat prezentuje uzyskane wyniki praktycznych prób i eksperymentów w tym zakresie, które otwierają drogę do opracowania bezkontaktowego aparatu słuchowego.

Entry No. 175

Entry type journal paper

Authors M. Kulesza, B. Kostek, P. Dalka, A. Czyżewski

English title Contactless hearing aid for infants

Polish title Bezkontaktowy aparat słuchowy dla niemowląt

Journal Archives of Acoustics

Volume 31

Number 4

Pages 431 - 437

Abstract It is a well known fact that language development through home intervention for a hearing-impaired infant should start in the early months of a newborn baby's life. The aim of this paper is to present a concept of a contactless digital hearing aid designed especially for infants. In contrast to all typical wearable hearing aid solutions (ITC, ITE, BTE), the proposed device is mounted in the infant’s bed with any parts of its set-up contacting the infant’s body. A processed speech signal is emitted by low-power loudspeakers placed near the infant’s head. The hearing aid architecture employs a digital signal processor based on Texas Instruments technology. Since one of the main problems is the acoustic feedback between the microphone and the loudspeakers, the methods of its elimination are also briefly reviewed in this article. The first of the discussed methods employs an adaptive algorithm, the second alters the frequency response of the entire instrumentation through the use of notch filter banks, and the third incorporates a microphone array and beam-forming techniques. The paper also includes descriptions of some algorithmic solutions engineered by the authors in purpose to eliminate acoustic feedbacks. All the conclusions introduced in this article have been derived based on the simulations of an experimental contactless hearing aid set-up.

Streszczenie Powszechnie wiadomo, iż korekcja wad słuchu niemowląt powinna się rozpoczynać już w pierwszych miesiącach życia dziecka. Pozwala to uniknąć zaburzenia rozwoju mowy dziecka. W publikcji przedstawiono koncepcję cyfrowego, bezkontaktowego aparatu słuchowego dla niemowląt. W przeciwieństwie do typowych aparatów słuchowych (noszonych za uchem lub wewnątrz kanału słuchowego), proponowane urządzenie umieszczane jest w łóżeczku dziecka i żaden jego element nie styka się z jego ciałem. Cyforowo przetworzony sygnał mowy emitowany jest przez miniaturowe głośniki umieszczone w okolicach głowy dziecka. Aparat zbudowany został w oparciu o procesor sygnałowy Texas Instruments. Jako że jednym z głównych problemów w takiej konfiguracji są pasożytnicze sprzężenia zwrotne w artykule przedstawiono metody ich eliminacji. Pierwsza z omawianych metod wykorzystuje algorytm adaptacyjny, druga natomiast bank filtrówe wycinających. Trzecia metoda wykorzystuje matryce mikrofonów oraz algorytm filtracji przestrzennej. Artykuł zawiera również opis rozwiązań związanych z eliminacją sprzęzeń proponowanych przez autorów. Wnioski z badań wyciągnięto na podstawie przeprowadzonych eksperymentów.

Entry No. 176

Entry type journal paper

Authors M. Kulesza, G. Szwoch, Ł. Litwic, A. Czyżewski

English title High quality speech codec employing sines+noise+transients model

Polish title

Journal Archives of Acoustics

Volume 31

Number 3

Pages 356 - 356

Notes Streszczenie

Abstract A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and to preserve high quality of speech. Therefore, the psychoacoustic model devised for perceptual speech coding is presented. The experimental results reveal that method for tonality estimation employed in the psychoacoustic model has a significant impact on perceptual coding accuracy. Various methods for tonality estimation are presented and compared.

Streszczenie W artykule przedstawiono szerokopasmową metodę kodowania sygnału mowy wysokiej jakości z wykorzystaniem reprezentacji sygnału w postaci sumy składników sinusoidlanych, szumowych oraz transjentów. Poddano dyskusji potrzebę szerokopasmowego kodowania sygnału mowy jak również różne metody ekstrakcji oraz syntezy składowych sinusoidalnych, szumowych oraz transjentów. W celu redukcji wymagań co do przepływności strumienia bitowego kryterium psychoakustyczne zostało wykorzystane do kodowania apmplitud składowych sinusoidalnych. Z tego powodu w artykule zaprezentowane model psychoakustyczny dla sygnału mowy. Wyniki eksperymentów wykazały, iż indeks tonalności wyznaczacny w modelu ma istotny wpływ na proces kodowania perceptualnego. Z tego powodu różne metody wyznaczania indeksu tonalności zostały zaprezentowane i porównane między sobą.

Entry No. 177

Entry type conference paper

Authors A. Ciarkowski, P. Żwan, G. Szwoch, A. Czyżewski

English title Multimedia Mobile Services for the Semantic Web

Polish title

Conference 4th Conference on Information Technology

Preprint

Number

Volume 10

Pages 389 - 398

Conference site Gdańsk, Polska

Conference date 21.5.2006- 24.5.2006

Abstract This document describes the methodology of creating semantically-enriched multimedia mobile services using tools and service enablers provided by the DeSyME project. A brief introduction to the Semantic Web is presented along with the explanation of its relation to the subject of Web Services. Next, the description of the DeSyME Framework is included. Finally, examples of multimedia mobile services developed at Gdańsk University of Technology are presented to illustrate possible utilities of described technologies.

Entry No. 178

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Implementing Multimedia Mobile Services with DESYME Framework

Polish title

Conference 17th IEEE Conference on Personal, Indoor and Mobile Radio Communications

Preprint

Number

Volume

Pages

Conference site Helsinki, Finlandia

Conference date 11.9.2006- 14.9.2006

Notes Pozycja w druku

Abstract Methodology of creating multimedia mobile services enriched with semantical description OWL attachment using DeSyME project tools is described. Introduction to the Semantic Web is presented and its relation to the subject of Web Services is explained. Also included is the description of the DeSyME Framework and tools. Real-life examples of multimedia mobile services developed at Gdańsk University of Technology are showcased as an illustration of described technologies’ applications

Entry No. 179

Entry type book

Authors B. Kostek, P. Dalka, A. Czyżewski

English title Audiovisual Speech Recognition for Training Hearing Impaired Patients

Polish title Automatyczne rozpoznawanie mowy na potrzeby treningu osób z wadami słuchu

Editor World Scientific

Pages 335 - 347

Notes rozdział w książce zagranicznej Mathematical Methods in Scattering Theory And Biomedical Engineering: Proceedings of the Seventh International Workshop, D.I. Fotiadis, C., V. Massala, Eds.

Abstract This study presents isolated phoneme recognition system combining both visual and acoustical data. The Active Shape Model method is used for extracting visual speech features from the shape and movement of the lips. This method consists in a model-based approach for extracting speech information from image sequences. Its advantage over the image-based approach stems from the fact that important features are represented in a low-dimensional space and are normally invariant to translation, rotation, scaling and illumination. The Mel Frequency Cepstral Coefficients (MFCCs) are used as the acoustic speech features in the speech recognition system. MFCCs are based on the short-term spectrum. The power spectrum bins are grouped and smoothed according to the perceptually motivated Mel frequency scaling. Then the spectrum is segmented into critical bands. Finally, a discrete cosine transform is applied to the logarithm of the filter bank output signal resulting in vectors of decorrelated MFCCs features. A three-layer feed-forward artificial neural network (ANN) is used in the experiments related to speech recognition. Feature vectors extracted combine both modalities of the human speech. A matrix, containing feature vectors calculated during the utterance, forms an input to the ANN. To make the results of speech classification robust against the changes in the utterance duration, an interpolation is used to compute feature vectors. Additional experiments with the degraded acoustical information are carried out in order to test the system robustness against various distortions affecting the signals. The system engineered utilizing only the visual information correctly classifies properly nearly 80% of the speech utterances. This result is very satisfying taking into account a huge similarity between lip movements during articulation of vowels and a great diversity of lip shapes originating from the anatomical features and the way of speaking. Results of classification based on the acoustical information are much better than the ones based on the visual information. However, utilizing both modalities in the speech recognition system further improves the effectiveness. Moreover this makes the system much more robust against distortions in the audio signal. A software is prepared employing above mentioned algorithms to be used by cochlear implanted patients in the process of speech training. An interactive application was conceived making possible organizing the interactive speech training sessions without any assistance from speech therapists. This method consists in a model-based approach for extracting speech information from image sequences. Its advantage over the image-based approach stems from the fact that important features are represented in a low-dimensional space and are normally invariant to translation, rotation, scaling and illumination. MFCCs are based on the short-term spectrum. The power spectrum bins are grouped and smoothed according to the perceptually motivated Mel frequency scaling. Then the spectrum is segmented into critical bands. Finally, a discrete cosine transform is applied to the logarithm of the filter bank output signal resulting in vectors of decorrelated MFCCs features. A matrix, containing feature vectors calculated during the utterance, forms an input to the ANN. To make the results of speech classification robust against the changes in the utterance duration, an interpolation is used to compute feature vectors. Additional experiments with the degraded acoustical information are carried out in order to test the system robustness against various distortions affecting the signals. The system engineered utilizing only the visual information correctly classifies properly nearly 80% of the speech utterances. This result is very satisfying taking into account a huge similarity between lip movements during articulation of vowels and a great diversity of lip shapes originating from the anatomical features and the way of speaking. Results of classification based on the acoustical information are much better than the ones based on the visual information. However, utilizing both modalities in the speech recognition system further improves the effectiveness. Moreover this makes the system much more robust against distortions in the audio signal. A software is prepared employing above mentioned algorithms to be used by cochlear implanted patients in the process of speech training. An interactive application was conceived making possible organizing the interactive speech training sessions without any assistance from speech therapists.

Streszczenie Niniejszy rozdział stanowi rozszerzenie referatu przedstawionego na konferencji 7th Mathematical Methods in Scattering Theory and Biomedical Engineering. Rozdział ten przedstawia system rozpoznawania izolowanych głosek mowy wykorzystujący dane wizualne i akustyczne. Modele Active Shape Models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na współczynnikach melcepstralnych. Sieć neuronowa została użyta do rozpoznawania wymawianych głosek na podstawie wektora cech zawierającego oba typy parametrów. Dodatkowo zbadano odporność systemu na zakłócenia w sygnale dźwiękowym.

Entry No. 180

Entry type journal paper

Authors A. Czyżewski, L. Litwic, M. Dziubiński, P. Maziewski

English title Intelligent Algorithms for Movie Sound Track Restoration

Polish title Inteligentne algorytmy do rekonstruowania optycznych ścieżek dźwiękowych

Journal Lecture Notes in computer Science: Transaction on Rough Set V

Volume V

Number

Pages 123 - 145

Abstract Two algorithms for movie sound tracks restoration are discussed in the paper. The first algorithm is the unpredictability measure computation applied to the psychoacoustic model-based broadband noise attenuation. A learning decision algorithm, based on a neural network, is employed for determining useful audio signal components acting as maskers of the noisy spectral parts. An iterative method for calculating the sound masking pattern is presented. The second of presented algorithms is the routine for precise evaluation of parasite frequency modulations (wow) utilizing sinusoidal components extracted from the sound spectrum. The results obtained employing proposed intelligent signal processing algorithms, as well as the relationship between both routines, will be presented and discussed in the paper.

Streszczenie W artykule przedstawiono dwa algorytmy do rekonstruowania optycznych ścieżek dźwiękowych. Pierwszy z nich jest zastosowaniem miary nieprzewidywalności do obliczeń parametrów modelu psychoakustycznego stosowanego do redukowania szumów. Drugi stanowi precyzyjną procedurę oceny pasożytniczej modulacji częstotliwości, opartej na analizie składowych harmonicznych. Wyniki zastosowania obu wymienionych algorytmów są zawarte w artykule.

Entry No. 181

Entry type journal paper

Authors A. Wieczorkowska, A. Czyżewski

English title Role of Various Parametres in Automatic Classification of Musical Instrument Sound

Polish title Rola parametrów sygnału fonicznego w automatycznym rozpoznawaniu dźwięków instrumentów muzycznych

Journal Internationa Transactions on Communication and Signal Processing (GESTS)

Volume 6

Number 1

Pages 146 - 159

Abstract This paper addresses the problem of automatic classification of musical instrument sounds, especially, how various sound parameters and their evolution in various sound stages contribute to the recognition process. The parameterized data represented singular sounds of musical scale of contemporary orchestral string instruments, woodwinds, and brass. The proposed parameterization methods are based on Fourier analysis and time domain of sounds, using feature vectors consisting of 14 and 62 parameters. Authors compare classifica-tion quality and discernibility of instruments on the basis of various sound fea-tures, observed at various stages of sound. The quality of the proposed parame-terizations have been tested using decision trees and rough set based algorithms.

Streszczenie Artkuł dotyczy problemu automatycznej klasyfikacji dźwięków instrumentów muzycznych, w tym głównie wpływu indywidualnych parametrów na proces automatycznego rozpoznawania instrumentów. Parametryzacja wykorzystuje wdirmo Fourierowskie i analizę czasową dźwięków do formowania 14 i 62-parametrowych wektorów cech dystynktywnych. Autorzy porównują jakość rozpoznawania i rozróźnialność instrumentów. Przy ocenach tego typu stosowano drzewa decyzyjne i metodę zbiorów przybliżonych.

Entry No. 182

Entry type conference paper

Authors A. Czyżewski, E. Pływaczewski, Z. Rau, W. Ziółkowski

English title Polish Internal Security Platform, Pomeranian Special Economic Zone @ Gdansk Security Centre. Concept, organization and cooperation

Polish title Polska Platforma Bezpieczeństwa Wewnętrznego, Pomorska Specjalna Strefa Ekonomiczna i Gdańskie Centrum Technologii Bezpieczeństwa

Conference TRANSIS 2006

Preprint

Number

Volume

Pages 55 - 70

Conference site Gdańsk, Polska

Conference date 6.10.2006

Abstract Fundamental issuses were presented with regards to the following initiatives related to security, namely: Polish Internal Security Platform, Pomeranian Special Economic Zone @ Gdansk Security Centre. Concept, organization and cooperation.

Streszczenie Przedstawiono informacje na temat szeregu incicjatyw związanych z bepieczeństwem obywateli i biznesu, takich jak: Polska Platforma Bezpieczeństwa Wewnętrznego, Pomorska Specjalna Strefa Ekonomiczna i Gdańskie Centrum Technologii Bezpieczeństwa.

Entry No. 183

Entry type book

Authors A. Czyżewski

English title Applications of Knowledge Technologies to Sound and Vision Engineering

Polish title Zastosowania technologii wiedzy w inżynierii dźwięku i obrazu

Editor Springer-Verlag Lecture Notes in Computer Science

Pages

Notes Volume 4062/2006 (abstrakt)

Abstract Sound and Vision Engineering as an interdisciplinary branch of science should quickly assimilate new methods and new technologies. Meanwhile, there exist some advanced and well developed methods for analyzing and processing of data or signals that are only occasionally applied to this domain of science. These methods emerged from the artificial intelligence approach to image and signal processing problems. In the paper the intelligent algorithms, such as neural networks, fuzzy logic, genetic algorithm and the rough set method will be presented with regards to their applications to sound and vision engineering. The paper will include a practical demonstration of results achieved with intelligent algorithms applications to: bi-modal recognition of speech employing NN-PCA algorithm, perceptually-oriented noisy data processing methods, advanced sound acquisition, GA algorithm-based digital signal processing for telecommunication applications and others.

Streszczenie Specjalność Inżynieria Dźwięku i Obrazu jest ukierunkowana przede wszystkim na aplikacje praktyczne metod rejestracji i przetwarzania sygnałów fonicznych i wizyjnych we współczesnej telekomunikacji i w multimediach. W związku z tym, specjalność ta wykorzystuje również wiedzę z obszaru akustyki, psychofizjologii percepcji a także estetyki muzycznej. W zastosowaniach multimedialnej technologii informatycznej w telekomunikacji, w przesyłaniu i przetwarzaniu sygnałów a także w akustyce fonicznej, w technice rejestracji nagrań i w technologii studyjnejcoraz częściej pojawiają się inteligentne metody obliczeniowe. W referacie przedstawiowo zastosowania, takie jak: bi-modalne rozpoznawanie mowy, perceptualna redukcję szumów, algorytm genetyczny w zastosowaniu do eliminacji echa w torach telekomunikacyjnych i in.

Entry No. 184

Entry type conference paper

Authors A. Walkowiak, B. Kostek, A. Lorens, A. Czyżewski, A. Obrycka, A. Wąsowski

English title Simulation of electric hearing - influence of simulation parameters on quality of output signal

Polish title Wpływ wybranych parametrów symulacji słuchu elektrycznego na jakość sygnału mowy

Conference I Konferencja Audiologiczno-Foniatryczna

Preprint

Number

Volume

Pages 98

Conference site Warszawa, Polska

Conference date 10.9.2006- 12.9.2006

Notes Audiofonologia-Suplement, str. 98

Streszczenie W środowisku programistycznym Matlab stworzono symulację słuchu elektrycznego pacjenta implantowanego. W symulacji zastosowano algorytm przetwarzania wykorzystywany w komercyjnych systemach implantów ślimakowych - CIS (Continuous Interleaved Sampling). W pracy zbadano wpływ ilości kanałów, jak i innych parametrów sygnałów wyjściowych przy zastosowaniu sygału mowy jako sygnału wejściowego symulacji. Słowa kluczowe: audiologia, implant ślimakowy, algorytmy przetwarzania

Entry No. 185

Entry type journal paper

Authors M. Kulesza, Ł. Litwic, G. Szwoch, A. Czyżewski

English title High Quality Speech Codec Employing Sines+Noise+Transients Model

Polish title Kodek wysokiej jakości sygnału mowy wykorzystujący model sinusy+szum+transjenty

Journal Archives of Acoustics

Volume 31

Number 4

Pages 183 - 188

Abstract A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and to preserve high quality of speech. Therefore, the psychoacoustic model devised for perceptual speech coding is presented. The experimental results reveal that method for tonality estimation employed in the psychoacoustic model has a significant impact on perceptual coding accuracy. Various methods for tonality estimation are presented and compared.

Streszczenie Zaprezentowano metodę kodowania szerokopasmowego sygnału mowy wykorzystującą model sinusy+szum+transjenty. Potrzeba kodowania szerokopasmowego sygnału mowy jak również różne metody analizy i syntezy komponentów sinusoidalnych, szumowych oraz transjentów zostały poddane dyskusji. W celu redukcji przepływności bitowej w procesie kodowania amplitud komponentów sinusoidalnych zastosowano kryterium psychoakustyczne. Z tego powodu w artykule przedstawiono model psychoakustyczny do zastosowania w kodowaniu sygnału mowy. Wyniki eksperymentów wskazują, iż metoda wyznaczania tonalności komponentów widma ma istotny wpływ na jakość kodowania perceptualnego. Różne metody określania tonalności komponentów widma zostały przedstawione i porównane.

Entry No. 186

Entry type conference paper

Authors A. Czyżewski

English title Applications of Knowledge Technologies to Sound and Vision Engineering

Polish title Zastosowanie technologii opartych na wiedzy w inżynierii dźwięku i obrazu

Conference Rough Sets and Knowledge Technology

Preprint

Number

Volume

Pages

Conference site Chongquing, Chiny

Conference date 24.7.2006- 26.7.2006

Notes referat plenarny - na zaproszenie organizatorów konferencji

Abstract Sound and Vision Engineering as an interdisciplinary branch of science should quickly assimilate new methods and new technologies. Meanwhile, there exist some advanced and well developed methods for analyzing and processing of data or signals that are only occasionally applied to this domain of science. These methods emerged from the artificial intelligence approach to image and signal processing problems. In the paper the intelligent algorithms, such as neural networks, fuzzy logic, genetic algorithm and the rough set method will be presented with regards to their applications to sound and vision engineering. The paper will include a practical demonstration of results achieved with intelligent algorithms applications to: bi-modal recognition of speech employing NN-PCA algorithm, perceptually-oriented noisy data processing methods, advanced sound acquisition, GA algorithm-based digital signal processing for telecommunication applications and others.

Streszczenie Inżynieria dźwięku i obrazu jako interdyscyplinarna dziedzina nauki i techniki szybko asymiluje nowe metody i technologie. W referacie zaprezentowano wyniki szeregu prac badawczych i eksperymentów naukowych z tej dziedziny, uzyskanych z zastosowaniem metod obliczeniowych, takich jak inteligencja obliczeniowa i soft computing. Słowa kluczowe: inzynieria dźwieku i obrazu; inżynieria wiedzy; metody inteligentne

Entry No. 187

Entry type conference paper

Authors A. Czyżewski, Ł. Litwic, P. Maziewski

English title Computational Intelligence Approach to Archival Musical Recordings

Polish title Podejście oparte na inteligencji obliczeniowej do zagadnień archiwizacji nagrań muzycznych

Conference 4th Joint Meeting of the Acoustical Society of America

Preprint

Number

Volume

Pages

Conference site Honolulu, HI, USA

Conference date 28.11.2006- 2.12.2006

Abstract An algorithmic approach to wow defect estimation in archival musical recordings is presented. The wow estimation is based on the simultaneous analysis of many sinusoidal components, which are assumed to depict the defect. The rough determination of sinusoidal components in analysed musical recording is performed by standard sinusoidal modeling procedures employing magnitude and phase spectra analysis. Since archival recordings tend to contain distorted tonal structure the basic sinusoidal modeling approach is often found insufficient resulting in audible distortions in the restored signal. It is found that the standard sinusoidal modeling approach is prone to errors especially when strong frequency or amplitude variations of sinusoidal components occur. It may result in gaps or inappropriately matched components leading to incorrect estimation of the wow distortion. Hence, some refinements to sinusoidal component analysis including interpolation and extrapolation of tonal components are proposed. As it was demonstrated in experiments, due to nonlinear nature of wow distortion, the enhancement of sinusoidal analysis can be performed by means of a neural network. The paper demonstrates implemented algorithms for parasite frequency modulation in archival recordings together with obtained results. The work was supported by the Commission of the European Communities, within the Integrated Project No. FP6-507336: “PRESTOSPACE”.

Streszczenie Przedstawiono algorytmy estymacji zakłóceń polegających na kołysaniu dźwięku w nagraniach. W szczególności dyskutowane podejście polega na równoczesnej analizie komponentów harmonicznych. Wykazano, że zastosowanie sieci neuronowej może poprawić efektywność działania algorytmu opartego na analizie wielu składowych harmonicznych dźwięku obarczonego pasożytniczą modulacją czestotliwościową. Słowa kluczowe: rekonstruowanie nagrań; pasożytnicze modulacje; kołysanie dźwięku

Entry No. 188

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński, B. Kostek

English title "I Can Hear" - a system for universal hearing screening

Polish title "Słyszę..." - system badań przesiewowych słuchu

Conference Inter Noise 2006. The 35th International Congress and Exposition on Noise Control Engineering

Preprint

Number

Volume

Pages

Conference site Honolulu, USA

Conference date 3.12.2006- 6.12.2006

Abstract Hearing impairment is one of the fastest growing diseases of modern society. This kind of impairment is often introduced by excessive noise. Therefore it is important to organise mass scale screening tests to identify people suffering from this kind of impairment. "I Can Hear…" provides a Web-based test that uses automatic questionnaire analysis, audiometric tone test procedures, and assesses speech intelligibility in noise. When all the testing is completed, "I Can Hear…" automatically analyses the results for each person examined. Based on the number of incorrect answers, the decision is made automatically by the expert system: does the person have normal hearing or does he or she have hearing problems and require to be examined in one of the consulting centres. Those whose hearing impairment is confirmed will be referred to treatment.

Streszczenie System "Słyszę..." jest usługą dostępną w Internecie, która służy do prowadzenia przesiewowych badań słuchu. Wykorzystywane są trzy rodzaje badań: ankieta eleketroniczna, testy audiometryczne i testy zrozumiałości mowy w szumie. W wyniku analizy odpowiedz pacjentów, system ekspercki podejmuje decyzję o zakwalifikowaniu badanej osoby do grupy osób nie mających problemów ze słuchem lub do grupy osób cierpiących na niedosłuch. Słowa kluczowe: badanie słuchu; audiometria

Entry No. 189

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title Speech recognition based on visual features applicable in the logopedics.

Polish title Rozpoznawanie mowy z wykorzystaniem cech wizualnych do zastosowań w terapii logopedycznej

Conference Konferencja Fundacji na rzecz Nauki Polskiej "Nauki techniczne jako źródło innowacji?"

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 19.5.2006- 20.5.2006

Notes plakat

Streszczenie Istniejące systemy rozpoznawania mowy bazują z reguły na informacji akustycznej. Z tego względu są bardzo wrażliwe na szumy otoczenia i często zawodzą w przypadku, gdy wiele osób mówi jednocześnie. W procesie percepcji mowy ludzie podświadomie wykorzystują informacje wizualne pochodzące bezpośrednio od mówcy, takie jak ruchy ust, pozycja języka i widoczność zębów. Celem prowadzonych badań jest stworzenie systemu pomocnego w terapii logopedycznej osób z wadami słuchu.

Entry No. 190

Entry type conference paper

Authors A. Czyżewski, B. Kostek, K. Kochanek, H. Skarżyński

English title Dithering Strategy Applied to Tinnitus Masking

Polish title Nowe podejście do maskowania szumów usznych

Conference 120th Audio Eng. Society Convention

Preprint 6856

Number

Volume

Pages 1 - 8

Conference site Paris, Francja

Conference date 20.5.2006- 23.5.2006

Notes J. Audio Eng. Soc., vol. 54, 7/8, 738-739

Abstract The hypothesis on the existence of a parasitic quantization, that accompanies hearing loss has been formulated in this work, and then related to other existing theories on causes of Tinnitus. Some preliminary experiments have been carried out, that targeted at verifying the correctness of the proposed interpretation of applied maskers employing dither theory. An effective method of providing a masking signal that uses bone conductivity was derived for the purpose of these experiments. The results of the experiments initially confirm the analogy between the threshold phenomena occurring in the digital audio circuits and ear noises origin. The presented results may induce the elaboration of more effective ear therapies based on high-frequency dither having specially formed spectral characteristics.

Streszczenie W referacie przedstawiono teorię wyjaśniającą zjawisko szumów usznych na gruncie akustyki, elektroniki i telekomunikacji. Spostrzeżenie, że słuch jest w istocie akustycznym układem transmisyjnym, skłania do poszukiwania interpretacji powstawania szumów usznych w ogólnej teorii spontanicznego generowania szumu w układach transmisyjnych. Sformułowana hipoteza wskazuje na istnienie pasożytniczej kwantyzacji, która pojawia się w sytuacji wystąpienia ubytku słuchu, dlatego została ona powiązana z teoriami, dotyczącymi przyczyn powstawiania szumów usznych. W ramach prac badawczych zostały przeprowadzone wstępne badania, mające na celu weryfikację zasadności zaproponowanej interpretacji sposobu działania maskera. Dla celów realizacji eksperymentów została opracowana skuteczna metoda podawania sygnału maskującego z wykorzystaniem kostnego przewodnictwa dźwięku. Wyniki przeprowadzonych badań potwierdzają występowanie analogii pomiędzy zjawiskami progowymi, które występują w elektronicznych układach transmisji sygnałów fonicznych z kwantyzacją i zjawiskami związanymi ze slyszeniem i powstawaniem szumów usznych, co może prowadzić do stworzenia bardziej skutecznych metod terapii. Słowa kluczowe: audiologia, szumy uszne, ultradźwięki, audiometria, słuch, efekt maskowania, dither, pasożytnicza kwantyzacja, układ transmisyjny

Entry No. 191

Entry type conference paper

Authors P. Maziewski, Ł. Litwic, A. Czyżewski

English title Accidental Wow Evaluation Based on Sinusoidal Modeling and Neural Nets Prediction

Polish title Określenie Przebiegu Przypadkowych Zniekształceń Kołysania przy Wykorzystaniu Modelowania Sinusoidalnego i Predykcji Sieciami Neuronowymi

Conference 120 Konferencja AES

Preprint 6769

Number

Volume

Pages

Conference site Paryż, Francja

Conference date 20.5.2006- 23.5.2006

Notes abstrakt w J. Audio Eng. Soc., vol. 54, 7/8, 709

Abstract In this paper an algorithmic approach to the wow defect characteristic evaluation is presented. The approach is based on a sinusoidal analysis comprising both amplitude and phase spectra processing. The frequency trajectories depicting the distortion are built on a basis of amplitude, frequency and phase dependencies and are further used for wow characteristic evaluation. Additionally the experiments concerning the neural-network-based prediction applied to the characteristic are performed. The obtained results are compared to linear-prediction.

Streszczenie Referat przedstawia opis algorytmu do określenia charakterystyki zniekształcenia kołysania dźwięku. Prezentowane podejście wykorzystuje sinusoidalną analizę dźwięku bazującą zarówno na amplitudowym jak i fazowym widmie sygnału fonicznego. Trajektorie poszczególnych składowych tonalnych, obrazujące zniekształcenie kołysania, określane są na podstawie analizy ich chwilowych amplitud, częstotliwości i faz. Dodatkowo referat przedstawia wyniki eksperymentów w których wykorzystano sieci neuronowe do predykcji przebiegu charakterystyki zniekształcenia. Otrzymane w ten sposób wyniki porównane są z liniową predykcją.

Entry No. 192

Entry type journal paper

Authors G. Szwoch, M. Kulesza, A. Czyżewski

English title Transient Detection for Speech Coding Applications

Polish title Algorytm detekcji transjentów do zastosowania w aplikacjach kodowania sygnału mowy

Journal International Journal of Computer Science and Network Security

Volume 6

Number 12

Pages 320 - 325

Abstract Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection in all frequency bands. Performance of the algorithm is evaluated and some enhancements are proposed. The algorithm described here allows for accurate transient detection in speech and is suitable for use in practical speech coding applications

Streszczenie Poprawa jakości kodowania sygnału mowy jest możliwa dzięku ekstrakcji fragmentów zawierających transjenty i odpowiednie ich kodowanie. W artykule przedstawiono algorytm detekcji transjentow, który bazuje na analizie sygnału mowy w kilku podpasmach. Funkcje detekcji transjentów w poszczególnych pasmach uzyskiwane są poprzez analizę wahań energi w poszczególnych podpasmach. Ostateczna klasyfikacja następuje z uwzględnieniem wyników detekcji w podpasmach. Przetestowano efektywność algorytmu jak również zaproponowano pewne jego usprawnienia. Opisany algorytm pozwala na detekcję transjentów w sygnale mowy i może być wykorzystany w aplikacjach kodowania sygnału mowy.

Entry No. 193

Entry type conference paper

Authors A. Ciarkowski, P. Mroczkiewicz, A. Czyżewski

English title Distributed Multimedia Processing in Mobile Networks using Semantically-enriched Web Services

Polish title

Conference 5th International Conference on Multimedia & Network Information Systems MISSI

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 21.9.2006- 22.9.2006

Notes Dane preprintu zostaną uzupełnione

Abstract This paper introduces the concept of distributed multimedia processing services within mobile networks as a mean to overcome hardware limitations of portable devices. A brief introduction to the DeSyME Project - an important element in presented service architecture - is included. The semantic layer provided by The DeSyME is described. Finally, practical examples of Multimedia Mobile Services realized within the portrayed architecture are presented.

Entry No. 194

Entry type conference paper

Authors B. Kostek, J. Kotus, A. Czyżewski

English title Noise Threat Impact on Hearing in Schools and Students' Music Clubs

Polish title Badania zagrożeń hałasowych i ich wpływu na słuch w szkołach i klubach studenckich

Conference Inter-Noise 2006

Preprint

Number

Volume

Pages

Conference site Honolulu, Hawaii, USA

Conference date 3.12.2006- 6.12.2006

Abstract The study aimed at showing results of a survey on noise threat which was conducted in schools and students' music clubs. The measurements of the acoustic climate employed engineered telemetry stations for continuous noise monitoring. At the same time, physiological effects of noise were measured among pupils and students. Hearing tests were performed twice, before and after noise exposure. For this purpose otoacoustic emission method (DPOAE) was utilized. The obtained results of the noise measurements revealed that an unfavorable noise climate was found in the examined schools and music clubs. This was also confirmed by the subjective examination results. The noise dose analysis taking into consideration an average time spent by pupils in schools was also performed. It revealed that noise at schools didn't constitute a risk for hearing system of the pupils, however it may be considered as an essential source of annoyance. On the other hand, noise in music clubs surpassed all permitted noise limits, thus could be treated as dangerous to hearing. Hearing tests conducted revealed changes in cochlea activity of students' examined, also Tinnitus effect was experienced temporarily. New noise annoyance & noise threat criteria were proposed and verified based on the acquired and analyzed data.

Streszczenie W referacie przedstawiono wyniki badań zagrożeń hałasowych w szkołach i muzycznych klubach studenckich. Pomiary klimatu akustycznego przeprowadzono za pomocą opracowanej, telemetrycznej stacji do ciągłego monitorowania hałasu oraz w formie ankiet. Badania słuchu przeprowadzono dwukrotnie, przed i po ekspozycji na hałas. Wykorzystano metodę otoemisji akustycznych produktów nieliniowych ślimaka (DPOAE). W czasie ekspozycji na hałas mierzono również subiektywne efekty psychologiczne wśród uczniów i studentów. Uzyskane wyniki pomiarów hałasu ujawniły niesprzyjający klimat akustyczny, który występował w szkołach i klubach. Wyniki badań obiektywnych zestawiono z subiektywnymi wynikami uzyskanymi za pomocą ankiet. Przedstawiono również analizę dozymetryczną z uwzględnieniem średniego czas przebywania uczniów w szkole oraz studentów w klubach. Uzyskane wyniki wykazały, że hałas w rozpatrywanych szkołach nie stanowi zagrożenia dla słuchu uczniów, jednak może być uznany za istotne źródło uciążliwości. Z kolei hałas panujący w klubach znacznie przekraczał poziomy dopuszczalne, może zatem stanowić zagrożenie dla słuchu. Uzyskane wyniki pomiarów słuchu wykazały istotne zmiany w aktywności ślimaka u badanych studentów. Dodatkowo osoby badane sygnalizowały wystąpienie czasowego szumu usznego (Tinnitus) w następstwie ekspozycji na hałas. Zaproponowano nowe kryteria oceny uciążliwości i zagrożenia hałasem, a także zweryfikowano je w oparciu o uzyskane wyniki pomiarów.

Entry No. 195

Entry type conference paper

Authors A. Czyżewski, B. Kostek, J. Kotus, M. Szczodrak, P. Dalka

English title Multimedia Noise Monitoring System

Polish title Multimedialny System Monitorowania Hałasu

Conference 56 Brussels Eureka 2007

Preprint

Number

Volume

Pages 34 - 35

Conference site Bruksela, Belgia

Conference date 23.11.2007- 27.11.2007

Abstract A concept and an implementation of the Multimedia Noise Monitoring System (MNMS) is presented in the application. Nowadays, environmental pollution caused by noise is extremely high, especially in cities and rises systematically. Because of the wide range of noxious effects of noise on a human organism, noise level monitoring is very important. The system developed constitutes a significant improvement in the domain of continuous monitoring of noise and accelerates the process of city acoustical map creation. The principal aim of the project is to improve the effectiveness of prophylaxis of hearing diseases. It allows to receive, store, analyze and visualize noise data coming from noise measurement equipments and from electronic questionnaires accessible through the Internet. The MNMS has a functionality to determine the noise emission level for selected kinds of noise sources (for road and rail noise sources). Moreover, the MNMS contains a new kind of the authors’ concept of the psychoacoustic noise dosimetry. The designed noise dosimeter enables asessing temporary threshold shift (TTS) during noise exposure. In this way it is possible to monitor the hearing threshold shift continuously for people who stay in the harmful noise conditions.

Streszczenie W zgłoszeniu przedstawiono Multimedialny System Monitorowania Hałasu (MSMH). Projekt jest internetowym serwisem poświęconym monitorowaniu zagrożeń hałasem. Współcześnie zanieczyszczenie hałasem, szczególnie w miastach, jest niezwykle wysokie i wciąż systematycznie wzrasta. Ze względu na szeroki zakres niekorzystnego oddziaływania hałasu na organizm człowieka bardzo istotne jest monitorowanie poziomu hałasu. Opracowany system stanowi istotny krok w dziedzinie usprawniania ciągłego pomiaru hałasu i znacząco przyspiesza proces tworzenia map akustycznych miast. Jego nadrzędnym celem jest zwiększenie skuteczności w zakresie profilaktyki chorób słuchu. Umożliwia pobieranie, gromadzenie, analizę i wizualizację danych dotyczących hałasu, pobieranych ze zdalnych urządzeń pomiarowych oraz elektronicznych ankiet dostępnych przez Internet. MSMH posiada funkcjonalność umożliwiającą określanie poziomów emisyjnych wybranych rodzajów źródeł hałasu (źródło hałasu drogowego i kolejowego). Ponadto MSMH zawiera autorską koncepcję psychoakustycznego dozymetru hałasowego. Dozymetr ten umożliwia wyznaczenie czasowego przesunięcia progu słyszenia podczas trwania ekspozycji na hałas. Możliwe jest dzięki temu ciągłe monitorowanie progu słyszenia osób przebywających w warunkach szkodliwego oddziaływania hałasu.

Entry No. 196

Entry type conference paper

Authors M. Kulesza, A. Czyżewski

English title Speech codec enhancements utlizing time compression and perectual coding

Polish title Metody poprawy jakości kodowania sygnału mowy z wykorzystaniem techniki kompresji czasowej oraz kodowania perceptualnego

Conference 122nd Audio Engineering Society

Preprint 7004

Number

Volume

Pages 1 - 14

Conference site Wiedeń, Austria

Conference date 5.5.2007- 8.5.2007

Abstract A method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband speech codec. The time expansion procedure is applied to the speech signal after transmission and decoding in order to restore original time relations. Finally, the wideband speech signal is presented to the user. The method for spectral envelope estimation involving perceptual criteria is described. The algorithms for tonal components detection were evaluated and compared during experiments carried-out.

Streszczenie Metoda kodowania szerokopasmowego sygnału mowy z wykorzystaniem ustandaryzwanych algorytmów wąskopasmowych jak również wyniki eksperymentów dotyczących nowego algorytmu detekcji komponentów tonalnych widma. Sygnał mowy próbkowany z wyższą częstotliwością próbkowania niż to jest przyjęte w przypadku stardowych kodeków owy poddawany jest kompresji czasowej w celu zmniejszenia ilości próbek. Nastepnie jest on kodowany i transmitowany. W odbiorniku przeprowadzana jest ekspansja czasowa sygnału po jego zdekodowaniu i sygnał mowy jest prezentowany rozmówcy. Omowiono również metodę do detekcji obwiedni widma wykorzystującą kryteria psychoakustyczne. Przedstawiono wyniki badań związanych z nowę metodą detekcji komponentów tonalnych widma.

Entry No. 197

Entry type conference paper

Authors A. Czyżewski, P. Dalka, M. Kulesza, Ł. Kosikowski, B. Kostek, P. Suchomski

English title Contactless Hearing Aid

Polish title Bezkontaktowy aparat słuchowy

Conference Technicon

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 23.10.2007- 25.10.2007

Notes plakat

Abstract It is essential to correct the infants hearing loss as soon as possible in order to prevent disturbing of speech development process. Commonly used hearing aids weared in ear canal are not suitable for infants. The novel approach to corection of hearing loss for infants in first months of life is presented. None part of the device contact the infants body.

Streszczenie Korekcję wady słuchu niemowlęcia należy rozpocząć jak najwcześniej w celu umożliwienia prawidłowego rozwoju ośrodka mowy. Typowe rozwiązania aparatów montowanych za uchem bądź w kanale usznym dziecka nie najlepiej nadają się do terapii niemowląt (duże rozmiary, zakłócenie wzrostu i rozwoju ucha zewnętrznego i kanału słuchowego). Zaprezentowano aparat dla niemowląt, który nie wymaga kontaktu z ciałem i może wspomóc rozwój dziecka w pierwszych miesiącach życia.

Entry No. 198

Entry type conference paper

Authors A. Czyżewski, P. Odya, Ł. Kosikowski, P. Szczuko, A. Szkiełkowska

English title New Generation Aids for Laryngectomy Patients

Polish title Nowe pomoce elektroniczne dla osób po laryngektomii

Conference II Konferencja Audiologiczno-Foniatryczna

Preprint

Number

Volume

Pages

Conference site Białystok, Polska

Conference date 6.9.2007- 9.9.2007

Notes plakat

Abstract The artificial larynx has many disadvantages. The produced speech is monotonous and sounds artificially. In addition, produced speech intelligibility is usually poor. The major problem is a background noise caused by the device. In fact, the artificial larynx is only a simple vibrator, a construction of which has been almost unchanged since the 1950s. The aim of the presented project is to design a new generation of devices for laryngectomy patients. There are two different approaches to solve this task. The first one focuses on the artificial larynx. Some major improvements in the construction of the device might be easily introduced. Hence, the artificial larynx engineered was equipped with a digital processor and an amplifier. The spectral subtraction algorithm for noise reduction was used. In this method, an average signal spectrum and average noise spectrum are estimated and subtracted from each other, thus average signal-to-noise ratio SNR is improved. The main problem to be solved was that both the noise and the speech signal have the same excitation source and consequently are strongly correlated for voiced sounds. The second approach uses a PDA portable digital assistant to generate synthetic speech. The proposed new generation devices helping laryngectomy patients are presented in the paper.

Streszczenie Celem badań opisanych w pracy było opracowanie urządzeń nowej generacji dla osób laryngektomowanych. Typowa sztuczna krtań ma wiele wad. Najpoważniejszym problemem jest warkot generowany przez urządzenie. Zaproponowane zostały dwa rozwiązania majace na celu wyeliminowania tego problemu. Pierwsze skupia się na zmianach w konstrukcji sztucznej krtani. Opracowane urządzenie zostało dodatkowo wyposażone w cyfrowy procesor i wzmacniacz. W celu redukcji zakłóceń wykorzystano dwa algorytmy: odejmowanie widmowe i filtrację grzebieniową. Drugie rozwiązanie bazuje na komputerze typu PDA służącym do generowania mowy. Wykorzystano algorytmy syntezy mowy, co pozwala na odtwarzanie dowolnych wypowiedzi.

Entry No. 199

Entry type conference paper

Authors A. Czyżewski, B. Kostek, Ł. Kosikowski

English title Virtual hearing aid – a computer application for simulating hearing aids performance

Polish title Wirtualna proteza słuchu – komputerowa aplikacja do symulacji działania protez słuchu

Conference 122nd Convention

Preprint

Number

Volume

Pages

Conference site Wiedeń, Austria

Conference date 5.5.2007- 8.5.2007

Abstract The virtual hearing aid is a computer application allowing an approximate simulation of hearing aid performance. The computer application implements algorithms simulating band-pass filters, compressors and also the perceptual masking strategies for audio signal processing. Individual persons' hearing characteristics were taken into account for this purpose. The experimental part comprises verification of engineered algorithms implemented to virtual hearing prosthesis. The paper contains also results of examinations of patients aimed at verifying the applicability of the proposed signal processing strategy to the domain of hearing prosthesis.

Streszczenie Wirtualna proteza słuchu to komputerowa aplikacja umożliwiająca symulację działania protezy słuchu. Aplikacja zawiera algorytmy filtracji pasmowej, kompresji dynamiki, a także koncepcje maskowania perceptualnego. W wirtualnej protezie słuchu wykorzystano rzeczywiste charakterystyki słyszenia wybranych osób. W części eksperymentalnej przedstawiono weryfikację zaproponowanych algorytmów. W referacie zamieszczono także wyniki badań pacjentów przerowadzone w celu sprawdzenia zaproponowanej strategii cyfrowego przetwarzania sygnałów do zastosowań w protezach słuchu.

Entry No. 200

Entry type journal paper

Authors A. Czyżewski, P. Dalka

English title Visual Traffic Noise Monitoring in Urban Areas

Polish title Wizyjny monitoring halasu drogowego w aglomeracjach miejskich

Journal International Journal of Multimedia and Ubiquitous Engineering

Volume 2

Number 2

Pages 91 - 101

Abstract The paper presents an advanced system for railway and road traffic noise monitoring in metropolitan areas. This system is a functional part of a more complex solution designed for environmental monitoring in cities utilizing analyses of sound, vision and air pollution, based on a ubiquitous computing approach. The system consists of many autonomous, universal measuring units and a multimedia server, which gathers, processes and presents data obtained from the distributed measuring units. The results are visualized on numerical maps. The paper contains a functional and technical description of the monitoring system. It describes also the algorithm for moving vehicle detection in video sequences based on a pixel-level difference among the image frames and a continually updated background model utilizing mixtures of Gaussians. The experiments carried out involve the implemented algorithm to the detection of vehicles in the recorded video sequences. The results obtained are illustrated with some examples and discussed.

Streszczenie Artykuł prezentuje zaawansowany system do monitorowania hałasu drogowego i kolejowego w obszarach miejskich. System ten stanowi część funkcjonalną większego rozwiązania przeznaczonego do monitorowania środowiska w miastach, obejmującego analizę dźwięku, obrazu i zanieczyszczenia środowiska i bazującego na wszechobecnym podejściu komputerowym. System składa się z wielu autonomicznych, uniwersalnych jednostek pomiarowych i serwera multimedialnego, który zbiera, przetwarza i prezentuje dane otrzymane z rozproszonych jednostek pomiarowych. Wyniki wizualizowane są na mapach numerycznych. Artykuł zawiera opis funkcjonalny i techniczny systemu monitorującego. Zawiera także opis algorytmu do detekcji ruchomych pojazdów w strumieniach wizyjnych, który bazuje na pikselowej różnicy pomiędzy ramkami obrazu a stale uaktualnianym modelu tła wykorzystującym sumę ważonych rozkładów normalnych. Przeprowadzone eksperymenty obejmują wykorzystanie zaimplementowanego algorytmu do detekcji pojazdów w nagraniach wizyjnych. Uzyskane wyniki są omówione i zilustrowane na przykładach.

Entry No. 201

Entry type conference paper

Authors A. Czyżewski, P. Dalka, M. Kulesza, Ł. Kosikowski, B. Kostek, P. Suchomski

English title Contactless Hearing Aid

Polish title Bezkontaktowy aparat słuchowy

Conference Z okazji Międzynarodowego Dnia Niesłyszących

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 28.9.2007- 28.9.2007

Notes plakat

Abstract It is essential to correct the infants hearing loss as soon as possible in order to prevent disturbing of speech development process. Commonly used hearing aids weared in ear canal are not suitable for infants. The novel approach to corection of hearing loss for infants in first months of life is presented. None part of the device contact the infants body.

Streszczenie Korekcję wady słuchu niemowlęcia należy rozpocząć jak najwcześniej w celu umożliwienia prawidłowego rozwoju ośrodka mowy. Typowe rozwiązania aparatów montowanych za uchem bądź w kanale usznym dziecka nie najlepiej nadają się do terapii niemowląt (duże rozmiary, zakłócenie wzrostu i rozwoju ucha zewnętrznego i kanału słuchowego). Zaprezentowano aparat dla niemowląt, który nie wymaga kontaktu z ciałem i może wspomóc rozwój dziecka w pierwszych miesiącach życia.

Entry No. 202

Entry type book

Authors A. Czyżewski, P. Dalka

English title Data Acquisition And Processing For Diagnostics Of Urban Environment Utilizing Teleinformation Technology

Polish title Teleinformatyczna akwizycja i przetwarzanie danych dla potrzeb diagnostyki środowiska aglomeracji miejskich

Editor Pomorskie Wydawnictwo Naukowo-Techniczne

Pages 241 - 256

Notes Rozdzial 14 w książce "Inteligentne wydobywanie informacji w celach diagnostycznych"

Abstract The article presents the idea of a complex system for road traffic monitoring in urban areas. This system is a functional part of the more complex solution designed for environment monitoring in cities. The system consists of many autonomous, universal monitoring stations and the server, which gathers, processes and presents data obtained from the monitoring stations. Results are visualized with the use of numerical maps. The article contains also a functional and technical description of the monitoring system. It also describes the algorithm for moving vehicle detection in video sequences. The algorithm is based on a pixel-level difference among the image frames and a continually updated background model utilizing mixtures of Gaussians. The experiments carried out involve the implemented algorithm to detect vehicles in the recorded video sequences. The results obtained are illustrated and discussed.

Streszczenie Rozdział przedstawia koncepcję aktualnie opracowywanego systemu służącego do monitorowania hałasu i ruchu drogowego w aglomeracjach miejskich, stanowiącego funkcjonalną część szerszego rozwiązania dotyczącego monitorowania środowiska w miastach. System taki składa się z rozmieszczonych w mieście autonomicznych i uniwersalnych stacji monitorujących oraz serwera gromadzącego, przetwarzającego i prezentującego wyniki z wykorzystaniem map numerycznych. W rozdziale przedstawiono opis techniczny i funkcjonalny opracowywanego systemu. Zaprezentowano również i zbadano algorytm wydobywania cech obrazu poruszających się pojazdów w nagraniach wizyjnych. Wykrywanie pojazdów odbywa się poprzez porównanie bieżącej ramki obrazu z modelem tła tworzonym w oparciu o sumę ważonych funkcji Gaussowskich. Przeprowadzone eksperymenty obejmują wykorzystanie algorytmu do detekcji pojazdów w nagraniach wideo. Uzyskane wyniki zilustrowano na przykładach.

Entry No. 203

Entry type journal paper

Authors A. Czyżewski, A. Ciarkowski, A. Kaczmarek, J. Kotus, M. Kulesza, P. Maziewski

English title DSP Techniques for Determining "Wow" Distortions

Polish title Techniki CPS do określania charakterystyki kołysania dźwięku

Journal J. Audio Eng. Soc.

Volume 55

Number 4

Pages 266 - 284

Abstract Algorithms for determining the wow distortion characteristic are proposed. These are the power-line-hum tracking algorithm, the high-frequency-bias tracking algorithm, and the algorithm based on an adaptive analysis of the center of gravity of the spectrum of the distorted signal. All of the algorithms presented allow a hardware- or software-based implementation.

Streszczenie Artykuł przedstawia opis algorytmów do wyznaczania charakterystyki zniekształceń kołysania dźwięku. Są to algorytmy: śledzenia przydźwięku sieciowego, śledzenia pozostałości magnetycznej prądu podkładu wielkich częstotliwości, adaptacyjnej analizy środka ciężkości widma dla wybranej części zniekształconego sygnału. Przedstawione algorytmy pozwalają na implementację programową i sprzętową.

Entry No. 204

Entry type journal paper

Authors A. Czyżewski, J. Kotus, B. Kostek

English title DETERMINING THE NOISE IMPACT ON HEARING USING PSYCHOACOUSTICAL NOISE DOSIMETER

Polish title Określenie wpływu hałasu na słuch przy użyciu psychoakustycznego dozymetru hałasowego

Journal Archives of Acoustics

Volume 32

Number 2

Pages 203 - 217

Abstract This research study presents the designed noise dosimeter based on psychoacoustical properties of the human hearing system and, at the same time, evaluation of time and frequency characteristics of noise. The designed noise dosimeter enables assessing temporary threshold shift (TTS) in critical bands in real time. In this way it is possible monitoring the hearing threshold shift continuously for people who stay in the harmful noise conditions. Moreover, the psychoacoustical noise dosimeter (PND) provides the functionality which determines time causing an increase of the assumed hearing threshold shift along with time required for recovery of a hearing threshold toward its initial value. Noise exposure levels, its duration along with hearing examination have been first measured in the acoustically controlled environment. Pure-tone audiometry has been used for hearing examination. This has been conducted in constant time intervals, during noise exposure as well as during resting time (time required for hearing recovery). The examination aims at measuring hearing threshold at 4 kHz. The important part of this study is validation of the dosimeter performance in the real noise exposure situation. In this case the whole noise measurement scenario encompasses both noise exposure effects, and hearing examination before and after noise exposure. The hearing examination has been extended by the distortion products otoacoustic emission method (DPOAE). The measurement results obtained in real conditions have been compared with those which were computed by means of the presented psychoacoustical noise dosimeter.

Streszczenie W pracy przedstawiono projekt i realizację nowego psychoakustycznego dozymetru hałasowego. Jego działanie jest oparte na uwzględnieniu własności psychoakustycznych słuchu oraz charakterystyki czasowej i częstotliwościowej hałasu. Opracowany dozymetr umożliwia estymację przesunięcia progu słyszenia w pasmach krytycznych w czasie rzeczywistym. Możliwe jest zatem ciągłe monitorowanie stanu słuchu osób przebywających w niekorzystnych warunkach akustycznych. Ponadto, opracowany dozymetr wyznacza czas niezbędny do odzyskania stanu słuchu sprzed ekspozycji. W pierwszej kolejności przedstawiono wyniki pomiarów wpływu hałasu na słuch uzyskane w warunkach laboratoryjnych. Badania słuchu wykonano za pomocą audiometrii tonalnej. W warunkach laboratoryjnych słuch badano w stałych interwałach czasowych, w czasie ekspozycji oraz w fazie odpoczynku. Wyznaczano próg słyszenia dla częstotliwości 4 kHz. Istotnym elementem pracy są badania porównawcze dotyczące poprawności działania psychoakustycznego dozymetru hałasowego w warunkach rzeczywistych. Badania te obejmowały ekspozycję na hałas, pomiary słuchu przed i po ekspozycji na hałas. Z kolei badania słuchu rozszerzono o badanie metodą otoemisji akustycznych produktów zniekształceń nieliniowych ślimaka (DPOAE). Wyniki uzyskane w warunkach rzeczywistych porównano z estymacją skutków ekspozycji na hałas, określoną przez opracowany dozymetr.

Entry No. 205

Entry type conference paper

Authors A. Czyżewski, J. Kotus, B. Kostek

English title DETERMINING THE NOISE IMPACT ON HEARING USING PSYCHOACOUSTICAL NOISE DOSIMETER

Polish title Określenie wpływu hałasu na słuch przy użyciu psychoakustycznego dozymetru hałasowego

Conference 14th International conference on noise control NOISE CONTROL 07

Preprint

Number

Volume

Pages

Conference site Elbląg, Polska

Conference date 3.6.2007- 6.6.2007

Abstract This research study presents the designed noise dosimeter based on psychoacoustical properties of the human hearing system and, at the same time, evaluation of time and frequency characteristics of noise. The designed noise dosimeter enables assessing temporary threshold shift (TTS) in critical bands in real time. In this way it is possible monitoring the hearing threshold shift continuously for people who stay in the harmful noise conditions. Moreover, the psychoacoustical noise dosimeter (PND) provides the functionality which determines time causing an increase of the assumed hearing threshold shift along with time required for recovery of a hearing threshold toward its initial value. Noise exposure levels, its duration along with hearing examination have been first measured in the acoustically controlled environment. Puretone audiometry has been used for hearing examination. This has been conducted in constant time intervals, during noise exposure as well as during resting time (time required for hearing recovery). The examination aims at measuring hearing threshold at 4 kHz. The important part of this study is validation of the dosimeter performance in the real noise exposure situation. In this case the whole noise measurement scenario encompasses both noise exposure effects, and hearing examination before and after noise exposure. The hearing examination has been extended by the distortion products otoacoustic emission method (DPOAE). The measurement results obtained in real conditions have been compared with those which were computed by means of the presented psychoacoustical noise dosimeter.

Streszczenie W pracy przedstawiono projekt i realizację nowego psychoakustycznego dozymetru hałasowego. Jego działanie jest oparte na uwzględnieniu własności psychoakustycznych słuchu oraz charakterystyki czasowej i częstotliwościowej hałasu. Opracowany dozymetr umożliwia estymację przesunięcia progu słyszenia w pasmach krytycznych w czasie rzeczywistym. Możliwe jest zatem ciągłe monitorowanie stanu słuchu osób przebywających w niekorzystnych warunkach akustycznych. Ponadto, opracowany dozymetr wyznacza czas niezbędny do odzyskania stanu słuchu sprzed ekspozycji. W pierwszej kolejności przedstawiono wyniki pomiarów wpływu hałasu na słuch uzyskane w warunkach laboratoryjnych. Badania słuchu wykonano za pomocą audiometrii tonalnej. W warunkach laboratoryjnych słuch badano w stałych interwałach czasowych, w czasie ekspozycji oraz w fazie odpoczynku. Wyznaczano próg słyszenia dla częstotliwości 4 kHz. Istotnym elementem pracy są badania porównawcze dotyczące poprawności działania psychoakustycznego dozymetru hałasowego w warunkach rzeczywistych. Badania te obejmowały ekspozycję na hałas, pomiary słuchu przed i po ekspozycji na hałas. Z kolei badania słuchu rozszerzono o badanie metodą otoemisji akustycznych produktów zniekształceń nieliniowych ślimaka (DPOAE). Wyniki uzyskane w warunkach rzeczywistych porównano z estymacją skutków ekspozycji na hałas, określoną przez opracowany dozymetr.

Entry No. 206

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title Microelectronics applications to communication senses diagnostics and therapy

Polish title Zastosowania mikroelektroniki do diagnostyki i terapii zmysłów komunikacji

Conference IX Konferencja Naukowa - Technologia Elektronowa (ELTE)

Preprint

Number

Volume P-8

Pages 36

Conference site Kraków, Polska

Conference date 4.9.2007- 7.9.2007

Abstract Taking advantage of the recent progress in Digital Signal Processor (DSP) developments, a portable and reprogrammable digital hearing aid can be easily designed. Furthermore, taking into account the-state-of-the-art in the research in the microelectronics, it is possible to produce a sophisticated hearing aid in which complex algorithms of signal processing will be implemented. The hearing aid developed and algorithms engineered will be demonstrated in the course of this paper. Another microelectronic technology application to hearing aids is the proposed contactless hearing aid is designated to be attached to the infant’s crib for sound amplification in a free field. It consists of electret microphone matrix, and a prototype DSP board. The compressed speech is transmitted and amplified via miniature loudspeakers. Algorithms that are worked out deal with parasitic feedback, which occurs due to the small distance between microphone and monitors in terms of potentially high amplification required. Tinnitus (ear noises) are usually defined as perceived sound sensation without acoustic external stimuli. Utilizing ear noise maskers often brings desired effects in reducing bothersome effects of Tinnitus. The new approach to Tinnitus induces the invention of new more effective methods of diagnosing and treatment, which can be called as an “ear dithering”. The approach uses microelectronic technology to produce a small wearable ultrasound Tinnitus masker which will be discussed during the paper presentation. The method of transposition of speech frequency is applied to a digital speech corrector, also called SDSA (Subminiature Digital Speech Aid). The device has an ultra-compact digital signal processor DSP and is used for testing a number of algorithms for correcting stuttering. By using the DSP processor we can process the speech sound in real time. Thanks to this, the small device can hold more complicated methods of correction. The technology standing behind the device will be discussed in the course of the paper. Another microelectronic technology serving speech impaired patients is artificial larynx. The artificial larynx engineered was equipped with a digital processor and an amplifier. The spectral subtraction algorithm for noise reduction was utilized. In this method, an average signal spectrum and average noise spectrum are estimated and subtracted from each other, thus average signal-to-noise ratio (SNR) is improved. The second approach uses a PDA (portab1e digital assistant) to generate synthetic speech. Finally, state-of-the-art assistive technologies helping blind and visually impaired patients will be reviewed and demonstrated on the basis of some advanced research examples.

Streszczenie Postępy technologiczne w dziedzinie cyfrowego przetwarzania sygnałów umożliwiają projektowanie reprogramowalnych cyfrowych protez słuchu, wykorzystujących złożone algorytmy przetwarzania sygnałów fonicznych. Referat prezentuje opracowane i zaiplementowane algorytmy cyfrowych protez słuchu. Innym tego typu zastosowaniem technologii mikroelektronicznych jest bezkontaktowa proteza słuchu, mocowana w łóżeczku niemowlęcia, będąca przedmiotem oryginalnego opracowania. Podstawowy problem techniczny, jakim jest eliminacja pasożytniczych sprzężeń zwrotnych w tego typu aplikacji jest rozwiązywany prze algorytm cyfrowego przetwarzania sygnałów zaimplementowany na skosntruowanym module procesora sygnałowego. Koleja część referatu dotyczy problematyki szumów usznych (Tinnitus). Dla potrzeb pacjentów cierpiących na ten rodzaj dolegliwości słuchowej opracowano aplikację wykorzystującą linearizację charakterystyki kwantyzacji sygnałów akustycznych na drodze słuchowej. Cyfrowe przetwarzanie sygnałów zastosowane w miniaturowym urządzeniu dla osób jąkających się o nazwie SDSA poprawia płynność mowy, zaś najnowsza aplikacja wykorzystująca przetwarzanie sygnału mowy jest wdrażana w postaci sztucznej krtani dla osób laryngektomowanych oraz syntetyzera mowy. Ostatnia część referatu dotyczy przeglądu zastosowań mikroelektroniki w protetyce osób ociemniałych. Słowa kluczowe: cyfrowe przetwarzanie sygnałów; aparaty słuchowe; korektor mowy; syntetyzer mowy; pomoce dla niewidomych

Entry No. 207

Entry type conference paper

Authors A. Czyżewski, P. Maziewski

English title Some Techniques For Wow Effect Reduction

Polish title Wybrane techniki redukcji zniekształceń kołysania dźwięku

Conference IEEE International Conference on Image Processing (ICIP)

Preprint

Number

Volume IV

Pages 29 - 32

Conference site San Antonio, Texas, USA

Conference date 16.9.2007- 19.9.2007

Abstract Wow distortion reduction has not attracted an adequate scientific attention so far. Only few papers on the subject are available, concerning mostly archive gramophone records, wax cylinders, and magnetic tapes affected by wow. This paper outlines researched wow reduction algorithms concerning archive movie soundtracks, or more generally audio recordings accompanying archival visual contents. The methods presented here are based on the pilot tone tracking, on the spectral analysis of genuine audio components, and on non-uniform resampling. The paper provides only a short overview of the concepts founding those methods; other studied approaches to the wow processing, as well as a more detailed description of the presented ones, can be found in referenced papers.

Streszczenie Redukcja kołysania dźwięku jest tematem, któremu do tej pory nie poświęcono wystarczającej uwagi. Niewielka liczba dostępnych opracowań koncentruje się na redukcji kołysania w nagraniach winylowych, fonograficznych oraz magnetycznych. Niniejsze opracowanie prezentuje algorytmy do redukcji kołysania w archiwalnych filmowych ścieżkach dźwiękowych. Prezentowane algorytmy wykorzystują zarówno technikę śledzenia tonów pilotujących jak i metody śledzenia składowych widmowych sygnału fonicznego. Ostateczna redukcja kołysania realizowana jest przy użyciu technik nierównomiernego przepróbkowania. Referat prezentuje jedynie krótki przegląd opracowanych algorytmów. Inne metody redukcji kołysania jak i bardziej szczegółowy opis tych zaprezentowanych w niniejszym referacie, można odnaleźć w materiałach wskazanych w bibliografii.

Entry No. 208

Entry type conference paper

Authors P. Odya, A. Czyżewski

English title Special Hearing Aid for Stuttering People

Polish title Specjalne aparaty słuchowe dla osób jąkających się

Conference 123rd AES Convention

Preprint 7293

Number

Volume

Pages

Conference site Nowy Jork, USA

Conference date 5.10.20078.10

Abstract Owing to recent progress in digital signal processors developments it has been possible to build a subminiature device combining speech and hearing aid. Furthermore, despite its small dimensions, the device can execute quite complex algorithms and can be easily reprogrammed. The paper puts an emphasis on issues related to the design and implementation of algorithms applicable to both speech and hearing aids. Frequency shifting or delaying the audio signal are often used for speech fluency improvement. The basic frequency altering algorithm (FAF) is similar to the sound compression algorithm used in some special hearing aid as above. Therefore, the experimental device presented in the paper provides a universal hearing & speech aid which may be used by hearing or by speech impaired persons or by persons suffering from both problems, simultaneously.

Streszczenie Dzięku postępowi w dziedzienie cyfrowego przetwarzania sygnałów możliwe stało zbudowanie subminiaturowego urządzenia łączącego funkcje aparatu słuchowego i korektora mowy. Takie urządzenie, mimo niewielkich rozmiarów, jest w stanie wykonywać skomplikowane alggorytmy a jego oprogramowanie może być łatwo zmieniane. W pracy skupiono się na zagadnieniach związanych z opracowniem prototypu i implementacją algorytmów korekcji słuchu i mowy. W celu poprawy płynności mowy stosuje się najczęściej transpozycję widmową mowy lub jej opóźnianie. Algorytmy transpozycji widmowej również mogą być wykorzystane w apratach słuchowych. Stąd opisywane w pracy prototypowe urządzenie może być wykorzystane zarówno przez osoby mające problemy ze słuchem, jak i jąkające się.

Entry No. 209

Entry type conference paper

Authors A. Czyżewski, P. Odya, B. Kostek, P. Szczuko

English title New Generation Artificial Larynx

Polish title Nowe narzędzia dla osób laryngektomowanych

Conference

Preprint 7285

Number

Volume

Pages

Conference site Nowy Jork, USA

Conference date 5.10.2007- 8.10.2007

Abstract The aim of the presented paper is to show a new generation of devices for laryngectomy patients. The artificial larynx has many disadvantages. The major problem is a background noise caused by the device. There are two different approaches to solve this task. The first one focuses on the artificial larynx. The artificial larynx engineered was equipped with a digital processor and an amplifier. Two algorithms, namely spectral subtraction algorithm and the comb filter were proposed for noise reduction. The second approach employs PDA to generate speech. A speech synthesis is performed, allowing for playing back any sentence, therefore any text can be entered by a user, and played through PDA speaker.

Streszczenie Celem badań opisanych w pracy było opracowanie urządzeń nowej generacji dla osób laryngektomowanych. Typowa sztuczna krtań ma wiele wad. Najpoważniejszym problemem jest warkot generowany przez urządzenie. Zaproponowane zostały dwa rozwiązania majace na celu wyeliminowania tego problemu. Pierwsze skupia się na zmianach w konstrukcji sztucznej krtani. Opracowane urządzenie zostało dodatkowo wyposażone w cyfrowy procesor i wzmacniacz. W celu redukcji zakłóceń wykorzystano dwa algorytmy: odejmowanie widmowe i filtrację grzebieniową. Drugie rozwiązanie bazuje na komputerze typu PDA służacym do generowania mowy. Wykorzystano algorytmy syntezy mowy, co pozwala na odtwarzanie dowolnych wypowiedzi.

Entry No. 210

Entry type journal paper

Authors A. Czyżewski, H. Skarżyński

English title Multimedia Applications for the Hearing Impaired

Polish title Aplikacje multimedialne dla osób z uszkodzeniami słuchu

Journal Archives of Acoustics

Volume 32

Number 3

Pages 491 - 504

Abstract Hearing impairment is one of the fastest growing diseases of modern society. Therefore it is very important to develop new methods for diagnosis and therapy of hearing disorders. Some of them were introduced to practice as a result of a co-operation between institutions mentioned in the header. The system for mass-scale hearing screening is one of multimedia programs for testing communication senses introduced by the authors. The further developments include among others an application of dithering theory to practical solutions for tinnitus patients and a method of fitting hearing aids employing soft computing. The implemented hearing diagnostic & therapy applications and systems with their underlying concepts are reviewed in this paper.

Streszczenie Jednym z elementów przeciwdziałania szybko narastającym zagrożeniom niedosłuchem są aplikacje oprate na nowoczesnych technologiach. W wyniku współpracy Katedry Systemów Multimedialnych PG z warszawskim Instytutem Fizjologii i Patologii Słuchu opracowano wdrożone na szeroką skale multimedialne systemy przesiewowych badań zmysłów komunikowania się. Ponadto, nowsze opracowania dotyczą maskowania szumów usznych z wykorzystaniem linearyzacji charakterystyki transmisyjnej słuchu z użyciem szumu. Multimedialne techniki dopasowania aparatów słuchowych do potrzeb pacjenta, wykorzystujące przy tym inteligentne techniki obliczeniowe stanowią jeszcze jedno pole aplikacji praktycznych, spośród tych, których przegląd stanowi temat artykułu. Słowa kluczowe: przesiewowe badania słuchu, szumy uszne, dopasowanie protez słuchu

Entry No. 211

Entry type conference paper

Authors A. Czyżewski, B. Kostek, Ł. Kosikowski, L. Śliwa, H. Skarżynski

English title Examining possibilities of transmitting signals to inner ear employing bone-conductive ultrasound carrier

Polish title Badanie możliwości transmisji sygnałów do ucha wewnętrznego z wykorzystaniem przewodnictwa kostnego ultradźwięków.

Conference 6th European Congress of Oto-Rhino-Laryngology, Head and Neck Surgery

Preprint

Number

Volume

Pages

Conference site Wiedeń, Austria

Conference date 30.6.2007- 4.7.2007

Notes plakat

Abstract The ultrasound harmonic signal was modulated with some harmonic tones of various frequency and level. The „ultrasound audiogram” was determined on this basis revealing the possibility to transmit low-frequency modulation components to the inner ear basing on ultrasound bone conduction. The bone-conductive ultrasound transmission characteristics were estimated during the experiment in which 2 ultrasound transducers were utilized: the first one acting as an excitor and the second one as a monitor. It was found that the ultrasound bone conduction may influence shape of tonal audiograms. Therefore, the experimental results demonstrate the possibility to receive of ultrasounds by the cochlea through bone conduction.

Streszczenie Ultradźwiękowy sygnał harmoniczny został zmodulowany kilkoma tonami harmonicznymi o różnej czestotliwości i poziomie. Uzyskany na tej podstawie „audiogram ultradźwiękowy” ujawnia możliwość transmisji komponentów niskoczęstotliwoścowych do ucha wewnętrznego bazując na ultradźwiękowym przewodnictwie kostnym. Charakterystyki przenoszenia ultradźwięków poprzez przewodnictwo kostne wyznaczono z wykorzystaniem 2 przekaźników ultradźwiękowych: pierwszy pełnił funkcje pobudzającą, a drugi monitorującą. Zauważono, że przewodnictwo kostne ultradźwięków może wpływać na kształt audiogramów tonalnych. Wyniki eksperymentu demonstrują możliwości odbioru ultradźwięków przez ślimak drogą przewodnictwa kostnego.

Entry No. 212

Entry type journal paper

Authors P. Żwan, B. Kostek, P. Szczuko, A. Czyżewski

English title Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets

Polish title

Journal Lecture Notes In Artificial Intelligence-proc. of Rough Sets and Intelligent Systems Paradigms

Volume

Number 4585

Pages 793 - 802

Abstract The aim of the research study presented in this paper is the automatic singing voice recognition. For this purpose a database containing singers’ sample recordings has been constructed and parameters are extracted from recorded voices of trained and untrained singers of various voice types. Parameters, which are especially designed for the analysis of the singing voice are described and their physical interpretation is given. Decision systems based on artificial neutral networks and rough sets are used for automatic voice type/voice quality classification. Results obtained in the automatic classification performed by both decision systems are then compared and conclusions are derived.

Streszczenie Celem prac opisanych w referacie jest automatyczne rozpoznawanie głosów śpiewaczych. Do tego celu utworzona została baza nagrań próbek śpiewu profesjonalnego i amatorskiego. Próbki poddane zostały parametryzacji parametrami zaproponowanymi przez autorów ściśle do tego celu. Sposób wyznaczenia parametrów i ich interpretacja fizyczna przedstawione są w referacie. Parametry wprowadzane są do systemów decyzyjnych, klasyfikatorów opartych o sztuczne sieci neuronowe oraz o zbiory przybliżone. Zadaniem klasyfikatorów jest określenie typu i jakości głosu. Zawarto porównanie wyników uzyskanych dla sieci neuronowych i zbiorów przybliżonych. Podano wnioski.

Entry No. 213

Entry type journal paper

Authors A. Czyżewski, J. Kotus, B. Kostek, M. Szczodrak

English title Multimedia Noise Monitoring System

Polish title Multimedialny System Monitorowania Hałasu

Journal Bezpieczeństwo Pracy

Volume

Number 7-8

Pages 8 - 11

Abstract A concept and an implementation of the multimedia computer system for the monitoring of environmental noise threats is presented. The principal aim of the project is to improve the effectiveness of prophylaxis of hearing diseases. It allows to receive, store, analyze and visualize a noise data coming from noise measurement equipments and from electronic questionnaires accessible through the Internet. Moreover a new kind of the authors’ concept of the psychoacoustic noise dosimetry was also presented in the paper. The designed noise dosimeter enables to assess temporary threshold shift (TTS) during noise exposure. In this way it is possible to monitor the hearing threshold shift continuously for people who stay in the harmful noise conditions.

Streszczenie W artykule przedstawiono Multimedialny System Monitorowania Hałasu. Projekt jest sieciocentrycznym systemem dedykowanym monitorowaniu zagrożeń hałasem. Jego nadrzędnym celem jest zwiększenie skuteczności w zakresie profilaktyki chorób słuchu. Umożliwia pobieranie, gromadzenie, analizę i wizualizację danych dotyczących hałasu, pobieranych ze zdalnych urządzeń pomiarowych oraz elektronicznych ankiet dostępnych przez Internet. Ponadto w artykule przedstawiono autorską koncepcję psychoakustycznego dozymetru hałasowego. Dozymetr ten umożliwia wyznaczenie czasowego przesunięcia progu słyszenia podczas trwania ekspozycji na hałas. Możliwe jest dzięki temu ciągłe monitorowanie progu słyszenia osób przebywających w warunkach szkodliwego oddziaływania hałasu.

Entry No. 214

Entry type conference paper

Authors A. Czyżewski, J. Kotus, B. Kostek

English title Application of the psychoacoustic noise dosimeter for the determination of noise impact on hearing

Polish title Zastosowanie psychoakustycznej dozymetrii hałasowej do określenia wypływu hałasu na słuch

Conference Inter-Noise 2007

Preprint

Number

Volume

Pages 10

Conference site Istanbuł, Turcja

Conference date 28.8.2007- 31.8.2007

Abstract The new research results regarding the noise impact on hearing applying the authors’ concept of the psychoacoustic noise dosimetry (PND) were presented in the paper. In the fist part the noise and hearing examination conducted in the acoustically controlled environment were shown. The noise level was equal to 88 dB(A). The band-pass white noise, limited to the range of 1–6 kHz was used as a stimulus signal. The hearing threshold was examined using pure-tone audiometry for 4 kHz only. This experiment depended on simultaneous measuring both noise and hearing. Firstly, hearing was examined directly before the noise exposure. Next, the noise exposure phase started. The hearing was examined every 10 minutes for particular person. The total time of noise exposure was equal to 30 minutes. The hearing examinations were conducted also during subjects’ resting time (time required for hearing recovery). The main aim of this research was to determine the time constant of the TTS effect disappearance. The results were used for the optimization of the designed PND performance. In the last part of the paper the validation of the PND was presented considering real noise exposure conditions. In the course of further research the noise impact on hearing for this purpose was carried-out. The hearing of attendees was examined twice in this case, directly before and immediately after the noise exposure. The pure-tone audiometry and the distortion products otoacoustic emission method (DPOAE) were used. The extended noise dose analysis was performed on the basis of the obtained results employing the PND algorithm. The presented PND algorithm reflects correctly the hearing temporary threshold shift (TTS) changes produced by the noise. The computing of such parameters as: a real time assessment of the TTS in critical bands, time required for recovery of a hearing threshold to its initial value enables a very precise determination of hearing abilities of subjects under noise exposure.

Streszczenie W referacie przedstawiono nowe wyniki badań wpływu hałasu na słuch z zastosowaniem autorskiej koncepcji psychoakustycznej dozymetrii hałasowej. W pierwszej części pracy przedstawiono wyniki badań hałasu i słuchu przeprowadzone w warunkach laboratoryjnych. Poziom hałasu wynosił 88 dB(A). Sygnałem testowym był szum biały, odfiltrowany do przedziału częstotliwości 1000 – 6000 Hz. Próg słyszenia badano za pomocą audiometrii tonalnej dla częstotliwości 4 kHz. Eksperyment polegał na jednoczesnym pomiarze hałasu i charakterystyk słyszenia. W pierwszej kolejności zbadano słuch bezpośrednio przed ekspozycją na hałas. Następnie rozpoczęła się faza ekspozycji. Co 10 minut poszczególnych osobom badano słuch, zarówno w fazie ekspozycji jak i odpoczynku. Całkowity czas przebywania w hałasie wynosił 30 minut. Celem badań było wyznaczenie stałych czasowych zmian efektu czasowej zmiany progu słyszenia (TTS). Wyniki zostały wykorzystane do optymalizacji opracowanego psychoakustycznego dozymetru hałasowego. W ostatniej części pracy zamieszczono informacje na temat weryfikacji opracowanego dozymetru, przeprowadzonej w warunkach rzeczywistego narażenia na hałas. W tym celu przeprowadzono badania wpływu hałasu na próg słyszenia osób przebywających w klubach studenckich. W tym przypadku badania słuchu wykonano dwukrotnie, bezpośrednio przed i po ekspozycji na hałas. Wykorzystano metodę audiometrii tonalnej oraz badanie otoemisji akustycznej produktów zniekształceń nieliniowych ślimaka (DPOAE). Na podstawie uzyskanych wyników pomiarów przeprowadzono rozszerzoną analizę dozymetryczną z zastosowaniem dozymetru psychoakustycznego. Opracowany algorytm we właściwy sposób odzwierciedla zmiany progu słyszenia wywołane hałasem. Wyznaczanie takich parametrów jak: aktualne przesunięcie progu słyszenia, czas niezbędny do powrotu progu słyszenia do stanu początkowego, w czasie trwania ekspozycji umożliwia bardzo dokładną kontrolę stanu słuchu osób narażonych na hałas.

Entry No. 215

Entry type conference paper

Authors A. Czyżewski, J. Kotus

English title

Polish title MONITORING STANU ŚRODOWISKA - NOWE SZANSE TECHNOLOGICZNE

Conference I Pomorskia Konferencja JAKOŚĆ POWIETRZA

Preprint

Number

Volume

Pages 32 - 36

Conference site Gdańsk, Polska

Conference date 25.6.2007

Entry No. 216

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski

English title

Polish title Zastosowanie superkomputera do tworzenia dynamicznych map hałasu

Journal Zeszyty Naukowe Wydziału ETI PG

Volume 16

Number 6

Pages 305 - 310

Streszczenie W artykule przedstawiono koncepcję i implementację Modelu Dynamicznego Prognozowania Hałasu przeznaczonego do tworzenia map hałasu. Omówiony został cel wykonania powstałej aplikacji. Zawarto krótki opis użytego sprzętu. Omówiono poszczególne elementy Modelu Dynamicznego Prognozowania Hałasu oraz zastosowane metody. Przedstawiono zagadnienia związane z implementacją algorytmów na klastrze komputerowym. Zaprezentowano również rezultaty eksperymentów otrzymanych podczas eksploatacji modelu prognozowania hałasu.

Entry No. 217

Entry type conference paper

Authors A. Czyżewski, Ł. Kosikowski, M. Kurkowski, A. Szkiełkowska, K. Kochanek, H. Skarżyński

English title Mobile multimedia hearing and speech screening applications

Polish title Aplikacje do przesiewowego badania słuchu i mowy na urządzenia mobilne

Conference III Konferencja Naukowo-Szkoleniowa Sekcji Foniatrycznej i Sekcji Audiologicznej Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 8.5.2008- 10.5.2008

Notes plakat

Abstract The aim of the presented work was to elaborate and implement the mobile prototype version of the devices meant for speech and hearing screening tests. An essential feature of the presented systems is their ability to exmine hearing means of tonal audiometry, and to test speech perception in noise. The system can also evaluate: phonematic hearing, phonetic hearing, functioning of speech production organs motor, articulation, vocabulary and grammar correctness and others.

Streszczenie Celem projektu było opracowanie i implementacja prototypowej wersji urządzeń mobilnych do badania przesiewowego słuchu i mowy. Główną cechą urządzenia jest możliwość wykonania badań słuchu z wykorzystaniem audiometrii tanalnej oraz badania zrozumiałości mowy w szumie. System umożliwia również badanie słuchu fonematycznego, fonetycznego, motoryki organów mowy, artykulacji, słownictwa, gramatyki i in.

Entry No. 218

Entry type conference paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title Objects classification based on their physical sizes for detection of events in camera images

Polish title

Conference NTAV/SPA 2008 Signal Processing: Algorithms, Architectures, Arrangements, and Applications; New Trends in Audio and Video

Preprint

Number

Volume

Pages 15 - 20

Conference site Poznań, Polska

Conference date 25.9.2008- 27.9.2008

Abstract In the paper, a method of estimation of the physical sizes of the objects tracked in the video surveillance system, and a simple module for object classification based on the estimated physical sizes, are presented. The results of object classification are then used for automatic detection of various types of events in the camera image.

Entry No. 219

Entry type journal paper

Authors M. Kulesza, A. Czyżewski

English title Novel approaches to wideband speech coding

Polish title Nowe metody szerokopasmowego kodowania sygnału mowy

Journal GESTS International Transactions on Computer Science and Engineering

Volume 44

Number 1

Pages 154 - 165

Abstract Two methods for encoding wideband speech are presented and discussed. In the first approach the pair of time compression and expansion procedures is employed in order to enable coding of the wideband speech signal using standardized narrowband coding algorithms. The proposed method is dedicated to adaptive speech coding algorithms operating in very low bit-rate modes, as it does not affect a codec bit-rate. The second investigated approach to wideband speech coding relies on an accurate spectrum envelope estimation procedure employing psychoacoustic criterion. As the spectrum envelope is estimated basing on appropriately selected set of an unmasked tonal and noise components, the novel algorithm for tonality detection is considered. The experimental results concerning the first presented method for wideband speech coding as well as promising results regarding tonality detection are presented and discussed.

Streszczenie Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy wykorzystujacego kryteria psychoakustyczne. Ponieważ w metodzie tej obwiednia widma okreslana jest na podstawie odpowiednio dobranych słyszalnych komponentów tonalnych i szumowych zaproponowano nową metodę detekcji tych komponentów. Praca zawiera wyniki eksperymentów dotyczące pierwszej metody kodowania oraz związanych z zaproponowana metodą detekcji komponentów tonalnych w widmie sygnału mowy.

Entry No. 220

Entry type conference paper

Authors A. Czyżewski, P. Maziewski, A. Kupryjanow, M. Papaj

English title Methods for Reducing the Audio-Visual Distortions Developed in the European PrestoSpace Project

Polish title Metody korekcji zniekształceń dźwięku i obrazu opracowane w ramach europejskiego projektu PrestoSpace

Conference Krajowa Konferencja Radiokomunikacji, Radiofonii i Telewizji 2008

Preprint

Number

Volume

Pages 465 - 468

Conference site Wrocław, Polska

Conference date 9.4.2008- 11.4.2008

Abstract This paper presents the overview of the research and development done within the PrestoSpace Project in the 6th framework programme of the European Union. Methods for audio distortion removal, such as broad-band noise and wow and flutter, were presented as well as methods for picture distortions corrections such as movie tape shrinkage. The developed algorithms were positively evaluated by the archivists which were using them to restore real-life archival audio-visual recordings.

Streszczenie W referacie przedstawiono przegląd prac badawczo-wdrożeniowych wykonanych przez autorów w ramach projektu PrestoSpace w 6. Programie Ramowym Unii Europejskiej. Opisano metody i algorytmy korekcji zniekształceń fonicznych, takich jak szum szerokopasmowy oraz kołysanie i drżenie dźwięku, a także korekcji zniekształceń obrazu spowodowanych skurczem taśm filmowych. Wdrożone algorytmy zostały pozytywnie ocenione przez archiwistów stosujących je w odniesieniu do archiwalnych nagrań foniczno-wizyjnych.

Entry No. 221

Entry type conference paper

Authors P. Maziewski, A. Kupryjanow, A. Czyżewski

English title Drift, Wow and Flutter Measurement and Reduction in Shrunken Movie Soundtracks

Polish title Wyznaczenie charakterystyki i redukcja dryfu, kołysania i drżenia w skurczonych taśmach filmowych

Conference 124 AES Convention

Preprint 7392

Number

Volume

Pages

Conference site Amsterdam, Holandia

Conference date 17.5.2008- 20.5.2008

Abstract The paper presents the method and algorithms used to determine and reduce drift, wow and flutter in shrunken movie tapes. The idea behind the algorithms is to use image processing for calculating the local tape shrinkage which is one of the reasons for drift, wow and flutter. The shrinkage can be calculated via analyzing the image height of: a movie frame, sprocket hole, pitch or another standardized movie tape element; and then it can be expressed as the drift, wow and flutter characteristic. After the characteristic determination both the soundtrack and movie frames can be corrected. The paper presents the description of the image based drift, wow and flutter determination method and the experiments confirming the theoretical findings.

Streszczenie Referat przedstawia metodę i algorytmy do oceny i redukcji dryfu, kołysania i drżenia w skurczonych taśmach filmowych. Przedstawione algorytmy działają w oparciu o przetwarzaniu obrazu taśmy filmowej w celu wyznaczenia jej lokalnego skurczu, który jest jednym z powodów powstawania dryfu, kołysania i drżenia. Lokalny skurcz taśmy można wyznaczyć dzięki analizie wysokości: ramki obrazu, perforacji, lub innego, standaryzowanego elementu taśmy. Następnie skurcz można przedstawić w formie charakterystyki dryfu, kołysania i drżenia. Po wyznaczeniu charakterystyki zarówno filmowa ścieżka dźwiękowa jak i klatki filmu mogą być skorygowane. Referat przedstawia opis metody wyznaczenia dryfu, kołysania i drżenia za pomocą algorytmów analizy obrazu taśmy filmowej oraz eksperymenty potwierdzające przedstawioną teorię.

Entry No. 222

Entry type conference paper

Authors P. Odya, A. Czyżewski

English title New Generation Speech Aid for Stuttering People

Polish title Korektor mowy nowej generacji

Conference 55 Otwarte Seminarium z Akustyki

Preprint

Number

Volume

Pages 463 - 468

Conference site Wrocław-Piechowice, Polska

Conference date 8.9.2008- 12.9.2008

Abstract Modern Digital Signal Processors (DSP) may have small dimensions and very low current con-sumption, but they are able to execute complex algorithms. In addition, they can be easily reprogrammed using a standard PC computer. Taking advantage of these processors, it was possible to build a device, which can be used either as a speech aid or a hearing aid, or both. The paper placed emphasis on issues related to the implementation of algorithms applicable to speech aids. For example, spectral compression or delaying the audio signal are often used for speech fluency improvement, thus they are shortly reviewed in the paper. Some additional algorithms which can improve the quality of the audio signal are also described. Clinical tests proved that the SDSA improves speech fluency. A detailed description of tests carried out and of their results is included in the paper.

Streszczenie Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów mających zastosowanie w korektorach mowy. Przykładowo, opóźnienie sygnału mowy bądź jego kompresja widmowa często powoduje wzrost płynności mowy u osób jąkających się. W referacie zawarto także opisy dodatkowych algorytmów zwiększających jakość przetwarzanych sygnałów dźwiękowych oraz informacje na temat stworzonego subminiaturowego korektora mowy. Testy kliniczne wskazują, że opracowane urządzenie poprawia płynność mowy osób jąkających się. Opis testów ich wyniki przedstawiono w referacie.

Entry No. 223

Entry type conference paper

Authors P. Suchomski, B. Kostek, A. Czyżewski

English title HEARING AID FITTING METHOD BASED ON FUZZY LOGIC PROCESSING

Polish title PRZETWARZANIE ROZMYTE W METODZIE DOPASOWANIA APARATÓW SŁUCHOWYCH

Conference 55 Otwarte Seminarium z Akustyki OSA 2008

Preprint

Number

Volume

Pages 481 - 486

Conference site Piechowice, Polska

Conference date 8.9.2008- 12.9.2008

Abstract One of the most important steps in a hearing aids fitting procedure is determining hearing dynamic characteristics. The hearing dynamic characteristics are typically calculated on the basis of loudness scaling test results. The problem is that the loudness scaling test results are presented on a loudness category scale, but a hearing prosthesis requires numerical parameters to be fed. A fuzzy logic method is useful for processing parameters expressed in human natural language. In this paper a fuzzy logic-based system for loudness scaling result processing is shortly presented. On the basis of the developed fuzzy system a way to shorten the loudness scaling test was found out.

Streszczenie Ważnym etapem dopasowania współczesnych aparatów słuchowych jest wyznaczanie charakterystyki dynamiki słuchu. Charakterystyka ta wyznaczana jest na podstawie wyników testu skalowania głośności. Niestety wyniki te wyrażone są w skali kategorii głośności, natomiast aparaty słuchowe wymagają para-metrów numerycznych. Problem ten można rozwiązać za pomocą logiki rozmytej. W niniejszym referacie przedstawiono metodę przetwarzania rozmytego wyników testu skalowania głośności. Na bazie opraco-wanej metody pokazano również sposób skrócenia testu skalowania głośności.

Entry No. 224

Entry type journal paper

Authors G. Szwoch, A. Czyżewski, M. Kulesza

English title A low complexity double-talk detector based on the signal envelope

Polish title

Journal Signal Processing

Volume 88

Number 11

Pages 2856 - 2862

Abstract A new algorithm for double-talk detection, intended for use in the acoustic echo canceller for voice communication applications, is proposed. The communication system developed by the authors required the use of a double-talk detection algorithm with low complexity and good accuracy. The authors propose an approach to doubletalk detection based on the signal envelopes. For each of three signals: the far-end speech, the microphone signal and the echo estimate, an envelope is detected. Next, using these envelopes, a detection function is determined and compared to the threshold. Additionally, a dynamic threshold is introduced in order to improve the accuracy of the algorithm. The results of the simulations presented in the paper proved that the accuracy of double-talk detection obtained using the proposed algorithm is higher than in the Geigel algorithm and comparable to the correlation-based methods, while the computational complexity of the proposed method remains at an acceptable level. The double-talk detection algorithm presented here may be used in voice communication systems having limited resources, allowing for accurate double-talk detection and, as a consequence, efficient acoustic echo cancellation.

Entry No. 225

Entry type conference paper

Authors M. Reiter, J. Kotus, A. Czyżewski

English title Optimizing localization of noise monitoring stations for the purpose of inverse engineering applications

Polish title

Conference Acoustics 08

Preprint

Number

Volume

Pages

Conference site Paryż, Francja

Conference date 18.5.2008- 21.5.2008

Abstract Long-term environmental monitoring of noise levels can be done using autonomous measurement stations. Because of the high cost of monitoring systems and management of these stations, it is essential to identify how many of measuring localization points are really required. In cases related to complex noise generation schemes, when there are various noise sources, the differences between calculations and measurements can be difficult to estimate. Therefore, it is vital to find some most appropriate locations for measurement stations which would ensure obtaining an adequate number of measurement results to be employed in the reverse engineering. These measurements can be then utilized to update dynamic noise maps. Furthermore, predictive noise models may be developed accordingly to certain local requirements. This could result in a better accuracy of dynamic noise maps. The paper focuses on defining the proper choice of the measurement points localizations. The experiments described include a comparison between real-life measurement results performed with the Multimedia Noise Monitoring System developed at the Multimedia Systems Department of the Gdansk University of Technology and the noise level prediction results. The optimization of the number and location of noise monitoring points with regard to the measurement accuracy is also discussed.

Entry No. 226

Entry type conference paper

Authors J. Kotus, B. Kostek, A. Czyżewski

English title The noise induced harmful effects assessment using psychoacoustical noise dosimeter

Polish title

Conference Acoustics 08

Preprint

Number

Volume

Pages

Conference site Paryż, Francja

Conference date 29.6.2008- 4.7.2008

Abstract A new way of assessment of noise-induced harmful effects on human hearing system was presented in the paper. Employing the developed psychoacoustical noise dosimeter the new indicators of noise harmfulness were verified on the basis of hearing examinations and noise measurement results. The indicators were based on some psychoacoustical properties of the human hearing system and, at the same time, on evaluation of the time and frequency characteristics of noise. Additionally, time properties of the Temporary Threshold Shift are calculated during the noise exposure. The evaluation of the proposed indicators were conducted on the basis of hearing examinations in the real noise exposure situations and also on the basis of simulation results employing standard test signals (such as: white, pink and brown noise). The standard noise dose analysis results were also presented for the purpose of comparison. The performed analysis and obtained results confirmed correctness and practical usefulness of the proposed indicators.

Entry No. 227

Entry type journal paper

Authors M. Kulesza, G. Szwoch, A. Czyżewski

English title Improving signal quality of a speech codec using hybrid perceptual-parametric algorithm

Polish title Poprawa jakości kodowania sygnału mowy z wykorzystaniem algorytmu perceptualno-parametrycznego

Journal Int. J. of Intelligent Information and Database Systems

Volume 2

Number 3

Pages 354 - 369

Abstract A hybrid parametric-perceptual speech codec architecture is presented. The basic CELP parametric codec structure is enhanced using the perceptual coding method. The objective of the codec hybridisation is obtaining significant improvement in the perceived signal quality. Two hybrid architectures are proposed. The first one encodes voiced parts perceptually in the CELP residual signal. The second one divides the signal into voiced signal components that are encoded using the perceptual algorithm, unvoiced components that are encoded parametrically and transients remaining unencoded. The signal quality achieved using the hybrid codec is compared to the quality of some standard speech codecs.

Streszczenie W artykule zaprezentowano hybrydową architekturę parametryczno-perceptualną kodeka mowy. Jego podstawę stanowi kodek CELP, który wspomagany jest kodekiem perceptualnym. Celem zastosowania proponowanej metody jest uzyskanie poprawy jakości kodowania sygnału mowy. Badaniom poddano dwie architektury, z których w jednej dźwięczne części sygnału rezydualnego kodeka CELP kodowane są perceptualnie. Drugi z proponowanych kodeków dokonuje podziału sygnału na części dźwięczne które kodowane są perceptualnie, bedźwięczne kodowane parametrycznie oraz transjenty, które nie podlegają kompresji. Uzyskana jakość kodowania z wykorzystaniem proponowanych algorytmów została porównana z jakością wybranych algorytmów zaimplementowanych zgodnie ze standardami telekomunikacyjnymi.

Entry No. 228

Entry type journal paper

Authors P. Odya, A. Czyżewski

English title New Generation Speech Aid for Stuttering People

Polish title Korektor mowy nowej generacji

Journal Archives of Acoustics

Volume 33

Number 4

Pages 141 - 146

Abstract Modern Digital Signal Processors (DSP) may have small dimensions and very low current con-sumption, but they are able to execute complex algorithms. In addition, they can be easily reprogrammed using a standard PC computer. Taking advantage of these processors, it was possible to build a device, which can be used either as a speech aid or a hearing aid, or both. The paper placed emphasis on issues related to the implementation of algorithms applicable to speech aids. For example, spectral compression or delaying the audio signal are often used for speech fluency improvement, thus they are shortly reviewed in the paper. Some additional algorithms which can improve the quality of the audio signal are also described. Clinical tests proved that the SDSA improves speech fluency. A detailed description of tests carried out and of their results is included in the paper.

Streszczenie Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów mających zastosowanie w korektorach mowy. Przykładowo, opóźnienie sygnału mowy bądź jego kompresja widmowa często powoduje wzrost płynności mowy u osób jąkających się. W referacie zawarto także opisy dodatkowych algorytmów zwiększających jakość przetwarzanych sygnałów dźwiękowych oraz informacje na temat stworzonego subminiaturowego korektora mowy. Testy kliniczne wskazują, że opracowane urządzenie poprawia płynność mowy osób jąkających się. Opis testów ich wyniki przedstawiono w referacie.

Entry No. 229

Entry type journal paper

Authors P. Suchomski, B. Kostek, A. Czyżewski

English title HEARING AID FITTING METHOD BASED ON FUZZY LOGIC PROCESSING

Polish title PRZETWARZANIE ROZMYTE W METODZIE DOPASOWANIA APARATÓW SŁUCHOWYCH

Journal ARCHIVES of ACOUSTICS

Volume 33

Number 4

Pages 153 - 158

Abstract One of the most important steps in a hearing aids fitting procedure is determining hearing dynamic characteristics. The hearing dynamic characteristics are typically calculated on the basis of loudness scaling test results. The problem is that the loudness scaling test results are presented on a loudness category scale, but a hearing prosthesis requires numerical parameters to be fed. A fuzzy logic method is useful for processing parameters expressed in human natural language. In this paper a fuzzy logic-based system for loudness scaling result processing is shortly presented. On the basis of the developed fuzzy system a way to shorten the loudness scaling test was found out.

Streszczenie Ważnym etapem dopasowania współczesnych aparatów słuchowych jest wyznaczanie charakterystyki dynamiki słuchu. Charakterystyka ta wyznaczana jest na podstawie wyników testu skalowania głośności. Niestety wyniki te wyrażone są w skali kategorii głośności, natomiast aparaty słuchowe wymagają para-metrów numerycznych. Problem ten można rozwiązać za pomocą logiki rozmytej. W niniejszym referacie przedstawiono metodę przetwarzania rozmytego wyników testu skalowania głośności. Na bazie opraco-wanej metody pokazano również sposób skrócenia testu skalowania głośności

Entry No. 230

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title Automatic system for audio-video material reconstruction and archiving

Polish title System automatycznej rekonstrukcji i archiwizacji nagrań audio-wideo

Conference NTAV/SPA 2008

Preprint

Number

Volume

Pages 173 - 176

Conference site Poznań, Polska

Conference date 25.9.2008- 27.9.2008

Abstract The paper presents model of audio-video materials automatic reconstruction and archiving system. The idea of this solution is to make process of restoration more human-independent. It will reduce costs of reconstruction of processed records. Because of great number of audio-video materials it is necessary to create system that will intelligently index this data. It will help with more effective material searching.

Streszczenie Referat przedstawia propozycję modelu systemu automatycznej archiwizacji i rekonstrukcji nagrań audio-wideo. Założeniem tego rozwiązania jest uczynienie procesu rekonstrukcji nagrań bardziej niezależnym od człowieka. Ma to na celu redukcję kosztów rekonstrukcji przetwarzanych nagrań. Z powodu dużej liczby archiwalnych nagrań audio-wideo istnieje potrzeba stworzenia systemu który umożliwi automatyczną indeksację ich treści. Pomoże to w efektywniejszym znajdowaniu szukanych nagrań.

Entry No. 231

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski, J. Ejsmont

English title Road noise prediction results compared to real measurements in Gdansk surroundings

Polish title Porównanie wyników modelowania hałasu drogowego do rzeczywistych pomiarów w okolicach Gdańska

Conference Joint Baltic-Nordic Acoustics Meeting 2008

Preprint

Number

Volume

Pages

Conference site Reykjavik, Iceland

Conference date 17.8.2008- 19.8.2008

Abstract The comparison of noise levels obtained by model and measurement results is presented in this paper. The HARMONOISE road noise source prediction model and propagation method used for simulation are mentioned. The model implementation outcome is compared to measurement results. Inaccuracies, which are defined as differences between the results calculated by the model and the actual measurements under the same atmospheric conditions, are discussed. An attempt to analyze the error reasons is made. Consideration of possibility of employing road noise prediction model to dynamic noise mapping is also included.

Entry No. 232

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski, J. Kotus

English title Investigation of the road noise source employing an automatic noise monitoring station

Polish title Badanie modelu źródła hałasu drogowego z zastosowaniem automatyczej stacji pomiaru hałasu

Journal Archives of Acoustics

Volume 33

Number 4

Pages 77 - 83

Abstract The paper presents a pilot investigation of noise source models in two selected localizations in the context of future dynamic noise map creation. The experiments were carried out using the automatic noise monitoring station engineered at the Multimedia Systems Departmentof the Gda´nsk University of Technology. The results of the noise measurements employing monitoring stations and its comparison to the reference values are depicted. Short- and longterm studies of noise level are also presented. The experiments described include a comparison between environmental measurement results and the noise level prediction results. Data obtained from the NMPB-96 (Nouvelle Méthode de Prévision du Bruit) and the Harmonoise model data provide the subject of the analysis. The proposed solution of permanent noisemonitoring system is also shortly described.

Entry No. 233

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski, J. Kotus

English title Road noise mapping in the city area: measurements compared to model-based estimations

Polish title Odwzorowanie hałasu drogowego w mieście: porównanie pomiarów z obliczeniami modeli

Conference 55 Otwarte Seminarium z Akustyki

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 8.9.2008- 12.9.2008

Abstract The paper presents an approach to the verification of noise prediction models in selected localization in the city of Gdansk. The experiments described include a comparison between environmentalmeasurement results performed in the terrain and the noise level prediction results. The NMPB-96 (Nouvelle Méthode de Prévision du Bruit) and Harmonoise models outcomes provide the subject ofthe analysis. The proposed solution of continuous noise monitoring system needed for validation of models is shortly described.

Entry No. 234

Entry type journal paper

Authors J. Kotus, B. Kostek, A. Czyżewski

English title PSYCHOACOUSTICAL NOISE DOSIMETRY IN THE MULTIMEDIA NOISE MONITORING SYSTEM

Polish title PSYCHOAKUSTYCZNA DOZYMETRIA HAŁASOWA W MULTIMEDIALNYM SYSTEMIE MONITOROWANIA HAŁASU

Journal Zeszyty Naukowe Wydziału ETI PG

Volume

Number 16

Pages 477 - 482

Abstract The results obtained by means of the Psychoacoustical Noise Dosimeter (PND) were presented in the paper. The developed algorithm provides a new way of the assessment of noise harmfulness. This method was developed on the basis of the available scientific knowledge followed by hearing and noise measurements carried out in laboratory conditions. Taking this knowledge into consideration the new indicators of the cumulative noise-induced harmfulness effects assessment were proposed. Their usefulness and correctness were confirmed on the basis of hearing examination conducted in the real noise exposure situation. Moreover, the PND algorithm was also implemented in the noise monitoring station. It provides an integral part of the teleinformation system for the noise threat monitoring, developed in the Multimedia System Department. The unique functionality of the station enables very precise evaluation of the acoustical conditions. Owing to that, it makes an essential tool supporting the noise-induced hearing loss prevention.

Streszczenie W referacie przedstawiono wyniki działania Psychoakustycznego Dozymetru Hałasowego (PDH). Opracowany algorytm stanowi nowy sposób oceny szkodliwości hałasu. Metoda ta opiera się na wykorzystaniu wiedzy na temat właściwości słuchu dostępnej w literaturze oraz na wynikach badań słuchu i hałasu przeprowadzonych w warunkach laboratoryjnych. Na tej podstawie zaproponowano nowe wskaźniki oceny skumulowanych skutków słuchowych wywołanych ekspozycją na hałas. Poprawność działania opracowanego algorytmu i zaproponowanych wskaźników potwierdzono na podstawie badań w warunkach rzeczywistego narażenia na hałas. Algorytm PDH zaimplementowano ponadto w stacji monitorowania hałasu. Stanowi ona integralną część, opracowanego w Katedrze, teleinformatycznego systemu monitorowania zagrożeń hałasem. Unikatowa funkcjonalność stacji umożliwia dokładną ocenę warunków akustycznych pod względem ich potencjalnej szkodliwości dla słuchu.

Entry No. 235

Entry type journal paper

Authors J. Kotus, A. Czyżewski, B. Kostek

English title Evaluation of Excessive Noise Effects on Hearing Employing Psychoacoustic Dosimetry

Polish title

Journal Noise Control Engineering Journal

Volume

Number

Pages

Notes W druku

Abstract Research results regarding the noise impact on hearing applying the concept of the Psychoacoustic Noise Dosimetry (PND) are presented. The general characteristics of the PND algorithm are discussed. Additionally, the results of hearing examinations conducted in the laboratory conditions are shown. The main objective of the research was to determine the time needed for the Temporary Threshold Shift to reverse. The results were used for the optimization of the designed PND performance. A validation of the PND algorithm was performed considering real noise exposure conditions. A new way of assessing noise-induced harmful effects on human hearing system was proposed employing the new indicators of noise harmfulness. The indicators are based on some psychoacoustical properties of the human hearing system and, simultaneously, on the time and frequency characteristics of noise. The correctness and the practical applicability of the newly proposed indicators were confirmed experimentally using hearing testing with real noise exposures and also on the basis of simulation results employing some standard test signals.

Entry No. 236

Entry type book

Authors A. Czyżewski, B. Kostek, J. Kotus

English title Multimedia Interactive Services in Intelligent Environments; Multimedia Services Applied to Noise and Hearing Monitoring and Measuring

Polish title

Editor Springer

Pages 275 - 295

Notes Rozdział w książce

Abstract The goal of this chapter is to show a research study related to processing of data acquired by the multimedia services engineered at the multimedia systems department (MSD) of the Gdansk University of Technology. This concerns a survey on noise threat employing the multimedia noise monitoring system (MNMS) and hearing tests performed by the “I can hear. . . ” system. The obtained results of the noise measurements revealed that an unfavorable noise climate was found in the examined schools and music clubs. This was also confirmed by the hearing examination results. On the basis of data gathered by both systems it was possible to perform an analysis relating the hearing impairment and noise indicators. New noise annoyance and noise threat criteria were proposed and verified based on the data acquired and analyzed. The measurement results obtained under in situ conditions were compared with those computed by means of the proposed psychoacoustical noise dosimeter.

Entry No. 237

Entry type conference paper

Authors A. Czyżewski, J. Kotus, B. Kostek, K. Kochanek, H. Skarżyński

English title Extending the Universal Screening System "I can hear." with diagnosing influence of noise to hearing

Polish title Rozbudowa systemu przesiewowego badań słuchu "Słyszę" o funkcje diagnozowania wpływu hałasu na słuch

Conference XLIII Zjazd Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Łódź, Polska

Conference date 4.6.2008- 7.6.2008

Streszczenie Liczne i alarmujące sygnały dotyczące stanu słuchu społeczeństwa, a zwłaszcza dzieci i młodzieży oraz klimatu akustycznego w kraju stanowiły motywację do rozszerzenia systemu powszechnych badań przesiewowych słuchu "SŁYSZĘ" o moduł do diagnozowania wpływu hałasu na słuch. W Katedrze Systemów Multimedialnych we współpracy z Instytutem Fizjologii i Patologii Słuchu, opracowano nowatorskie narzędzia diagnostyczne, umożliwiające przeprowadzenie wiarygodnych, przesiewowych testów słuchu. Nadmierny hałas występujący w środowisku (w tym również szkołach) może stanowić realne zagrożenie dla słuchu. Niezwykle potrzebne jest szerokie propagowanie walki z hałasem i zapobieganie negatywnym skutkom wywołanym ekspozycją na wysoki poziom hałasu. Jest to możliwe przez zastosowanie zaprojektowanych w tym celu urządzeń pomiarowych, które z automatyczny za pomocą komunikacji bezprzewodowej będą przekazywały aktualne informacje o zagrożeniu hałasem. Najnowsza wersja oprogramowania systemu "Słyszę" umożliwia przeprowadzanie testów przesiewowych słuchu przez Internet jak również za pomocą urządzeń typu PoketPC.

Entry No. 238

Entry type conference paper

Authors A. Czyżewski, B. Kostek, A. Geremek, K. Kochanek, H. Skarżyński

English title Contactless hearing aid

Polish title Bezkontaktowy aparat słuchowy

Conference XLIII Zjazd Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Łódź, Polska

Conference date 4.6.2008- 7.6.2008

Streszczenie Celem prowadzonych prac jest wdrożenie cyfrowego bezkontaktowego aparatu słuchowego dla niemowląt. W ubiegłych latach opracowano model takiego urządzenia, który wykorzystano do uruchomienia algorytmów filtracji przestrzennej dźwięku, algorytmów eliminowania pasożytniczych sprzężeń akustycznych i metod kompresji oraz wzmacniania mowy. Model przetestowano w łóżeczku dziecięcym, z udziałem małych dzieci. Aktualnie prowadzone jest opracowanie praktycznego zestawu aparatu słuchowego, pracującego w swobodnym polu akustycznym oraz jego kolejne próby w warunkach klinicznych, w Klinice Audiologii IFPS. W toku badań zaprojektowany bezkontaktowy aparat słuchowy umiejscawiany jest w łóżeczku niemowlęcia. Aparat składający się z matrycy 4 mikrofonów oraz z prototypowej karty z procesorem DSP pracuje w polu akustycznym otaczającym głowę dziecka. Przetworzony sygnał mowy emitowany jest z wykorzystaniem miniaturowych głośników, w tym głośników kierunkowych o specjalnej konstrukcji. Opracowane algorytmy pozwalają na eliminację akustycznych sprzężeń zwrotnych, które mogą występować ze względu na niewielką odległość mikrofonów od głośników i potencjalnie wysokie wzmocnienie sygnału w polu akustycznym. Algorytm filtracji przestrzennej wykorzystuje nieliniową filtrację sygnału w dziedzinie widma. W toku prowadzenia eksperymentów wykorzystywana jest metodyka badań obiektywnych i metoda obserwacji behawioralnych z udziałem matek i ich dzieci. Badaniom poddawane zostają dzieci kilkumiesięczne. Reakcje obiektywnie mierzalne są badane przy użyciu urządzenia służącego do prowadzenia przesiewowych badań słuchu, z wykorzystaniem metod ABR (potencjały wywołane pnia mózgu) i TEOAE (otoemisja akustyczna wywołana trzaskiem) – Kuba Mikro. Ponadto, w zakres projektu, którego dotychczasowe rezultaty są przedmiotem referatu, wchodzi analiza i dyskusja wyników eksperymentalnych uzyskanych w powyżej określonych warunkach. Słowa kluczowe: audiometria, badanie słuchu, protezy słuchu

Entry No. 239

Entry type conference paper

Authors B. Kostek, A. Czyżewski, Ł. Kosikowski, K. Kochanek, H. Skarżyński

English title Hearing-screening tests based on filtered sounds and on speech-in-noise intelligibility tests

Polish title Audiometria przesiewowa dedykowana dla dzieci przedszkolnych

Conference Acoustics'2008

Preprint 2958

Number

Volume

Pages

Conference site Paris, France

Conference date 29.6.2008- 4.6.2008

Abstract A hearing-screening system dedicated to small-children in pre-schools and primary schools is described in the paper. It uses as a hardware a palmtop computer supplemented with a small sound calibrating device. The described application provides tests that employ automatic questionnaire analysis, audiometric test procedures, and assessment of speech intelligibility in noise. In the speech-in-noise intelligibility tests, pictures are used for young children, and the screening tests are supervised by adults. Apart from the standardized audiometric tests, the screening tests employ environmental sounds filtered in audiometric frequency bands and calibrated as to their levels. When all the testing is completed, the system automatically analyzes the results for each child examined. The decision is made automatically by the expert system taking into account the number of incorrect answers. Children whose hearing impairment is confirmed are referred to treatment in rehabilitation centers. The project presented is a part of the large-scale ”I can hear...” screening tests program carried out in Poland for the last few years. This may help to increase awareness and inspire action against noise at a very early age. The methods employed for filtering and calibration environmental sounds and results achieved are presented in the paper.

Streszczenie W referacie przedstawiono testy audiometrii dziecięcej przesiewowej, zaimplementowane na urządzeniu typu PDA. Przedyskutowano funkcjonalności tego typu audiometru i opisano przygotowane testy audiometrii przesiewowej. Słowa kluczowe: audiometria przesiewowa, audiometria mowy w szumie, testy audiometryczne, PDA

Entry No. 240

Entry type conference paper

Authors P. Kozielecki, A. Czyżewski

English title A Novel Dynamic Noise Maps Visualization Tool

Polish title Nowe podejście do wykreślania dynamicznych map akustycznych

Conference IT 2008. 1st International Conference on Information Technology

Preprint

Number

Volume

Pages 245 - 248

Conference site Gdańsk, Polska

Conference date 19.5.2008- 21.5.2008

Abstract A concept and an implementation of the Novel Dynamic Noise Maps Visualization Tool is presented in this paper. The principal aim of the project is described. The first part depicts general features of the Tool. The main focus is put on presenting the innovative features of the Dynamic Noise Maps Visualization Tool in comparison to the commonly used software and hardware solutions. The software components applied are briefly presented. Furthermore, the project contribution into the field of dynamic noise maps visualization is introduced. All the application features are thoroughly described. The results of the Dynamic Noise Maps Visualization Tool performance, quality and ergonomics experiments are presented.

Streszczenie W referacie przedstawiono aplikację realizujacą wizualizację dynamicznych map akustycznych zintegrowaną z multimedialnym systemem monitoringu hałasu. Moduł ten został oparty na nowym podejściu do wykreślania dynamicznych map, w referacie przedstawiono porównanie wyników uzyskanych metodami tradycyjnymi i zaproponowaną metodą. Słowa kluczowe: dynamiczne mapy, wizualizacja, monitoring, hałas, system GIS

Entry No. 241

Entry type conference paper

Authors P. Kozielecki, A. Czyżewski

English title AN APPLICATION FOR VECTOR-BASED DYNAMIC NOISE MAPS GENERATION

Polish title Wizualizacja dynamicznych map hałasowych oparta na analizie wektorowej

Conference Joint Baltic-Nordic Acoustic Meeting, BNAM 2008

Preprint

Number

Volume

Pages 1 - 13

Conference site Reykjavik, Iceland

Conference date 17.8.2008- 19.8.2008

Abstract The concept and the developed application for vector-based, dynamic noise maps generation is presented. General features of dynamic noise maps are described. The advantages of dynamic noise maps visualization, utilizing vector graphics and animation techniques, compared to the commonly used software solutions are discussed. The main focus is put on presenting authors’ concept of the algorithm for noise contour extraction and shape labeling. The implementation details of the application are presented. Furthermore, quality and fidelity of extracted noise contours are evaluated.

Streszczenie W referacie przedstawiono koncepcję i metodę wykreślania dynamicznych map hałasowych opartą na analizie wektorowej. Podane zostały cechy metody oraz wyniki porównania metod wykreślania map dynamicznych. W referacie przedstawiono aplikację realizującą wizualizację map hałasowych ze szczególnym uwzględnieniem ekstrakcji konturów. Słowa kluczowe: hałas, dynamiczne mapy hałasowe, analiza wektorowa, animacja

Entry No. 242

Entry type conference paper

Authors A. Czyżewski, J. Ejsmont

English title VALIDATION OF HARMONOISE/IMAGINE TRAFFIC NOISE PREDICTION MODEL BY LONG TERM NOISE AND TRAFFIC MONITORING

Polish title Walidacja modelu Harmonoise/Imagine do przewidywania hałasu drogowego na podstawie długotrwałego monitoringu hałasu i natężenia ruchu drogowego.

Conference Joint Baltic-Nordic Acoustics Meeting 2008, 17-19 August 2008, Reykjavik, Iceland

Preprint

Number

Volume

Pages 1 - 13

Conference site Reykjavik, Iceland

Conference date 17.8.2008- 19.8.2008

Notes publikacja elektroniczna, recenzowana

Abstract In June 2002, the European Directive on the Assessment and Management of Environmental Noise (2002/49/EC) was accepted and came into force. Under this Directive the member states were obliged to produce strategic noise maps of major roads, railways, airports and large agglomerations by 30th June 2007. During the first round of noise mapping the national prediction methods and so called interim methods were allowed. Nevertheless, already in 2001, under the 5th Framework Programme, the HARMONOISE project has been started with a goal to establish a common assessment method for noise mapping of road and railroad sources. A following project, called IMAGINE, was started in 2003 to refine methods proposed by HARMONOISE and to include modules dealing with airports and industrial sources. The basic principle of the so called "HARMONOISE/IMAGINE" model is to provide separate, but well harmonized, modules for sources' description (road traffic, railroad traffic, air traffic and industrial sources) and for propagation. This paper is however focused only on the module describing road traffic noise source. In 2005 the Technical University of Gdansk has built and started to operate a noise, traffic and weather monitoring station on one of the major roads close to Gdansk. Now the results for over 3 years of continuous monitoring are available. The follow up of the monitoring is carried out as a part of the national project sponsored by the Polish Ministry of Science and Higher Education that deals with noise monitoring of urban areas by means of tele-informatics and GIS. The paper presents validation of the model performed on the base of those results. Moreover, it contains also critical analysis of the data and procedures included in the model and suggestions of improvement. One of the problems with the HARMONOISE/IMAGINE model is the overestimation of propulsion noise of heavy trucks in comparison to the rolling noise. Another problem is the specific procedure for establishing correction factors during deceleration that is not accounting for the increase of rolling noise.

Streszczenie W pracy przedstawiono model do przewidywania hałasu ruchu drogowego opracowany w projektach Unii Europejskiej Harmonoise i Imagine. Omówiono strukturę modelu, klasyfikację pojazdów, zastosowane zależności oraz wprowadzone czynniki korekcyjne. Zaprezentowano stację zainstalowaną w Chwaszczynie do monitoringu hałasu i natężenia ruchu drogowego oraz warunków pogodowych. Przedstawiono wyniki długotrwałych pomiarów wykonanych za pomocą tej stacji, które posłużyły do walidacji modelu do przewidywania hałasu ruchu drogowego. Słowa kluczowe: hałas drogowe, monitoring, model przewidywania hałasu, walidacja

Entry No. 243

Entry type journal paper

Authors P. Żwan, P. Szczuko, B. Kostek, A. Czyżewski

English title Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets

Polish title Automatyczne rozpoznawanie głosów śpiewaczych przy pomocy sieci neuronowych i zbiorów przybliżonych

Journal Transactions on Rough Sets

Volume

Number 9

Pages 455 - 473

Abstract The aim of the research study presented in this paper is the automatic recognition of a singing voice. For this purpose, a database containing sample recordings of trained and untrained singers was constructed. Based on these recordings, certain voice parameters were extracted. Two recognition categories were deﬁned – one reﬂecting the skills of a singer (quality), and the other reﬂecting the type of the singing voice (type). The paper also presents the parameters designed especially for the analysis of a singing voice and gives their physical interpretation. Decision systems based on artiﬁcial neutral networks and rough sets are used for automatic voice quality/ type classiﬁcation. Results obtained from both decision systems are then compared and conclusions are derived.

Streszczenie Celem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory śpiewu oparte o sieci neuronowe i zbiory rozmyte.

Entry No. 244

Entry type

Authors A. Czyżewski, G. Szwoch

English title Method and apparatus for acoustic echo cancellation in VoIP terminal

Polish title

Notes USA: US20110002458

Abstract A method of acoustic echo cancellation in the VoIP terminal using processing of the far-end signal with the digital adaptive filter in order to obtain the echo estimate that is subtracted from the microphone signal in which the far-end signal, before it is converted to the analog form and passed to the loudspeaker (4), is marked by embedding an encoded digital signature obtained from the signature generator (14) and then detection of the digital signature is performed in the signal collected by the microphone (7) and converted to digital form, depending on the result of the digital signature detection, adaptation of the digital adaptive filter (9) is stopped or resumed. A circuit for acoustic echo cancellation in VoIP terminal contain the digital adaptive filter with the control block situated between the far-end speech signal path and the near-end speech signal path, and the double-talk detector (11) that comprises the signature generator (14) connected by the signature encoder (15) to the signature embedding block (16) that is situated between the speech decoder (2) and the digital-to-analog converter (3) in the far-end speech signal path. The signature generator (14) is also connected to the signature decoder (17) which is connected to the output of the analog-to-digital converter (8) in the near-end speech signal path and the output of the signature decoder (17) is connected by the decision block (18) to the control block (10) of the digital adaptive filter (9).

Entry No. 245

Entry type conference paper

Authors A. Czyżewski, P. Dalka

English title Examining Kalman filters applied to tracking objects in motion

Polish title Zastosowanie filtrów Kalmana do śledzenia ruchomych obiektów

Conference Ninth International Workshop on Image Analysis for Multimedia Interactive Services

Preprint

Number

Volume

Pages 175 - 178

Conference site Klagenfurt, Austria

Conference date 7.5.2008- 9.5.2008

Notes http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4556857&arnumber=4556913&count=74&index=55

Abstract Kalman filters were used for establishing relations between objects moving in video frames to the real moving objects under analysis. As a result of applying some popular methods of moving objects detection, the objects were represented by rectangles. A two-dimensional colour histogram based on a chromatic space was used for each object in experiments. The objects coupling with adequate regions including the relation of many-to-many was studied experimentally employing Kalman filters. The implemented algorithm provides a part of an advanced audio-video surveillance system for security applications.

Streszczenie Filtry Kalmana zostały wykorzystane do ustanowienia relacji między obiektami poruszającymi się w ramkach obrazu a rzeczywistymi ruchomymi obiektami podlegającymi analizie. W wyniku zastosowania popularnych metod detekcji ruchomych obiektów, obiekty te są reprezentowane przez prostokąty. Dwuwymiarowy histogram koloru bazujący na przestrzeni chromatycznej został wykorzystany w eksperymentach do opisu każdego obiektu. Eksperymentalnie zbadano sposoby powiązania obiektów z odpowiednimi regionami w ramce obrazu, uwzględniają relacje wiele-do-wielu, wykorzystujące filtry Kalmana. Zaimplementowany algorytm stanowi część zaawansowanego systemu do audio-wizualnego monitorowania bezpieczeństwa.

Entry No. 246

Entry type journal paper

Authors A. Czyżewski, P. Dalka

English title Moving Object Detection and Tracking for the Purpose of Multimodal Surveillance System in Urban Areas

Polish title Detekcja i śledzenie ruchomych obiektów na potrzeby multimodalnego systemu monitorowania bezpieczeństwa w aglomeracjach miejskich

Journal New Directions in Intelligent Interactive Multimedia, seria: Studies in Computational Intelligence

Volume 142

Number

Pages 75 - 84

Notes Proc. of 1st International Symposium on Intelligent Interactive Multimedia Systems and Services, University of Piraeus, Greece 9-11 July 2008

Abstract Background subtraction method based on mixture of Gaussians was employed to detect all regions in a video frame denoting moving objects. Kalman filters were used for establishing relations between the regions and real moving objects in a scene and for tracking them continuously. The objects were represented by rectangles. The objects coupling with adequate regions including the relation of many-to-many was studied experimentally employing Kalman filters. The implemented algorithm provides a part of an advanced audio-video surveillance system for security applications which is described briefly in the paper.

Streszczenie Metoda odejmowania tła wykorzystująca sumę ważonych rozkładów normalnych została użyta do wykrycia w bieżącej ramce obrazu wszystkich obszarów oznaczających ruchome obiekty. Filtry Kalmana zastosowano do określenia właściwych relacji pomiędzy obszarami, a rzeczywistymi obiektami poruszającymi się polu widzenia kamery oraz do ich ciągłego śledzenia. Obiekty są reprezentowane przez prostokąty. Zbadano eksperymentalnie sposoby powiązania odpowiednich regionów z obiektami z wykorzystaniem filtrów Kalmana i uwzględniając relacje wiele-do-wielu. Zaimplementowany algorytm stanowi część zaawansowanego systemu do audio-wizualnego monitorowania bezpieczeństwa, który krótko opisano w artykule.

Entry No. 247

Entry type

Authors A. Czyżewski, G. Szwoch

English title

Polish title Metoda i układ tłumienia echa akustycznego w terminalu VoIP wykorzystująca technikę znakowania sygnału

Streszczenie Wynalazek dotyczy zastosowania w terminalu klienckim systemu VoIP układu tłumienia echa akustycznego z zastosowaniem nowej metody wykrywania przypadku tzw. mowy równoczesnej (ang. double talk). Istotą wynalazku jest przy tym wykorzystanie techniki cyfrowego znakowania sygnału do oznaczenia sygnału przychodzącego od zdalnego klienta systemu VoIP, dzięki czemu możliwe jest późniejsze stwierdzenie występowania mowy równoczesnej i sterowanie procesem adaptacji filtru dokonującego obliczania estymaty echa, a w rezultacie stłumienie echa akustycznego. Zastosowanie wynalazku w systemie VoIP umożliwia uzyskanie skutecznego tłumienia echa akustycznego przy użyciu metody nie wprowadzającej istotnych opóźnień w transmisji danych, a w rezultacie poprawę jakości usługi VoIP. Wynalazek może być szczególnie użyteczny dla tych użytkowników systemów komunikacyjnych VoIP, którzy z różnych względów podczas komunikacji w systemie VoIP wykorzystują głośnik zamiast słuchawek. Zastosowanie metody według wynalazku zapobiega powstawaniu efektu echa akustycznego, który znacznie utrudnia prowadzenie rozmów i obniża jakość usługi. Proponowane rozwiązanie realizuje układ tłumienia echa akustycznego za pomocą adaptacyjnego filtru cyfrowego. Przetworzenie sygnału docierającego od drugiego mówcy przez filtr adaptacyjny powoduje uzyskanie estymaty echa akustycznego, która następnie jest odejmowana od sygnału zebranego przez mikrofon. Wynik tej operacji jest wykorzystywany do adaptacji (strojenia) filtru. Istotnym elementem opisywanego układu jest detektor tzw. mowy równoczesnej (ang. double talk detector). Jego zadaniem jest wykrycie przypadku mowy równoczesnej, gdy mikrofon przyłączony do terminala klienckiego odbiera użyteczny sygnał mowy pochodzący od mówcy bliskiego oraz jednocześnie sygnał echa od drugiego mówcy. Detektor mowy równoczesnej wykrywa taki przypadek i powoduje wstrzymanie procesu adaptacji filtru na okres występowania mowy równoczesnej. Zapobiega on w ten sposób rozstrojeniu filtru adaptacyjnego, co skutkowałoby zniekształceniem przetwarzanego sygnału. Schemat blokowy układu detekcji mowy równoczesnej oraz sterowanego przez niego układu tłumienia echa akustycznego będącego zasadniczą częścią terminala systemu VoIP, stanowiącego sobą urządzenie według wynalazku, przedstawiono na rysunku. Opisywany układ wykorzystuje technikę znakowania sygnału pobranego z wejścia terminala A, po jego przetworzeniu przez dekoder mowy. Generator znacznika 1 wytwarza ustaloną sekwencję bitów, nazywaną znacznikiem, dobraną w taki sposób, aby umożliwić późniejszą detekcję obecności znacznika w sygnale, który został zniekształcony podczas transmisji fal akustycznych pomiędzy głośnikiem i mikrofonem. Ponadto generator 1 wytwarza ustaloną sekwencję pseudolosową oraz sygnał nośny. W bloku kodowania znacznika 2 następuje mnożenie znacznika przez sygnał nośny, a następnie przez sygnał pseudolosowy, aby uzyskać w ten sposób sygnał obejmujący szeroki zakres częstotliwości, odporny na zniekształcenia. Otrzymany w ten sposób zakodowany znacznik jest tłumiony i dodawany do właściwego sygnału w bloku zapisu znacznika 3. Sygnał z zapisanym znacznikiem jest przekazywany na wyjście układu B. Z kolei sygnał odebrany przez mikrofon trafia na wejście C opisywanego układu i jest sprawdzany pod kątem występowania znacznika. Opisywana metoda detekcji mowy równoczesnej opiera się na spostrzeżeniu, że w przypadku braku mowy równoczesnej w sygnale odebranym przez mikrofon obecny będzie tylko sygnał echa oraz ewentualnie szum i inne zakłócenia, możliwe będzie zatem wykrycie obecności znacznika. Natomiast w przypadku gdy w analizowanym sygnale jest obecny również sygnał mowy wprowadzony przez użytkownika terminala, znacznik zawarty w sygnale echa zostanie stłumiony przez użyteczny sygnał mowy, przez co blok detektora znacznika stwierdzi brak znacznika w sygnale. Blok dekodowania znacznika 4 dokonuje wstępnego przetwarzania sygnału, obejmującego m.in. normalizację i synchronizację z sygnałem kodowanym, a następnie sygnał jest mnożony przez sygnał pseudolosowy uzyskany z generatora 1, po czym przeprowadzana jest detekcja treści znacznika. Odczytany znacznik jest następnie porównywany ze znacznikiem uzyskanym z generatora 1, wprowadzonym wcześniej do sygnału, po czym blok decyzyjny 5 na podstawie wyniku próby odczytu znacznika określa czy znacznik ten był obecny w analizowanym sygnale i odpowiednio włącza lub wyłącza blok 6 sterujący adaptacją filtru 7. Blok decyzyjny 5 dostarcza binarną informację: jeżeli znacznik nie został wykryty, oznacza to konieczność zatrzymania procesu adaptacji filtru 7 przez blok sterujący 6, natomiast jeżeli znacznik zostanie wykryty, oznacza to brak mowy równoczesnej, zatem adaptacja filtru 7 powinna zostać wznowiona. Niezależnie od wyniku detekcji znacznika, estymata echa uzyskana przy użyciu filtru 7 zostaje odjęta od sygnału wejściowego, po czym wynik tej operacji jest przetwarzany przez procesor dynamiki 8, dokonujący tłumienia tzw. echa resztkowego, a następnie sygnał przekazywany jest na wyjście układu D. Istotną różnicą wprowadzoną w omawianej metodzie, odróżniającą proponowany wynalazek od typowych zastosowań techniki znakowania sygnałów, jest wykrywanie samej obecności znacznika, a nie odczytywanie jego treści. Z tego względu sygnał znacznika został dobrany w taki sposób, aby jego obecność w sygnale była możliwa do stwierdzenia pomimo zniekształcenia sygnału zawierającego znacznik na skutek pogłosu wprowadzonego w pętli akustycznego sprzężenia zwrotnego oraz szumu i innych zakłóceń zewnętrznych, a zarazem aby sygnał mowy użytkownika terminala powodował stłumienie znacznika uniemożliwiające jego wykrycie, pozwalając w ten sposób stwierdzić przypadek występowania mowy równoczesnej. Opisana metoda tłumienia echa akustycznego, z wykorzystaniem znakowania sygnału w celu detekcji mowy równoczesnej, może zostać zaimplementowana w terminalu klienta systemu VoIP, który może mieć postać sprzętową lub programową. Wykorzystanie wynalazku umożliwia skuteczne tłumienie niekorzystnego efektu echa akustycznego bez zwiększania opóźnień w komunikacji, które mógłby spowodować bardziej złożony układ detekcji mowy równoczesnej. Z powyższych względów opisywany wynalazek może znaleźć szczególne zastosowanie w tych terminalach klienckich VoiP, w które nie dysponują dużą mocą obliczeniową. Wykorzystanie wynalazku prowadzi do podwyższenia jakości usług telekomunikacyjnych w systemach VoIP. ZASTRZEŻENIA PATENTOWE 1. Terminal kliencki systemu VoIP znamienny tym, że wykorzystuje układ tłumienia echa akustycznego zawierający detektor mowy równoczesnej zrealizowany w oparciu o znakowanie sygnału przychodzącego za pomocą ustalonego znacznika oraz stwierdzenie obecności znacznika w sygnale przesyłanym do sieci w celu określenia czy występuje mowa równoczesna i czy występuje konieczność zatrzymania procesu adaptacji układu tłumienia echa akustycznego. 2. Metoda detekcji mowy równoczesnej znamienna tym, że wykorzystuje technikę znakowania sygnału w sposób podany w zastrzeżeniu 1.

Entry No. 248

Entry type

Authors A. Czyżewski, P. Odya

English title

Polish title Pisak ekranowy zwłaszcza do tabletu

Notes Wzór użytkowy nadany w 2014 r. (konwersja ze zgłoszenia patentowego)

Entry No. 249

Entry type conference paper

Authors A. Czyżewski, B. Kunka, M. Kurkowski, R. Branchat

English title Comparison of developed gaze point estimation methods

Polish title Porównanie metod wyznaczania punktu fiksacji wzroku

Conference NTAV/SPA 2008

Preprint

Number

Volume

Pages 133 - 136

Conference site Poznań, Polska

Conference date 25.9.2008- 27.9.2008

Notes publikacja w materiałach konferencyjnych oraz na płycie CD

Abstract This paper presents the software part of an inexpensive hands-free eye tracking system. The system works using infrared illumination like most of the available eye trackers. Two methods allowing estimation the gaze point on computer screen are compared. Research on effectiveness of these methods is discussed and the better one is indicated.

Streszczenie Publikacja prezentuje część programową taniego bezkontaktowego systemu śledzenia punktu fiksacji wzroku na monitorze komputera. System pracuje przy wykorzystaniu oświetlenia podczerwienią jak większość dostępnych na rynku systemów tego typu. Zostały porównane dwie metody pozwalające wyznaczyć punkt fiksacji wzroku na monitorze komputerowym. Przedstawiono wyniki badań efektywności zaprezentowanych metod i wskazano lepszą z nich.

Entry No. 250

Entry type conference paper

Authors A. Czyżewski, Ł. Kosikowski, B. Kostek, A. Szkiełkowska, K. Kochanek, H. Skarżyński

English title A portable device for voice monitoring

Polish title Urządzenie przenośne do monitorowania głosu

Conference III Konferencja Naukowo-Szkoleniowa Sekcji Foniatrycznej i Sekcji Audiologicznej Polskiego Towarzystwa Otorynolaryngologów Chirurgów Głowy i Szyi

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 8.5.2008- 10.5.2008

Notes plakat

Abstract The aim of the work was the elaboration of a prototype and, in the second step, the implementation of a portable voice monitoring device for the clinical practice. This device can be used at home, in the work etc. by people with a high risk of voice disorders, e.g. teachers or actors. Those people while speaking, often commit emission errors causing voice problems that can lead to future pathological changes in the larynx.

Streszczenie Celem projektu było opracowanie i implementacja przenośnego urządzenia do monitorowania głosu. Urządzenie może być wykorzystywane w domu i pracy, przez osoby, u których występuję zwiększone ryzyko wystąpienia zaburzeń głosu np. nauczycieli i aktorów. Osoby te często popełniają błędy emisyjne, które mogą w przyszłości spowodować zmiany patologiczne w krtani.

Entry No. 251

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak, B. Kostek

English title Online urban noise monitoring system

Polish title

Conference 16 ICSV

Preprint

Number

Volume

Pages 1 - 8

Conference site Kraków, Polska

Conference date 5.7.2009- 9.7.2009

Abstract Concepts and implementation of the Online Urban Noise Monitoring System are presented in this paper. The objectives of the realized project are described. The concept of the dynamic acquisition of the noise source parameters is introduced. The idea of noise modeling, based on noise emission and propagation simulations, was developed and practically utilized in the system. The practical implementation of noise maps generation and visualization is pre-sented, together with introduced improvements in the domain of continuous noise monitoring and acoustic maps creation. The results of tests performed using the system prototype are shown.

Entry No. 252

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak

English title

Polish title Modelowanie jakości powietrza w powiązaniu z modelem numerycznym miasta z wykorzystaniem oprogramowania działającego na platformie wieloprocesorowej

Conference Jakość powietrza

Preprint

Number

Volume

Pages 1 - 9

Conference site Gdańsk, Polska

Conference date 23.4.2009- 24.4.2009

Abstract Artykuł przedstawia wyniki modelowania zanieczyszczenia powietrza dla wybranych źródeł drogowych. Obliczenia emisji i propagacji zanieczyszczeń powietrza dokonano za pomocą modelu AUSTAL2000, dostępnego w aplikacji CadnaA. Przedstawiono wyniki analiz dla następujących substancji: SO2, NOx, PM10, benzen. Dodatkowo zamieszczono łączne wyniki zanieczyszczeń powietrza i poziomy hałasu dla rozpatrywanych dróg. Zastosowanie platformy wieloprocesorowej do celów obliczeń numerycznych umożliwiło zredukowanie czasu uzyskania wyniku.

Entry No. 253

Entry type journal paper

Authors M. Kulesza, A. Czyżewski

English title Tonality Estimation and Frequency Tracking of Modulated Tonal Components

Polish title Estymacja tonalności oraz śledzenie w dziedzinie częstotliwości modulowanych komponentów tonalnych

Journal J. Audio Eng. Soc.

Volume 57

Number 4

Pages 221 - 236

Abstract A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track candidates. Verification of the candidate tonality is based on the distance between frequency jumps derived from the magnitude-based and phase-based estimators. The effectiveness of the proposed algorithm is compared with selected existing methods for tonality estimation used in psychoacoustic models.

Streszczenie Zaprezentowano nową metoda estymacji tonalności oraz śledzenia częstotliwości chwilowej modulowanych komponentów tonalnych. Algorytm dokonuje detekcji lokalnych maksimów widma w trzech następujących po sobie widma sygnału i przeprowadza proces formowania kandydatów do ścieżek tonalnych. Dla każdego z kandydatów określana jest zmiana częstotliwości chwilowej z wykorzystaniem metody bazującej na analizie widma amplitudowego i fazowego. Weryfikacja tonalności kandydatów następuje na podstawie różnicy estymat uzyskanych dwoma wspomnianymi metodami. Zaproponowany algorytm został porównany z wybranyi metodami estumacji tonalności stosowanymi w modelach psychoakustycznych.

Entry No. 254

Entry type conference paper

Authors M. Kulesza, A. Czyżewski

English title Audio codec employing frequency-derived tonality measure

Polish title Kodek sygnałów fonicznych wykorzystujący miarę tonalności bazującą na analizie częstotliwości komponentów widma

Conference 127th Convention of Audio Engineering Society

Preprint 7877

Number

Volume

Pages 1 - 14

Conference site Nowy Jork, USA

Conference date 9.10.2009- 12.10.2009

Abstract A transform codec employing efficient algorithm for detection of spectral tonal components is presented. The tonality measure used in MPEG psychoacoustic model is replaced with the method providing adequate tonality estimates even if the tonal components are deeply frequency modulated. The reliability of hearing threshold estimated using psychoacoustic model with standardized tonality measure and the proposed one is investigated using objective quality testing methods. The proposed tonality estimator is also used as a basis for detector of noise-like signal bands. Instead of quantizing the noise-like signal components according to the usual transform coding scheme, the signal bands containing only noise-like components are filled with locally generated noise in the decoder. The results of the listening tests reveling usefulness of employed tonality estimation method for such a coding scenario are presented.

Streszczenie Zaprezentowano kodek sygnałów fonicznych wykorzystujący efektywny algorytm detekcji komponentów tonalnych sygnału. Miara tonalności stosowana w kodekach MPEG została zastąpiona metodą pozwalającą na efektywne określenie tonalności komponentów modulowanych częstotliwościowo. Zastosowano obiektywne metody porównawcze w celu określenia wiarygodności estymacji progów słyszenia uzyskiwanych przy pomocy modeli psychoakustycznych ze standardową miarą tonalności oraz miarą proponowaną przez autorów. Miara tonalności została również wykorzystana do detekcji pasm sygnału zawierających jedynie komponenty szumowe. Pasma szumowe nie są kodowane w typowy sposób, ale syntetyzowane lokalnie w dekoderze. Przedstawiono wyniki testów odsłuchowych potwierdzających użyteczność wykorzystanej metody estymacji tonalności w przypadku kodowania sygnałów fonicznych zgodnie z opisaną metodą.

Entry No. 255

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski, J. Kotus

English title Dynamic noise mapping in the city of Gdansk

Polish title

Conference Euronoise 2009

Preprint

Number

Volume

Pages

Conference site Edynburg, Wielka Brytania

Conference date 26.10.2009- 28.10.2009

Abstract Investigation results of the system for creating dynamic noise maps are presented. Brief description of the system engineered at the Multimedia Department of Gdansk University of Technology is introduced. The method for acquiring input data to the numerical model for computing the distribution of the acoustic field over urban area is described. Distribution in the city area of stations for road traffic volume and noise level monitoring is illustrated. Pilot series of experiments aiming at investigation of operating efficiency of dynamic noise maps is presented. The noise maps obtained as an output of algorithms implemented on the supercomputer are shown. The time needed for updating a map for a particular fragment of city is investigated. Moreover, the outcomes obtained by modeling and by measurements done in the places of station deployment are compared.

Streszczenie W referacie przedstawiono wyniki badania systemu do tworzenia dynamicznych map hałasu. Zaprezentowano krótki opis systemu opracowanego w Katedrze Systemów Multimedialnych. Opisano metodę pobierania danych wejściowych do modelu numerycznego obliczającego rozkład pola akustycznego w obszarze miejskim. Przedstawiono próbną serię eksperymentów mających na celu zbadanie efektywności działania systemu. Pokazano mapy hałasu otrzymane w wyniku działania algorytmów zaimplementowanych na superkomputerze. Zbadano czas potrzebny do aktualizacji mapy dla wybranego obszaru miasta. Ponadto porównano wyniki otrzymane w procesie modelowania z pomiarami wykonanymi w miejscach rozlokowania stacji pomiarowych.

Entry No. 256

Entry type conference paper

Authors A. Czyżewski, P. Odya, A. Grabkowska, M. Grabkowski, B. Kostek

English title Smart Pen - new multimodal computer control tool for dyslexia therapy

Polish title Inteligentny długopis - komputerowy interfejs przeznaczony do terapii dysleksji

Conference Siggraph 2009

Preprint

Number

Volume

Pages

Conference site Nowy Orlean, USA

Conference date 3.8.2009- 7.8.2009

Notes Plakat

Abstract Smart Pen is a tool for supporting the therapy of developmental dyslexia, with particular regard to dysgraphia. It comprises a display monitor equipped with a high-sensitivity touchpad and specially designed writing tool equipped with pressure sensors.

Streszczenie Inteligentny długopis jest narzędziem przeznaczonym do wspomagania terapii dysleksji, ze szczególnym uwzględnieniem dysgrafii. Interfejs składa się z monitora zintegrowanego z tabletem oraz długopisu wyposażonego w czujniki nacisku.

Entry No. 257

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya, H. Skarżyński, P. Skarżyński, P. Suchomski

English title New Technology for Hearing Stimulation Employing the SPS-S Method

Polish title Wykorzystanie nowych technologii do treningu słuchu z użyciem metody SPS-S

Conference 127th AES Convention

Preprint 7919

Number

Volume

Pages

Conference site Nowy Jork, USA

Conference date 9.10.2009- 12.10.2009

Abstract A prototype of a the new Compact Audio Therapy Unit (CATU) is presented that can process any audio signal inside a very compact device working in real time, employing advanced digital filtration, signal keying, manipulating playback rate, various spectral modifications of the signal, repeating phrases and others. It was designed to provide a platform for the therapy with the new Method of the Aural Perception Stimulation (SPS-S). The design for wearability allows one to use the device effectively in normal everyday life conditions, e. g. outdoors. The compact and versatile processing device can potentially open a new era in patients and trainees mobility.

Streszczenie Istotnym założeniem metody SPS-S jest możliwość stosowania treningu słuchowego w warunkach życia codziennego, a więc także poza gabinetami placówek terapeutycznych. Mobilne urządzenie – stymulator słuchu jest oparte na najnowszej technologii mikroelektronicznej, Oferuje on możliwość wykorzystywania wielu programów terapii, które przebiegają z zastosowaniem algorytmów cyfrowego przetwarzania dźwięku. Algorytmy te można podzielić na klasyczne, nawiązujące w swojej zasadzie działania do koncepcji tzw. „Elektronicznego ucha” i na w pełni oryginalne algorytmy, które przekształcają dźwięki w taki sposób, aby ich odsłuchiwanie powodowało poprawę w zakresie lateralizacji słuchowej. Wspomniane „Elektroniczne ucho” służy przede wszystkim poprawie motoryki mikromięśni ucha środkowego, podczas gdy algorytmy związane z poprawą lateralizacji, mogą być wykorzystywane do niwelowania licznych niekorzystnych objawów nieprawidłowej lateralizacji. Jak dowodzą badania naukowe, choć diagnozowane nieprawidłowości w tym zakresie dotyczą najczęściej lateralizacji słuchowej, to mogą one mieć również związek z jąkaniem się, z nie w pełni efektywnym widzeniem, a nawet z dysleksją. W komunikacie zostanie przedstawiona nowa metoda stymulacji sensorycznej i opracowane urządzenie mobilne umożliwiające tę stymulację.

Entry No. 258

Entry type conference paper

Authors A. Czyżewski, P. Odya, B. Kostek, H. Skarżyński

English title

Polish title SPS-S - nowa metoda stymulacji słuchowej

Conference ISSET 2009 - XIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku

Preprint

Number

Volume

Pages 37 - 44

Conference site Warszawa, Polska

Conference date 16.10.2009- 18.10.2009

Abstract One of the main objectives of the SPS-S method is the possibility of auditory training in conditions of everyday life, and therefore also outside the therapeutic institutions. A prototype of a new Compact Audio Therapy Unit (CATU) is presented that can process any audio signal inside a very compact device working in real time. The CATU offers the possibility of using multiple therapy programs, which run with the use of digital audio processing algorithms. It was designed to provide a platform for the therapy with the new Method of the Aural Perception Stimulation (SPS-S). The CATU and the SPS-S method will be described in the paper.

Streszczenie Istotnym założeniem metody SPS-S jest możliwość stosowania treningu słuchowego w warunkach życia codziennego, a więc także poza gabinetami placówek terapeutycznych. Mobilne urządzenie – stymulator słuchu jest oparte na najnowszej technologii mikroelektronicznej, Oferuje on możliwość wykorzystywania wielu programów terapii, które przebiegają z zastosowaniem algorytmów cyfrowego przetwarzania dźwięku. Algorytmy te można podzielić na klasyczne, nawiązujące w swojej zasadzie działania do koncepcji tzw. „Elektronicznego ucha” i na w pełni oryginalne algorytmy, które przekształcają dźwięki w taki sposób, aby ich odsłuchiwanie powodowało poprawę w zakresie lateralizacji słuchowej. W komunikacie zostanie przedstawiona nowa metoda stymulacji sensorycznej i opracowane urządzenie mobilne umożliwiające tę stymulację.

Entry No. 259

Entry type conference paper

Authors J. Kotus, B. Kostek, A. Czyżewski

English title A new methodological approach to the noise threat evaluation based on the selected physiological properties of the human hearing system

Polish title Nowe metodologiczne podejście w ocenie zagrożeń hałasem oparte na wybranych fizjologicznych właściwościach słuchu

Conference 126th AES Convention

Preprint

Number

Volume

Pages

Conference site Munich, Germany

Conference date 7.5.2009- 10.5.2009

Abstract A new way of assessment of noise-induced harmful effects on human hearing system is presented in the paper. The method takes into consideration properties of the selected physiological human hearing system. On the basis of the hearing examinations and noise measurements results and psychoacoustical noise dosimeter performance the new indicators of the noise harmfulness were proposed. The evaluation of the proposed indicators were conducted on the basis of hearing examination in the real noise exposure situations and also on the basis of the simulation results using standard test signals (such as: white, pink and brown noise). The performed analysis and obtained results confirmed the practical usefulness and correctness of the proposed indicators.

Entry No. 260

Entry type conference paper

Authors P. Maziewski, A. Kupryjanow, K. Kaszuba, A. Czyżewski

English title ACCELEROMETER SIGNAL PRE-PROCESSING INFLUENCE ON HUMAN ACTIVITY RECOGNITION

Polish title WPŁYW PRZETWARZANIA WSTĘPNEGO SYGNAŁÓW PRZYSPIESZENIA NA JAKOŚĆ ROZPOZNAWANIA KATEGORII RUCHU CZŁOWIEKA

Conference NTAV/SPA 2009

Preprint

Number

Volume

Pages 95 - 99

Conference site Poznań, Polska

Conference date 24.9.2009- 26.9.2009

Abstract A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy

Streszczenie W referacie przedstawiono analizę wpływu przetwarzania wstępnego sygnałów pochodzących z czujników przyspieszenia na sposób działania algorytmów rozpoznawania kategorii ruchu człowieka. Zbadano wpływ zmian częstotliwości odcięcia filtrów oraz ilości czujników przyspieszenia na skuteczność działania poszczególnych algorytmów rozpoznawania aktywności ruchowych.

Entry No. 261

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński

English title

Polish title Multimedialny System Monitorowania Hałasu

Conference XVI giełda polskich wynalazków nagrodzonych na światowych targach wynalazczości w 2008 roku

Preprint

Number

Volume

Pages 52 - 52

Conference site Warszawa, Polska

Conference date 9.3.2009- 15.3.2009

Streszczenie Prezentowane rozwiązanie jest sieciocentrycznym serwisem poświęconym monitorowaniu zagrożeń hałasem. Umożliwia pobieranie, gromadzenie, analizę i wizualizację danych dotyczących hałasu i koncentracji zanieczyszczeń powietrza, pobieranych ze zdalnych urządzeń pomiarowych oraz elektronicznych ankiet dostępnych przez Internet. Bezobsługowe stacje pomiarowe, będące integralnym elementem systemu, umożliwiają automatyczne określanie parametrów wybranych modeli źródeł hałasu (droga, linia kolejowa). Funkcjonalność ta umożliwia automatyzację procesu tworzenia i weryfikowania map hałasu.

Entry No. 262

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński

English title

Polish title Multimedialny System szacowania wpływu hałasu na słuchz zastosowaniem środków teleinformatycznych

Conference XVI giełda polskich wynalazków nagrodzonych na światowych targach wynalazczości w 2008 roku

Preprint

Number

Volume

Pages 59 - 59

Conference site Warszawa, Polska

Conference date 9.3.2009- 15.3.2009

Streszczenie Prezentowane rozwiązanie jest sieciocentrycznym serwisem poświęconym monitorowaniu zagrożeń hałasem. System zawiera unikatową i mająca bardzo duże znaczenie praktyczne, autorską koncepcję psychoakustycznego dozymetru hałasowego. Podstawową funkcją dozymetru jest szacowanie w czasie rzeczywistym skutków słuchowych, jakie wywołuje ekspozycja na hałas. Dzięki temu możliwe jest dokładne określenie zagrożeń słuchowych dla dowolnych warunków akustycznych. Specjalne procedury zawarte w systemie umożliwiają ponadto wyznaczenie częstotliwości, które stanowią największe zagrożenie dla słuchu.

Entry No. 263

Entry type conference paper

Authors H. SKARŻYŃSKI, P. BOGORODZKI, K. KOCHANEK, A. CZYŻEWSKI, J. KOTUS, P. SKARŻYŃSKI

English title A fMRI audio system for temporary hearing threshold shifts studies

Polish title

Conference European Society for Magnetic Resonance in Medicine and Biology (ESMRMB) 2009 Congress

Preprint

Number

Volume

Pages 300 - 300

Conference site Antalya, Turkey

Conference date 1.10.2009- 3.10.2009

Abstract Several empirical studies have shown, that high level acoustic exposure may cause on humans effect called. Temporary Hearing Threshold Shift (TTParadigmDesigner tool for fMRI experiments, ESMRMB2006.S). A dB SPL expressed TTS value expresses a subject specific hearing level shift caused by an acoustic overload. The main aim of this project was to develop a MRI compatible audio system with the capabilities of: a)performing fMRI stimulation, b)measuring subject specific hearing levels, c)providing high level acoustic exposures. A mean 12 dB TTS effect on 3kHz was measured. The FMRI results with random effect analysis showed strong activations in different parts of auditory cortex (max. T-value left hemisphere 6.5, right hemisphere 6.3, all values with p<0.001) for both functional runs (before and after noise exposure). A cross-run comparisons we will be presented. The experimental data showed that proposed system can be used for more quantitative analysis of TTS effect. A possibility of auditory stimulation and hearing level measurements directly in magnet bore may open a new possibilities in this ara.

Entry No. 264

Entry type journal paper

Authors P. Dalka, A Ciarkowski, P. Szczuko, G. Szwoch, A. Czyżewski

English title Surveillance Camera Tracking of Geo positioned Objects

Polish title Śledzenie obiektów o znanej pozycji geograficznej za pomocą kamer PTZ

Journal New Directions in Intelligent Interactive Multimedia Systems and Services - 2, seria: Studies in Computational Intelligence, Springer

Volume 226

Number

Pages 21 - 30

Notes http://www.springerlink.com/content/c621777791653422/

Abstract A system for tracking moving objects with a known GPS position using a set of PTZ cameras with limited areas of coverage is presented. The basic idea of the system and its possible applications are discussed. The proposed camera calibra-tion technique is used to transform the GPS position to camera settings. The cur-rent position of the tracked object is predicted in order to compensate the trans-mission and processing delays. The distributed client-server system using mobile terminals and its application to tracking objects movement, with additional func-tionality such as showing the position of the objects on the map, is presented. The results of the tests performed in real-life conditions, as well as perspectives of a future development of the system, are discussed.

Streszczenie Rozdział opisuje system sterowania kamerami ruchomymi PTZ realizujący śledzenie poruszającego się obiektu o znanej pozycji GPS. Przedstawione są idea systemu oraz możliwości jego wykorzystania. Opisane są: procedura kalibracji pola widzenia kamery i sposób powiązania z danymi o lokalizacji, procedura predykcji ruchu w celu kompensacji opóźnień czasowych. Omówiony jest zaimplementowany system modułowy, w którego skład wchodzą: terminale mobilne, serwer centralny oraz aplikacja kliencka wizualizująca pozycję obiektów i kamer na mapie. Przedstawione są wyniki testów praktycznych oraz perspektywy rozwoju systemu.

Entry No. 265

Entry type conference paper

Authors A. Czyżewski

English title Applications of audio & video analysis to identifying of security threats

Polish title Zastosowania analizy dźwięku i obrazu do identyfikacji zagrożeń

Conference Konferencja Zespołu Naukowo-Przemysłowego przy Radzie Uzbrojenia MON

Preprint

Number

Volume

Pages

Conference site WAT, Warszawa, Polska

Conference date 13.5.2009

Notes materiał w formie prezentacji Power Point

Abstract The structure of a system of parallel analysis of sound and video data was presented. In particular algorithms concerning detection of the the following threats were discussed: entering a zone, traffci violation, abandoned luggage, gatherings, fights, monitoring camera assault.

Streszczenie Zaprezentowano strukturę systemu łącznej analizy danych pochodzących z analizy obrazu i dźwięku w zastosowaniu do monitorowania obszarów, w których zagrożone jest bezpieczeństwo. W szczególności skupiono się na następujących rodzajach zagrożeń: wkroczenie na niedozwolony obszar, naruszenie przepisów ruchu drogowego, bagaż pozostawiony bez nadzoru, zgromadzenia, bójki, próba uszkodzenia kamery.

Entry No. 266

Entry type conference paper

Authors A. Czyżewski

English title High technology innovations - examples of commercialisation of innovative products

Polish title Innowacje wysokich technologii - przykłady komercjalizacji produktów innowacyjnych

Conference Pre-Event Globe Forum - Innovation Summit Conference

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 27.10.2009

Abstract Przedstawiono i omówiono aplikacje opracowane w Katedrze Systemów Multimedialnych WETI Politechniki Gdańskiej, dotyczące następujących dziedzin: bezpieczeństwo, monitorowanie środowiska, telemedycyna, rejestracja dźwięku i obrazu, ochrona dziedzictwa kulturowego.

Streszczenie Security & Safety applications, environmental monitoring, e-Health multimedia applications, sound & vision recording and mastering, cultural heritage applications elaborated in the Multimedia Systems Department of the Gdansk University of Technology were presented and discussed.

Entry No. 267

Entry type conference paper

Authors A. Czyżewski

English title Multimodal interfaces - new means of communicating with conputers

Polish title Interfejsy multimodalne –nowe sposoby komunikowania się z komputerem

Conference XXV Jubileuszowe Jesienne Spotkania PTI

Preprint

Number

Volume

Pages

Conference site Mrągowo, Polska

Conference date 12.10.2009- 16.10.2009

Abstract A line of original (prepared by the Multimedia Systems Department together with a industrial partner) human-computer interaction (HCI) interfaces was presented that offers extended functionality beyond of a traditional computer mouse and keyboard and allows a user to work on a computer using movements and gestures.

Streszczenie Przedmiotem prezentacji były opracowane dotychczasowo w Katedrze Systemów Multimdialnych we współpracy z partnerem przemysłowym interfejsy multimodalne, które poszerzają możliwości komunikowania się użytkownika z komputerem ponad możliwości wynikające ze stosowania tracyjnej myszy i klawiatury komputerowej.

Entry No. 268

Entry type conference paper

Authors A. Czyżewski

English title Multimedia system supporting identification and combating crime and violence - project progress and hitherto results

Polish title Multimedialny system wspomagający identyfikację i zwalczanie przestępczości oraz terroryzmu - realizacja projektu i jego dotychczasowe wyniki

Conference IV MIĘDZYNARODOWA KONFERENCJA POLICYJNA i WYSTAWA "EUROPOLTECH"

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 22.4.2009- 24.4.2009

Abstract Zaprezentowano zestaw opracowanych algorytmów analizy zdarzeń w strumieniu multimedialnym. Przedstawiono ponadto oprogramowanie komunikacyjne, działające na przenośnych urządzeniach klasy PDA. Rozwiązanie oparte na technologiach GIS, które omówiono, pozwala na śledzenie ruchomych obiektów wykrywanych za pomocą kamer, na mapie numerycznej miasta. Ponadto, zaprezentowano nową generację algorytmów umożliwiających monitoring akustyczny aglomeracji przy wykorzystaniu opracowanych i zainstalowanych stacji węzłowych rozproszonego systemu monitoringowego.

Streszczenie A set of engineered algorithms for analyzing events in the video stream acquired from monitoring cameras and microphones was developed. A communication software was elaborated enabling an effective multimedia communication using handheld PDA devices. A GIS-based solution was implemented allowing for positioning any object tracked by the monitoring cameras on the numerical maps of the city. A new generation of acoustic surveillance system was also developed, basing on the engineered “node stations” that can be installed in the urban agglomerations in order to acquire acoustical signals, to analyze them and to transmit resulting data to a central server.

Entry No. 269

Entry type conference paper

Authors A. Czyżewski

English title Advanced monitoring projects realised at the Gdansk University of Technology

Polish title Projekty z dziedziny zaawansowanego monitoringu realizowane w PG

Conference "Bezpieczeństwo – Innowacyjność – Gospodarka”

Preprint

Number

Volume

Pages

Conference site Jurata, Polska

Conference date 25.8.2009- 29.8.2009

Abstract The engineered surveillance system consists of many input data sources (video cameras, microphones) and a central processing server. Video cameras are placed inside and around of a restricted zone in order to completely cover the monitored area. An array of microphones placed in the area supplements the unusual events recognition.

Streszczenie Budowany system monitoringu obejmuje wiele źródeł danych do analizy (kamery wizyjne, mikrofony) i centralny serwer. Kamery wizyjne są sytuowane wewnątrz nadzrowanego obszaru i na jego obrzeżach, aby możliwa była pełna obserwacja terenu. Zestaw mikrofonowy umieszczony w nadzorowanej strefie uzupełnia rozwiązanie o detekcję neitypowych zdarzeń dźwiękowych.

Entry No. 270

Entry type conference paper

Authors M. Szwarc, A. Czyżewski

English title A genetic algorithms application to railway noise prediction

Polish title

Conference EURONOISE 2009

Preprint

Number

Volume

Pages

Conference site Edynburg, Wielka Brytania

Conference date 26.10.2009- 28.10.2009

Abstract A proposition of a new innovative method for noise prediction based on Genetic Algorithms application is formulated. Genetic Algorithm principles are explained briefly. An explanation of how GeneticAlgorithms were used in the area of noise modeling is provided. Some results that were achieved with a prototype software created for the purpose of this paper are presented and discussed. Finally, some practical technical applications of the new method are pointed out.

Entry No. 271

Entry type conference paper

Authors A. Czyżewski

English title Selected double applications of multimedia technology

Polish title Wybrane „podwójne” zastosowania technologii multimedialnych

Conference Bezpiecz. i wyrównywanie szans kierunkiem działań Unii Europejskiej w XXI wieku

Preprint

Number

Volume

Pages

Conference site Owińska k. Poznania, Polska

Conference date 6.12.2009- 9.12.2009

Abstract Research profile included: Multimedia Systems Department research profile, advanced security technologies for supporting impaired people, audio-video marking of objects, controlling computer by eye-tracking and lips movement, acoustic radar technology.

Streszczenie Plan wystąpienia obejmował: profil badań Katedry Systemów Multimedialnych, zaawansowane technologie bezpieczeństwa jako podstawę do zastosowań w pomocach dla osób niepełnosprawnych, znakowanie wizyjno-foniczne obiektów i przedmiotów, sterowanie za pomocą ust i oczu, radar akustyczny.

Entry No. 272

Entry type conference paper

Authors P. Szczuko, B. Kostek, A. Czyżewski

English title New method for personalization of avatar animation

Polish title Nowa metoda personalizacji animacji wirtualnych postaci

Conference International Conference on Man-Machine Interactions

Preprint

Number

Volume

Pages 435 - 443

Conference site Kocierz, Polska

Conference date 25.9.2009- 27.9.2009

Abstract The paper presents a method for creating a personalized animation of avatar utilizing fuzzy inference. First the user designs a prototype version of animation, with keyframes only for important poses, roughly describing the action. Then animation is enriched with new motion phases calculated by the fuzzy inference system using descriptors given by the user. Various degrees of motion fluency and naturalness are possible to achieve. The proposed algorithm of the animation enrichment based on fuzzy description is thoroughly presented. The first part consists of creating fuzzy rules for the algorithm using results of subjective evaluation of the animated movement, the second one utilizes input descriptors for new motion phases calculation, which are finally added to the animation. Results of subjective evaluation of obtained animations are presented.

Streszczenie Referat przedstawia nową metodę tworzenia spersonalizowanych animacji wirtualnych postaci zwanych awatarami. Wykorzystuje ona wnioskowanie rozmyte. W pracy z przygotowanym systemem, animator projektuje prototypową wersję animacji, zawierającą klatki kluczowe dla najważniejszych póz postaci. Następnie animacja ta jest wzbogacana o nowe fazy, których parametry wyznaczane są w procesie wnioskowania rozmytego, z uwzględnieniem wartości opisowych zmiennych lingwistycznych podanych przez użytkownika. Możliwe jest uzyskiwanie ruchu o różnym stopniu płynności i naturalności, zgodnie z oczekiwaniami użytkownika. W referacie opisano sposób działania algorytmu. Pierwsza część wymaga analizy cech animacji ocenianych subiektywnie w celu wygenerowania bazy reguł. Następnie animacja prototypowa jest parametryzowana i w procesie wnioskowania rozmytego wyznaczane są nowe fazy ruchu, które są automatycznie wstawiane do animacji. Uzyskane w ten sposób animacje zostały poddane ocenie subiektywnej w celu potwierdzenia skuteczności modyfikacji charakteru animowanego ruchu.

Entry No. 273

Entry type journal paper

Authors G. Szwoch, A. Czyżewski, A. Ciarkowski

English title A Double-Talk Detector Using Audio Watermarking

Polish title Detektor mowy równoczesnej wykorzystujący znakowanie sygnałów fonicznych

Journal J. Audio Eng. Soc.

Volume 57

Number 11

Pages 916 - 926

Abstract A novel approach to double-talk detection in the acoustic echo canceler is proposed. A hidden signature is embedded into the arriving signal, using the echo-hiding method. Next detection of the presence of this signature in the microphone signal is performed. The results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

Entry No. 274

Entry type journal paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title Estimation of object size in the calibrated camera image

Polish title Estymacja rozmiaru obiektów w obrazach ze skalibrowanej kamery

Journal Elektronika

Volume 50

Number 3

Pages 10 - 13

Abstract In the paper, a method of estimation of the physical sizes of the objects tracked by the camera is presented. First, the camera is calibrated, then the proposed algorithm is used to estimate the real width and height of the tracked moving objects. The results of size estimation are then used for classification of the moving objects. Two methods of camera calibration are compared, test results are presented and discussed. The proposed estimation algorithm is intended to be used in the video surveillance system for automatic detection of events in the camera images.

Entry No. 275

Entry type report

Authors P. Szczuko, A. Czyżewski, P. Dalka, G. Szwoch, A. Ciarkowski, . et al

English title Report on the collection and analysis of user requirements

Polish title Raport podsumowujący gromadzenie i analizę wymagań użytkowników

Report Number INDECT/D1.1

Notes Projekt INDECT. Nr umowy: 18188

Abstract The INDECT Project, dedicated to creation of Intelligent information system supporting observation, searching and detection for security of citizens in urban environment is the End- User driven enterprise. Therefore for WP1 first step is to name End-User requirements for functionality of the system, specifically for task of Intelligent Monitoring and Automatic Detection of Threats. For the purpose of End-User requirements analysis an End-User Questionnaire was established, created with cooperation of all INDECT Project Partners. This document, Deliverable D1.1., describes End-User Questionnaire structure, its purpose from the point of view of WP1, outcomes of analysis of answers related to WP1 work, and preliminary specification of functionality and hardware of the system fulfilling the requirements for intelligent monitoring and automatic detection of threats.

Entry No. 276

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title

Polish title ZASTOSOWANIE SPOWALNIANIA WYPOWIEDZI W CELU POPRAWY ROZUMIENIA MOWY PRZEZ DZIECI W SZKOLE

Conference XIII Międzynarodowe Sympozjum Reżyserii i Inżynierii Dźwięku ISSET 2009

Preprint

Number

Volume

Pages 81 - 87

Conference site Warszawa, Polska

Conference date 16.10.2009- 18.10.2009

Abstract This paper presents a time-scale modification algorithms that could be used for hearing impairment therapy supported by real-time speech stretching. In this paper the OLA based algorithms and Phase Vocoder were described. In the experimental part usability of those algorithms for real-time speech stretching was discussed.

Streszczenie W artykule przedstawiono algorytmy służące do modyfikacji czasu trwania dźwięku. Szczegółowo opisano algorytmy overlap and add, dwa rodzaje algorytmu synchronized overlap and add oraz algorytm phase vocodera. W części eksperymentalnej zbadano skuteczność działania algorytmów. Sprawdzono także czy możliwe jest przetwarzanie sygnału mowy w czasie rzeczywistym z wykorzystaniem opisanych algorytmów.

Entry No. 277

Entry type book

Authors A. Czyżewski, A. Ciarkowski, P. Dalka, P. Szczuko, G. Szwoch, P. Żwan

English title Multimodal system supporting identification and fight against crime and terrorism

Polish title Multimedialny system wspomagający identyfikację i zwalczanie przestępczości oraz terroryzmu

Editor Wolters Kluwer Polska

Pages 211 - 227

Notes Rozdział w monografi "Praktyczne elementy zwalczania przestępczości zorganizowanej i terroryzmu. Nowoczesne technologie i praca operacyjna" pod red. L. Paprzyckiego i Z. Rau

Abstract Dodatkowi autorzy rozdziału: W. Jędruch, P. Kozielecki. Artykuł zawiera przegląd zakresu prac badawczych, które prowadzone są w Politechnice Gdańskiej w ramach realizowanego projektu badawczo-rozwojowego. Opisany jest rozproszony system monitoringu i komunikacji multimedialnej, realizujący kompleksowe zarządzanie zasobami mobilnymi i komunikację multimedialną w czasie rzeczywistym pomiędzy elementami systemu. System realizuje równoczesną analizę obrazu, dźwięku i sygnałów pochodzących z dołączonych czujników w celu wykrywania określonych typów zdarzeń i automatycznego generowania alertów. Wyposażenie patroli w mobilne terminale multimedialne pozwala na ciągłe monitorowanie sytuacji w punkcie wystąpienia zdarzenia, również w trakcie przemieszczania się i przygotowania do interwencji. Na potrzeby wizualizacji statusu i pozycji jednostek odległych stworzony został system geoinformatyczny, przygotowany do pracy na jednostkach mobilnych, komputerach stacjonarnych oraz PDA. W dalszej części opisane są algorytmy analizy obrazu w celu wykrywania zdarzeń, metody klasyfikacji zdarzeń, przykłady zastosowań. Następnie przedstawione są algorytmy analizy i rozpoznawania dźwięków. Na zakończenie omówione są wyniki prac nad wykorzystaniem technologii RFID do lokalizacji i identyfikacji obiektów oraz przykłady zastosowań.

Streszczenie Additional authors: W. Jędruch, P. Kozielecki. Hitherto achieved results of research project conducted in the Gdańsk University of Technology are presented in the paper. A distributed monitoring communication system is being described allowing a management of mobile resources, the system extended with real-time multimedia transmission features. A simultaneous analysis of audio, video and data from various sensors is performed for detecting incidents and for automatic generation of alerts. Mobile patrols equipped with terminals will be able to continuously monitor events on site, also during translocation and intervention. For a clear and precise visualization of status and positions a geopositioning system was created compatible with desktops, notebooks and PDA-class computers. Algorithms for video analysis are described in the second part of the paper, and methods for detection of events and object classification are discussed. Sample applications are presented. Moreover, audio processing method is described, followed by sound classification methods brief description. The last part discusses results and applications of a RFID system made for localization and identification of objects.

Entry No. 278

Entry type conference paper

Authors P. Szczuko, B. Kostek, A. Czyżewski

English title Enhancement of computer character animation utilizing fuzzy rules

Polish title Poprawa jakości animacji komputerowych postacji z wykorzystaniem reguł w logice rozmytej

Conference KES - Intelligent Interactive Multimedia Systems and Services

Preprint

Number

Volume

Pages

Conference site Mogliano Veneto, Włochy

Conference date 16.7.2009- 17.7.2009

Abstract A new method for processing of character animation is presented. It involves fuzzy inference with both rules and membership functions derived from results of subjective evaluation tests. During processing a new motion phases are added to an animation increasing its quality and changing fluidity and stylization of motion. Animation parameterization is presented, new parameters are designed, and the relation between coefficients proposed and subjective features of motion are established. Quality and fluidity increase are verified during subjective evaluation of animations processed by the created animation enhancement system.

Streszczenie Referat przedstawia nową metodę przetwarzania komputerowych animacji postaci. Wykorzystuje ona wnioskowanie rozmyte, oparte na regułach i funkcjach przynależności uzyskanych w procesie analizy wyników testów subiektywnej oceny jakości animacji. W trakcie przetwarzania do animacji automatycznie dodawane są nowe fazy ruchu, co skutkuje poprawą jakości wizualnej oraz zmianą płynności i stylizacji ruchu w sposób zamierzony. W referacie opisano sposób parametryzacji animacji, zaproponowano nowe współczynniki, których wartości wykazały silną korelację z parametrami subiektywnymi animacji. Poprawa jakości i płynności ruchu zweryfikowane zostały w procesie testów oceny subiektywnej.

Entry No. 279

Entry type conference paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek, H. Skarżyński

English title Long-term continuous complex acoustical climate evaluation in selected schools

Polish title

Conference Euronoise 2009

Preprint

Number

Volume

Pages

Conference site Edynburg, Wielka Brytania

Conference date 26.10.2009- 28.10.2009

Abstract Results of the long-term continuous noise measurement in some selected schools are presented. The autonomous noise monitoring stations, engineered at the Multimedia Systems Department of the Gdansk University of Technology were used. A brief description of the measurement system including its main features is presented. The investigations of measured noise with a focus to the broadband and spectrum analysis both in 1/3 octave bands and critical bands are discussed. The harmfulness of the determined noise level, including the Temporary Threshold Shift simulation, is discussed. Additionally, measured air pollution is illustrated.

Entry No. 280

Entry type conference paper

Authors A. Czyżewski, A. Ciarkowski, P. Dalka, P. Szczuko, G. Szwoch, P. Żwan

English title Advanced technologies for video and audio monitoring

Polish title Zaawansowane technologie monitoringu wizyjnego i dźwiękowego

Conference Cyberspace 2009, Cyberprzestrzeń - zagrożenia i wyzwania

Preprint

Number

Volume

Pages

Conference site Warszawa,

Conference date 15.10.2009

Notes materiały w formie płyty CD-ROM

Abstract Referat omawia najważniejsze cechy inteligentnego monitoringu wizyjnego i dźwiękowego, możliwość połączenia róznych mediów oraz perspektywy wykorzystania zintegrowanych systemów teleinformatycznych i telekomunikacyjnych w dziedzinie bezpieczeństwa.

Streszczenie The paper presents principal features of the intelligent audio-visual monitoring system, discusses issues of media convergence and prospects of applications of integrated teleinformation systems to security domain.

Entry No. 281

Entry type conference paper

Authors B. Kunka, A. Czyżewski, B. Kostek

English title Concentration tests. An application of gaze tracker to concentration exercises

Polish title Zastosowanie systemu sledzenia punktu fiksacji wzroku w badaniach koncentracji uwagi

Conference 1st International Conference on Computer Supported Education

Preprint

Number

Volume

Pages 66 - 66

Conference site Lizbona, Portugalia

Conference date 23.3.2009- 26.3.2009

Bibliographic No. 8

Notes Dostepne streszczenie w Book of Abstracts

Abstract This paper presents different methods of concentration tests. Some existing methods are reviewed and more thoroughly described. The gaze tracking system developed at the Multimedia Systems Department of the Gdańsk University of Technology is presented and its principle of working is explained. Performed tests of the gaze tracker system show that it could make a useful system for concentration exercises. Some selected applications of the gaze tracker to concentration tests are also discussed in the paper.

Streszczenie W artykule zostały przedstawione różne podejścia badania koncentracji uwagi. Wybrane istniejące metody zostały dokładniej opisane. W artykule przedstawiono system śledzenia punktu fiksacji wzroku opracowany w Katedrze Systemów Multimedialnych Politechniki Gdańskiej. Badania przeprowadzone z wykorzystaniem systemu śledzenia wzroku potwierdzają jego użyteczność w prowadzeniu tego typu eksperymentów.

Entry No. 282

Entry type conference paper

Authors M. Lech, B. Kostek, A. Czyżewski, P. Odya

English title Gesture Recognition Framework for Multimedia Content Viewer Controlling

Polish title Środowisko rozpoznawania gestów dla zagadnienia przeglądania treści multimedialnych

Conference SPA 2009 Poznań

Preprint

Number

Volume

Pages 100 - 104

Conference site Poznań, Polska

Conference date 24.9.2009- 26.9.2009

Abstract In the paper a system for controlling a multimedia content viewer by hand gestures is presented. First, selected methods used for gesture recognition are described. Two different application cases of the system, i.e. for multimedia presentation purposes and for multimedia content viewing are outlined. Moreover, a proposal of improvement of the system combining these approaches is also given. The system work cycle is reviewed. The results of the system tests are provided.

Streszczenie W referacie przedstawiono system obsługi za pomocą gestów rąk przeglądarek treści multimedialnych. W pierwszej części przedstawiono wybrane metody rozpoznawania gestów. Przedstawiono dwa różne zastosowania systemu, tj. do prowadzenia prezentacji multimedialnych oraz do przeglądania treści multimedialnych. Omówiony został cykl pracy systemu. W końcowej części przedstawiono wyniki testów systemu.

Entry No. 283

Entry type journal paper

Authors D. Ellwart, A. Czyżewski

English title Interfered speech intelligibility improvement using an adaptive filtering based algorithm

Polish title Poprawa zrozumiałości mowy w obecności zakłóceń z wykorzystaniem algorytmu opartego na filtracji adaptacyjnej

Journal Zeszyty naukowe WE PG

Volume

Number 26/2009

Pages 33 - 36

Bibliographic No. 8

Abstract This paper describes a technique of improving the quality of speech signals recorded under interference (adaptive filter based algorithm). The whole idea of an algorithm is shown, and other possibilities – such as spectral subtraction – of sound processing are discussed. Results of the tests are presented. A way of integrating the elaborated method with an agglomeration acoustic monitoring system is proposed.

Streszczenie W komunikacie opisano nowy sposób wykorzystania filtracji adaptacyjnej do poprawy jakości dźwięków użytecznych nagrywanych w obecności zakłóceń. Przedstawiono stworzony algorytm adaptacji, omówiono możliwości przetwarzania dźwięku dodatkowymi algorytmami, opisano przeprowadzone eksperymenty. Zamieszczono i omówiono wyniki eksperymentów. Zaproponowano sposób integracji opracowanej metody z systemami akustycznego monitorowania aglomeracji miejskiej.

Entry No. 284

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński, Ł. Kosikowski, P. Odya, B. Kostek, G. Szwoch

English title

Polish title Mobline multimedialne systemy do badań przesiewowych słuchu u mowy na urządzenia klasy PDA

Conference XVI giełda polskich wynalazków nagrodzonych na światowych targach wynalazczości w 2008 roku

Preprint

Number

Volume

Pages 58 - 58

Conference site Warszawa, Polska

Conference date 9.3.2009- 15.3.2009

Streszczenie Program do diagnozy słuchu zawiera ankietę, w której pytania są podobne do tych zadawanych przez audiologa podczas typowej wizyty kontrolnej. Dodatkowo użytkownik ma możliwość wykonania dwóch testów. Pierwszy z nich bazuje na audiometrii tonalnej. Drugi oparty jest na audiometrii słownej w szumie. Po wypełnieniu ankiety i wykonaniu ćwiczeń, system ekspercki automatycznie podejmuje decyzję o tym, czy badana osoba nie ma problemów ze słuchem lub czy występują u niej zaburzenia słuchu i konieczna jest wizyta u audiologa. Program do diagnozy i terapii mowy zawiera ankietę, w której pytania są podobne do tych zadawanych przez foniatrę podczas typowej wizyty kontrolnej. System zawiera szereg interaktywnych testów dźwiękowych, umożliwiających detekcję potencjalnych dysfunkcji głosu i mowy m.in. ocenę słuchu fonemowego, ocenę motoryki narządów mowy, ocenę artykulacji, słownictwa i gramatyki. Na podstawie uzyskanych wyników system automatycznie generuje wynik. Cechami charakterystycznymi obu systemów jest duża mobilność oraz wyjątkowa prostota testów przy jednoczesnym zachowaniu wysokiej wiarygodności uzyskiwanych wyników.

Entry No. 285

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title Lip movement and gesture recognition for a multimodal human-computer interface

Polish title Rozpoznawanie ruchów i gestów wykonywanych ustami na potrzeby multimodalnego interfejsu komputerowego

Conference International Multiconference on Computer Science and Information Technology, 2nd International Symposium on Multimedia – Applications and Processing

Preprint

Number

Volume

Pages 365 - 369

Conference site Mrągowo, Polska

Conference date 12.10.2009- 14.10.2009

Abstract This paper presents an algorithm for lip movement tracking and lip gesture recognition for the purpose of the multimodal human-computer interface (HCI) called LipMouse. This solution allows a user to work on a computer using movement and gesture made by his/her mouths only and is especially useful for severely disabled and paralyzed people. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in the lower part of the face region and is used to track lip movements. Three lip gestures are recognized: mouth opening, sticking out the tongue and making the mouth into an „O” shape. Lip gesture recognition is performed by an artificial neural network and utilizes an accurate lip shape obtained by the means of lip image segmentation using fuzzy clustering.

Streszczenie Artykuł przedstawia algorytm do śledzenia ruchu ust i rozpoznawania gestów wykonywanych ustami na potrzeby multimodalnego interfejsu komputerowego nazwanego Ustomysz. Rozwiązanie to pozwala użytkownikowi na pracę na komputerze z wykorzystaniem jedynie ruchów i gestów wykonywanych ustami i jest szczególnie przydatne dla osób niepełnosprawnych ruchowo. Obraz twarzy użytkownika jest przechwytywany za pomocą standardowej kamerki internetowej. Detekcja twarzy bazuje kaskadzie klasyfikatorów AdaBoost. Obszar ust lokalizowany jest w dolnej części znalezionej twarzy i wykorzystywany do śledzenia ruchów ust. Rozpoznawane są trzy gesty wykonywane ustami: otwarcie ust, wysunięcie języka i złożenie ust w dzióbek. Klasyfikacja gestu odbywa się za pomocą sztucznej sieci neuronowej i wykorzystuje dokładny obraz ust uzyskany za pomocą aproksymacji ich kształtu za pomocą elipsy.

Entry No. 286

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski

English title Software for calculation of noise maps implemented on the supercomputer

Polish title

Journal Task Quarterly

Volume 13

Number 3-4

Pages

Notes w druku

Abstract This paper presents investigation results relevant to the implementation of the algorithms for the calculation of noise maps. The aim of the implementation of the algorithms on the computer cluster is explained. Selected implementation details of the software called the noise propagation model are described. The interaction of the software with the data acquisition system is presented. Noise maps obtained by exploitation of the described software are presented. A comparison between outcomes of implemented models and simulation results of the commercial program is presented. An analysis of the computation efficiency is also presented. The discussion concerning dynamic presentation of the noise maps is also presented.

Entry No. 287

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title LipMouse - Novel Multimodal Human-Computer Interaction Interface

Polish title Ustomysz- nowatorki, multimodalny interfejs komputerowy

Conference SIGGRAPH

Preprint

Number

Volume

Pages 1

Conference site New Orleans,

Conference date 3.8.2009- 7.8.2009

Notes Jednostronicowy opis posteru

Entry No. 288

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title TIME-SCALE MODIFICATION OF SPEECH SIGNALS FOR SUPPORTING HEARING IMPAIRED SCHOOLCHILDREN

Polish title

Conference NTAV/SPA 2009

Preprint

Number

Volume

Pages 159 - 162

Conference site Poznań, Polska

Conference date 24.9.2009- 26.9.2009

Abstract A study of time scale modification algorithms applied to hearing impaired schoolchildren supporting is presented. Variety of algorithms are considered, namely: overlap and add, two variations of synchronized overlap and add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Streszczenie W referacie przedstawiono analizę użyteczności algorytmów modyfikacji czasu trwania nagrania do celów wspomagania rozumienia mowy przez dzieci z pogorszoną rozdzielczością czasową słuchu. Sprawdzono skuteczność działania oraz złożoność obliczeniową algorytmów OLA, SOLA i wokodera fazowego.

Entry No. 289

Entry type conference paper

Authors Ł. Kosikowski, A. Czyżewski

English title Computer Based System for Strabismus and Amblyopia Therapy

Polish title System komputerowy do terapii zeza i amblyopii

Conference International Multiconference on Computer Science and Information Technology

Preprint

Number

Volume

Pages 407 - 410

Conference site Mrągowo, Polska

Conference date 12.10.2009- 14.10.2009

Abstract Development of the computer based system for strabismus and amblyopia therapy is discussed in the paper. In the case of amblyopia or 'lazy-eye' syndrome, the therapy is typically conducted in two ways: by wearing a patch over the non-amblyopic eye for several hours per day or blurring the vision in the good eye with penalizing drops or with extra power in the glasses. The disadvantage of this types of therapy is the lack of binocular vision. The proposed approach retains binocular vision. Parameters corresponded to strabismus can be measured much faster using the described system. Another advantage is that therapy may take place at user's home, without time-consuming visits to the clinic.

Entry No. 290

Entry type conference paper

Authors P. Maziewski, P. Suchomski, B. Kostek, A. Czyżewski

English title An Intuitive Graphical User Interface for the Parkinson’s Disease Patients

Polish title Intuicyjny interfejs graficzny dla osób z chorobą Parkinsona

Conference 4th International IEEE EMBS Conference on Neural Engineering

Preprint

Number

Volume

Pages 14 - 17

Conference site Antalya, Turcja

Conference date 29.4.2009- 2.5.2009

Abstract In this paper a discussion on the design and development of the graphical user interface (GUI) dedicated to Parkinson’s Disease (PD) patients is presented. The interface is intended for a group of PD patients with less severe motor symptoms, who are living at their home independently or with help of a caregiver. The GUI is designed to enable an interaction for the non-computer literate PD patients with a computer-based system. The system will allow for objectively recording the patient diaries, self assessments, taken medication confirmations and other features important for the diagnosis. This will enable physicians to prepare more accurate evaluation and better diagnostic decisions.

Streszczenie W referacie przedstawiono projekt i przykładowe wdrożenie graficznego interesu użytkownika przeznaczonego dla osób z chorobą Parkinsona. Interfejs przeznaczony jest dla osób w mniej zaawansowanym stadium choroby – z mniejsza ilością symptomów, żyjących samodzielnie lub korzystających z pomocy opiekuna. Interfejs zaprojektowano w celu umożliwieni interakcji z komputerem osób nieposiadających wcześniejszych doświadczeń w pracy z tego typu urządzeniem. Interfejs pozwala na obiektywną rejestrację dzienniczków pacjenta, jego samoocen, potwierdzeń przyjęcia lekarstw jak i wielu innych danych potrzebnych w celu poprawnej diagnozy. Dzięki wynikom uzyskiwanym za pomocą interfejsu, lekarz prowadzący może opracować lepszą – bardziej dokładną – terapię.

Entry No. 291

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski, P. Kozielecki

English title Dynamic computation of acoustic field distribution in the city area employing a computer cluster

Polish title

Conference NOVEM

Preprint

Number

Volume

Pages 1 - 6

Conference site Oxford, Wielka Brytania

Conference date 5.4.2009- 8.4.2009

Abstract Numerical computation of acoustic field distribution over the urban area employing the computer cluster are in focus. A method of gathering data for numerical model of the noise source, engineered at the Multimedia Systems Department, is introduced. A concept, assumptions and an implementation of the method of data acquiring needed for computations are then presented. Also, features of the computer cluster computation are discussed. The main idea of dynamic noise maps computation and visualization is thoroughly described. The system proposed by the authors used to noise data acquisition is also shown. Noise maps resulting from the algorithmical computations are compared to measured sound levels, leading towards an automated calibrating and updating noise maps.

Entry No. 292

Entry type journal paper

Authors M. Szczodrak, P. Dalka, A. Czyzewski

English title Performance evaluation of video object tracking algorithm in autonomous surveillance system

Polish title

Journal IEEE ICIT 2010

Volume

Number

Pages 31 - 34

Abstract Results of performance evaluation of a video object tracking algorithm are presented. The method of moving objects detection and tracking is based on background modelling with mixtures of Gaussians and Kalman filters. An emphasis is put on algorithm’s efficiency with regards to its settings. Utilized methods of performance evaluation based on comparison of algorithm output to manually prepared reference data are introduced. The experiments aimed at examining the performance achieved with various object detection algorithm parameter settings are presented and discussed.

Entry No. 293

Entry type conference paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title

Polish title Rozwiązywanie konfliktów w śledzeniu obiektów ruchomych w celu automatycznej detekcji zdarzeń

Conference XIII Sympozjum Nowości w Technice Audio i Wideo

Preprint

Number

Volume

Pages

Conference site Szczecin, Polska

Conference date 14.10.2010- 16.10.2010

Notes Materiały konferencyjne na płycie CD

Abstract Performance of automatic event detection in video from monitoring cameras depends on accuracy of the low-level image analysis algorithms, such as detection and tracking of moving objects. The main problem in object tracking is related to situations in which relationship between detected moving objects and trackers is ambiguous. An algorithm for resolving such conflicts is presented in this paper. The algorithm utilizes predicted states calculated by Kalman filters for estimation of trackers position, then it uses color and texture descriptors in order to match moving objects with trackers. Problematic situations, such as splitting objects, are addressed. Test results are presented and discussed. The complete system is designed to extend functionality of current video monitoring systems by providing a solution for automatic detection of security threats.

Streszczenie W referacie przedstawiono algorytm rozwiązywania konfliktów w śledzeniu obiektów ruchomych. Proponowana metoda wykorzystuje predykcję stanu obiektu obliczaną przez filtry Kalmana oraz dopasowuje wykryte obiekty do struktur śledzących ich ruch na podstawie deskryptorów koloru i tekstury. Omówiono specyficzne sytuacje powodujące konflikty, takie jak rozdzielanie obiektów. Przedstawiono wyniki testów. Algorytm może być zastosowany w systemie automatycznego wykrywania zagrożeń w monitoringu wizyjnym.

Entry No. 294

Entry type conference paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title A Framework for Automatic Detection of Abandoned Luggage in Airport Terminal

Polish title Framework do automatycznej detekcji porzuconego bagażu w terminalu lotniczym

Conference KES 2010, The 3rd International Symposium on Intelligent and Interactive Multimedia: Systems and Services

Preprint

Number

Volume

Pages 13 - 23

Conference site Baltimore, USA

Conference date 28.7.2010- 30.7.2010

Notes rozdział w książce G.A. Tsihrintzis et al. (Eds.): Intelligent Interactie Multimedia Systems and Services

Abstract A framework for automatic detection of events in a video stream transmitted from a monitoring system is presented. The framework is based on the widely used background subtraction and object tracking algorithms. The authors elaborated an algorithm for detection of left and removed objects based on mor-phological processing and edge detection. The event detection algorithm collects and analyzes data of all the moving objects in order to detect events defined by rules. A system was installed at the airport for detecting abandoned luggage. The results of the tests indicate that the system generally works as expected, but the low-level modules currently limit the system performance in some problematic conditions. The proposed solution may supplement the existing monitoring sys-tems in order to improve the detection of security threats.

Entry No. 295

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Robustness Analysis of Watermarking-Based DTD Algorithm under Time-Variable Echo Conditions

Polish title

Conference MCSS 2010: IEEE International Conference on Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages 25 - 29

Conference site Kraków, Poland

Conference date 6.5.2010- 7.5.2010

Notes ISBN 978-83-88309-92-2

Abstract A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of acoustic echo path is discussed and explanation as to why such conditions occur in practical situations is provided. The environment and the procedure used for simulation of test conditions and evaluation of DTD algorithms are presented. Results of comparing performance of the introduced watermarking DTD with the well-established Geigel DTD algorithm are presented.

Entry No. 296

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Advanced Surveillance and Operational Communication System Employing Mobile Terminals

Polish title

Conference MCSS 2010: IEEE International Conference on Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages 30 - 34

Conference site Kraków, Poland

Conference date 6.5.2010- 7.5.2010

Notes ISBN 978-83-88309-92-2; Projekt INDECT. Nr umowy: 18188

Abstract Distributed surveillance and operational communications system based on XMPP protocol is presented. Its architecture and assumptions leading to the depicted design are shown. Features of XMPP protocol are portrayed with the emphasis on those most important in the context of the application. Real-time multimedia transmission with the use of Jingle/XMPP extension is discussed. The use of PDA-class computers as mobile terminals is introduced. Technical aspects of multimedia communications session establishment in the presence of Network Address Translation devices and firewalls are presented.

Entry No. 297

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title

Polish title System eliminacji echa akustycznego bazujący na znakowaniu sygnałów fonicznych

Conference Usługi i sieci teleinformatyczne następnej generacji - aspekty techniczne, aplikacyjne i rynkowe

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 23.11.2010- 24.11.2010

Notes Projekt PBZ-MNiSW-02/II/2007

Streszczenie Przedstawiono nowatorski algorytm detekcji mowy równoczesnej (DTD), bazujący na technikach związanych ze znakowaniem sygnałów akustycznych. Algorytmy DTD wykorzystywane są w systemach eliminacji echa akustycznego (EAC) do sterowania procesem adaptacji filtra estymującego odpowiedź toru echa. Typowo spotykane rozwiązania DTD stanowią pewien kompromis pomiędzy wysoką jakością detekcji i akceptowalnym kosztem obliczeniowym. Algorytm zaproponowany przez autorów wykorzystuje specyficzne rodzaj znakowania sygnałów cyfrowych (tzw. semi-fragile watermarking), dzięki czemu możliwe staje się osiągnięcie zadowalających wyników przy niewielkim obciążeniu numerycznym. Niezauważalna dla uczestników konwersacji sygnatura jest osadzana w sygnale pochodzącym od rozmówcy odległego, tuż przed odtworzeniem go w głośniku rozmówcy lokalnego. Następnie obecność sygnatury jest wykrywana w sygnale zarejestrowanym przez mikrofon rozmówcy lokalnego. Przy właściwym wyborze zastosowanej techniki znakowania znak wodny zostaje zatarty i staje się niewykrywalny w momentach, gdy odzywa się rozmówca lokalny. Pozwala to na precyzyjne określenie fragmentów sygnału, podczas których występuje zjawisko mowy równoczesnej. Zastosowanie metody znakowania poprzez ukrywanie echa (echo-hiding) pozwala spełnić wymogi algorytmu. Wyniki przeprowadzonych eksperymentów dowodzą skuteczności proponowanego algorytmu DTD.

Entry No. 298

Entry type conference paper

Authors A. Czyżewski, A. Ciarkowski

English title Double talk detector based on audio watermarking

Polish title Detektor pasożytniczego echa w torach telekomunikacyjnych wykorzystujący znakowanie wodne sygnałów

Conference 2nd Pan-American/Iberian Meeting on Acoustics, Acoustical Soc. of America

Preprint

Number

Volume

Pages

Conference site Cancun, Mexico

Conference date 15.11.2010- 19.11.2010

Abstract 2aSP13. Doubletalk detector based on audio watermarking. Session: Tuesday Morning, Nov 16 Time: 11:45 Author: Andrzej Czyzewski Location: Multimedia Systems Dept., Gdansk Univ. of Technol.,Narutowicza 1112, 80233 Gdansk, Poland, ac@pg.gda.pl Author: Andrzej Ciarkowski Location: Multimedia Systems Dept., Gdansk Univ. of Technol.,Narutowicza 1112, 80233 Gdansk, Poland, ac@pg.gda.pl Abstract: The doubletalk detection (DTD) algorithm is a vital part of an acousticecho cancellation system commonly used in the telecommunication applications. The role of the DTD algorithm is to stop the adaptation process of the adaptivefilter used for the estimation of echo path response in the presence of doubletalkcondition. The typically used DTD solutions are often a compromise betweenhigh efficiency and affordable computational cost. The algorithm introducedby the authors is based on a novel principle related to the socalled semifragilewatermarking, making it possible to achieve satisfactory performance withlimited processing burden. A hidden signature is embedded into the arrivingfarend speaker signal just before it is replayed to the nearend speaker.Consequently, the presence of that signature is detected in the signal recordedby the nearend speaker microphone. The appropriate choice of the employedwatermarking technique causes the signature to become undetectable duringthe nearend speaker’s talkspurts, precisely identifying the doubletalkperiods. The use of echohiding watermarking method allows fulfilling thealgorithm requirements. The results of performed experiments prove the goodperformance of the proposed DTD solution. [Research funded by the Polish Ministry of Science and Higher Education within Grant No. PBZMNiSW02II2007.]

Streszczenie Słowa kluczowe: detekcja echa w kanałach telekomunikacyjnych; znakowanie wodne; filtracja adaptacyjna Algorytm detekcji i eliminacji echa jest istotnym składnikiem łańcucha przetwarzania sygnałów w torach telekomunikacyjnych. W zaproponowanym algorytmie kluczową rolę odgrywa znakowanie wodne sygnału mowy. Na tej podstawie wykrywane jest pasożytnicze echo a po jego wykryciu uaktywniany jest filtr adaptacyjny, który to echo usuwa.

Entry No. 299

Entry type journal paper

Authors Ł. Kosikowski, A. Czyżewski

English title

Polish title Badanie i terapia zaburzeń widzenia obuocznego wspomagana przez bezkontaktowy system śledzenia punktu fiksacji wzroku

Journal Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki PG

Volume

Number 28

Pages 89 - 92

Streszczenie Na rynku znajduje się klika systemów pozwalających na badanie lub trening syndromu leniwego oka z użyciem komputera PC. Niewiele z nich bazuje na wirtualnej rzeczywistości. Większość jedynie skupia się na terapii niedowidzenia bez mierzenia jakichkolwiek parametrów lub wykonuje tylko same pomiary. Proponowane rozwiązanie to kompletny system diagnostyczno - terapeutyczny do detekcji i terapii zaburzeń widzenia obuocznego – zwłaszcza zeza (małego i średniego stopnia) oraz syndromu leniwego oka. Aby zapewnić większy obiektywizm badań prowadzonych przy użyciu opracowanego systemu, zastosowano śledzenie punktu fiksacji wzroku. System śledzenia punktu fiksacji charakteryzuje się brakiem jakichkolwiek fizycznych elementów montowanych na ciele użytkownika, zaś detekcja punktu fiksacji bazuje na analizie odbić promieni w zakresie podczerwieni.

Entry No. 300

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title

Polish title Algorytmy do korygowania zapisu dźwięku filmowego

Conference Usługi i sieci teleinformatyczne następnej generacji - aspekty techniczne, aplikacyjne i rynkowe

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 23.11.2010- 24.11.2010

Notes Plakat

Streszczenie W pracy przedstawiono system służący do automatycznej rekonstrukcji nagrań foniczno-wizyjnych. Pozwala on na analizę i rekonstrukcję zniekształcenia obrazu polegającego na niestabilności klatek filmowych. Filmowa ścieżka dźwiękowa rekonstruowana jest poprzez eliminację dwóch typów zniekształceń: szerokopasmowego szumu oraz drżenia i kołysania dźwięku. Przestawione algorytmy zostały opracowane w ten sposób, aby zminimalizować konieczność uczestnictwa człowieka w procesie digitalizacji i rekonstrukcji nagrań gromadzonych w repozytorium foniczno-wizyjnym. Korzystanie z opisanych metod rekonstrukcji możliwe jest poprzez serwis internetowy pozwalający na dodawanie i rekonstruowanie własnych nagrań do repozytorium oraz przeglądanie nagrań dodanych przez innych użytkowników.

Entry No. 301

Entry type journal paper

Authors A. Czyżewski, P. Maziewski, A. Kupryjanow

English title Reduction of parasitic pitch variations in archival musical recordings

Polish title

Journal Signal Processing

Volume

Number 90/2010

Pages 981 - 990

Bibliographic No. 23

Abstract A new method for reducing parasitic pitchvariations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be easily tuned to analyze archival recordings. The new method is also compared to some previous approaches to pitch variation correction.

Streszczenie W artykule przedstawiono nową metodę redukcji pasożytniczych modulacji częstotliwości w archiwalnych nagraniach dźwiękowych. Metoda ta jest przeznaczona do analizy filmowej optycznej dźwiękowej ścieżki. Wykorzystuje ona analizę obrazu w celu wyznaczenia oraz usunięcia efektów związanych ze skurczem taśmy takich jak pasożytnicze modulacje towarzyszące dźwiękowi w filmie. Skuteczność działania nowej metody została porównana z kilkoma innymi metodami redukcji pasożytniczych modulacji częstotliwości.

Entry No. 302

Entry type

Authors A. Czyżewski, J. Smulko, M. Kotarski

English title

Polish title Sposób emisji substancji zapachowych

Notes patent nadany w 2014 r.

Abstract Istotą rozwiązania jest emiter kompozycji zapachowych programowany i sterowany za pomocą komputera, wykorzystujący zimną dyfuzję substancji aromatowych.

Entry No. 303

Entry type conference paper

Authors K. Łopatka, J. Kotus, A. Czyżewski

English title Improving automatic surveillance by sound analysis

Polish title Wzbogacenia automatycznego nadzoru bezpieczeństwa o analizę dźwięku

Conference 5th Security research conference

Preprint

Number

Volume

Pages 51 - 51

Conference site Berlin, Republika Federalna Niemiec

Conference date 7.9.2010- 9.9.2010

Notes rozszerzone streszczenie opublikowane na płycie CD

Abstract An automatic surveillance system, based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands or in parameter values. Among the parameters, there are specially defined features connected with the energy ratio in certain sub-bands of the spectrum and the shape of the signal around transients. The parameter set is completed with MPEG-7 descriptors chosen on the basis of experiments and statistical analysis. For classification a Support Vector Machine classifier is implemented. The model is built using a test set of sounds recorded in real conditions. Separate classifiers are implemented for different classes of sound events. The length of the analysis frame and threshold values for event detection are set to fit the characteristics of a certain type of sound. The accuracy is then improved by the decision procedure. The final indication of the system is derived from decisions of all classifiers, compared in a number of adjacent analysis frames. The classifier yields high accuracy of detecting typical alarming sound events like gunshot, scream, explosion, broken glass or horn abuse. It also provides the possibility to retrain the model and add a new type of event to be recognized by the system. The described solution can be implemented in an automatic surveillance system together with image analysis. Processing both sound and image leads to a significant improvement of the event detection rate. The developed algorithms can be implemented to detect dangerous events in large public areas like stations, airports or stadiums.

Streszczenie System automatycznego nadzoru bezpieczeństwa może być wzbogacony o algorytmy analizy danych fonicznych. Niebezpieczne lub niedozwolone sytuacje mogą być skojarzone z charakterystycznymi zdarzeniami dźwiękowymi, takimi jak krzyki lub nagłe emisje energii akustycznej. Przedstawiono metodę detekcji i klasyfikacji niepokojących zdarzeń dźwiękowych. Detekcja opiera się na obserwacji nagłych zmian poziomu dźwięku w charakterystycznych pasmach lub wartości parametrów. Wśród parametrów znajdują się parametry związane z energią sygnału oraz parametry zdefiniowane w standardzie MPEG-7. Zaimplementowano algorytmy klasyfikacji dostosowane do rozpoznawania różnych typów zdarzeń dźwiękowych. Końcowa decyzja jest podejmowana poprzez agregację wskazań klasyfikatorów. Klasyfikację cechuje wysoka skuteczność rozpoznawania sytuacji zagrażających bezpieczeństwu, takich jak wystrzał z broni, krzyk, wybuch, czy dźwięk tłuczonego szkła. Opisane rozwiązanie znajduje zastosowanie w systemie automatycznego nadzoru bezpieczeństwa, realizującego również funkcję rozpoznawania zdarzeń wizyjnych. Przetwarzanie współbieżne obu modalnośći prowadzi do zwiększenia skuteczności rozpoznawania niebezpiecznych sytuacji. Algorytmy mogą być zastosowane w nadzorze miejsc publicznych, takich jak dworce, lotniska i stadiony.

Entry No. 304

Entry type conference paper

Authors K. Łopatka, P. Suchomski, A. Czyżewski

English title Time-domain prosodic modifications for Text-to-speech synthesizer

Polish title Modyfikacja prozodii wypowiedzi w dziedzinie czasu w systemie syntezy mowy

Conference Signal Processing Algorithms, Architectures, Arrangements and Applications SPA 2010

Preprint

Number

Volume

Pages 73 - 77

Conference site Poznań, Polska

Conference date 23.8.2010- 25.8.2010

Abstract An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.

Streszczenie Przedstawiono zastosowanie algorytmów przetwarzania sygnałów dla celów modyfikacji prozodii wypowiedzi w systemie syntezy mowy języka polskiego. Modyfikacja prozodii ma na celu zwiększenie naturalności generowanej wypowiedzi. Metoda oparta jest an algorytmie TD-PSOLA. Opracowane algorytmy znajdują zastosowanie w multimodalnych interfejsach komputerowych.

Entry No. 305

Entry type journal paper

Authors A. Kupryjanow, A. Czyżewski

English title REALTIME SPEECH STRECHING FOR SUPPORTING HEARING IMPAIRED SCHOOLCHILDREN

Polish title Spowalnianie mowy w czasie rzeczywistym w celu poprawy rozumienia mowy przez dzieci w szkole

Journal ELEKTRONIKA - KONSTRUKCJE, TECHNOLOGIE, ZASTOSOWANIA

Volume

Number 3/2010

Pages 24 - 28

Abstract A study of time scale modification algorithms applied to hearing impaired schoolchildren supporting is presented. Variety of algorithms are considered, namely: overlap and add, two variations of synchronized overlap and add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Streszczenie W artykule przedstawiono przegląd algorytmów służących do modyfikacji czasu trwania mowy oraz analizę możliwość ich zastosowania w celu wspomagania procesu rozumienia mowy u dzieci w szkole. Zbadano następujące algorytmu: overlap and add, dwie odmiany algorytmu synchronous overlap and add oraz wokoder fazowy. W części eksperymentalnej zbadano skuteczność działania algorytmów oraz ich złożoność obliczeniową.

Entry No. 306

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title Real-time speech-rate modification experiments

Polish title

Conference Konwencja AES 2010

Preprint 8052

Number

Volume

Pages

Conference site Londyn, Wielka Brytania

Conference date 22.5.2010- 25.5.2010

Abstract The paper presents algorithm designed for real-time speech time scale modification (stretching). An algorithm is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness as well as quality of signal processing algorithms are examined experimentally.

Entry No. 307

Entry type conference paper

Authors D. Ellwart, A. Czyżewski

English title Corrupted speech intelligibility improvement using adaptive filter based algorithm

Polish title Poprawa zrozumiałości mowy z wykorzystaniem algorytmu opartego o filtrację adaptacyjną

Conference Audio Engineering Society, Audio Forensics

Preprint

Number

Volume

Pages 97 - 100

Conference site Hillerød, Dania

Conference date 17.6.2010- 19.6.2010

Notes ISBN: 978-0-937803-76-9

Abstract A technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithm employing adaptive filtration is described and additional possibilities of speech intelligibility improvement are discussed. Results of the tests are presented.

Streszczenie W pracy przedstawiono metodę poprawy jakości sygnałów w towarzystwie zakłóceń o wysokim poziomie. Omówiono zaproponowany algorytm wykorzystujący filtrację adaptacyjną oraz odejmowanie widmowe. Opisano przeprowadzone eksperymenty oraz zaproponowano metody dalszej poprawy zrozumiałości mowy.

Entry No. 308

Entry type conference paper

Authors P. Marcinkowski, P. Szczuko, A. Czyżewski

English title

Polish title Algorytm ekstrakcji cech biometrycznych twarzy

Conference VI Sympozjum Naukowe Techniki Przetwarzania Obrazu 2010

Preprint

Number

Volume

Pages

Conference site Serock, Polska

Conference date 18.11.2010- 20.11.2010

Streszczenie W referacie zawarto opis metody automatycznej lokalizacji oraz parametryzacji punktów charakterystycznych w obrazie twarzy. Do lokalizacji punktów charakterystycznych wykorzystano zmodyfikowany algorytm EBGM (ang. Elastic Bunch Graph Matching). Algorytm ten pozwala lokalizować punkty w obrazie przy założeniu niezmienności topologii grafu połączeń między nimi. W referacie przedstawiono podstawy teoretyczne metody oraz zaimplementowany trójetapowy proces lokalizacji punktów: lokalizacja twarzy w obrazie za pomocą kaskadowego klasyfikatora Haara oraz nałożenie ogólnej reprezentacji grafu, pozycjonowanie ogólnej reprezentacji grafu względem wykrytych oczu, odkształcanie grafu w celu pokrycia węzłami odpowiednich punktów charakterystycznych nieznanej twarzy. Proces odkształcania grafu uwzględnia dodatkowe kryteria odległościowe, odzwierciedlające znajomość geometrii ludzkiej twarzy. Punkt charakterystyczny w obrazie, będący węzłem grafu, opisany jest zbiorem zespolonych współczynników (cech, ang. jet) wyliczanych jako seria dwuwymiarowych falek Gabora (40 zespolonych falek podzielonych na 5 skal i 8 orientacji). W opisywanych badaniach posługiwano się grafem środkowej części twarzy: ust, nosa i oczu. To podejście pozwoliło analizować obrazy twarzy en face oraz zniwelować przypadki zasłoniętych punktów charakterystycznych. Porównanie twarzy, dążące ostatecznie do weryfikacji osoby, zrealizowane zostało na poziomie wyliczonych parametrów jetów. Przyjęte są dwa podejścia: uwzględniające tylko amplitudy falek oraz dodatkowo fazy falek. Przedstawiono i skomentowano wyniki zrealizowanego eksperymentu rozpoznawania 80 osób sfotografowanych w pozycji en face (wykorzystano ogólnodostępną bazę FERET). Zamieszczono podsumowanie i wnioski.

Entry No. 309

Entry type conference paper

Authors T. Merta, A. Czyżewski

English title Superresolution Algorithm to Video Surveillance System

Polish title

Conference 7th International Conference on Multimedia & Network Information Systems

Preprint

Number

Volume 80

Pages 105 - 112

Conference site Wrocław, Polska

Conference date 23.9.2010- 24.9.2010

Abstract An application of a multiframe SR (superresolution) algorithm applied to video monitoring is described. The video signal generated by various types of video cameras with different parameters and signal distortions which may be very problematic for superresolution algorithms. The paper focuses on disadvantages in video signal which occur in video surveillance systems. Especially motion estimation and its influence on superresolution effectiveness is analyzed. In proposed initial solution a proper frame shift estimation is shown. Tests of the proposed algorithm performed video frames from real surveillance system in which many described difficulties were found. Result image examples show image resolution enhancement with plate numbers. The improvement of image quality is discussed in reference to further plate recognition.

Entry No. 310

Entry type journal paper

Authors P. Szczuko, B. Kostek, A. Czyżewski

English title Comparison between natural movements and automatically generated animated motion employing motion capture and fuzzy logic techniques

Polish title Porównanie pomiędzy ruchem naturalnym w animacji komputerowej a generowanym automatycznie z wykorzystaniem przechwytywania ruchu i logiki rozmytej

Journal J. Expert Systems

Volume

Number

Pages

Notes Po recenzjach i poprawkach (do uzupełnienia po wydaniu)

Abstract The paper describes a new method for automatic generation of animated motion with quality comparable to natural motion. First the reference motion data are gathered utilizing a motion capture system. Then these data are reduced and only main poses of the action are left. The resulting motion is simplified and its quality is considerably decreased. Then, utilizing the automatic motion enhancement system, ANIMATOR, a new version of the action is generated, based on input poses and subjective descriptors given by the user. Various degrees of motion fluency and naturalness are possible to achieve this way. The proposed algorithm of the animation enrichment is based on fuzzy description of motion parameters and motion subjective features. The first step consists in creating fuzzy rules for the algorithm based on subjective evaluation of the animated movement. The second stage utilizes input descriptors for the new motion phases calculation, which are finally added to the animation. It is assumed that such processing increases naturalness and quality of motion, and this is verified by subjective evaluation tests. Finally a comparison between the original and the recreated motion is performed. Scores obtained in evaluation tests suggest that a substantial increase in quality between reduced and recreated versions is obtained, matching the original one. The method for motion enhancement is useful for automatic motion generation and can be paired with motion data reduction procedure for regaining naturalness. Moreover the reduced version can easily be edited in the ANIMATOR system, and in this way a new action can be created

Streszczenie Referat opisuje nową metodę automatycznego generowania animowanego ruchu postaci o jakości porównywalnej z rzeczywistym ruchem człowieka. W pierwszym etapie gromadzone są dane referencyjne rzeczywistego, przechwyconego ruchu (wykorzystany system Motion Capture), następnie dane te są poddawane redukcji do głównych póz składających się na akcję aktora. Wynikowy ruch jest uproszczony i pozbawiony cech indywidualnych osoby i jego subiektywna jakość jest znacząco obniżona. Następnie w celu poprawy jakości wykorzystywany jest autorski system wzbogacania animowanego ruchu ANIMATOR i, w oparciu o pozostałe pozy oraz parametry wejściowe, generowana jest nowa wersja ruchu. Użytkownik ma możliwość modyfikować subiektywną płynność i naturalność wynikowego ruchu. Zastosowane przetwarzanie wykorzystuje logikę rozmytą i rozmyte parametry opisu subiektywnych cech ruchu. Wynikowe animacje porównywane są w testach oceny subiektywnej z ruchem rzeczywistych aktorów. Uzyskane wyniki wskazują na istotną poprawę jakości pomiędzy animacją zredukowaną do samych póz, a także finalną jakość zbliżoną do nieredukowanego oryginału. Opracowana metoda może znaleźć zastosowanie do automatycznego generowania wielu wersji uproszczonego ruchu. Ponadto system ANIMATOR dostarcza narzędzi intuicyjnej i efektywnej edycji ruchu uproszczonego, na bazie którego wygenerowane mogą być nowe akcje o wysokiej jakości subiektywnej.

Entry No. 311

Entry type journal paper

Authors P. Dalka, A. Czyżewski

English title Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition

Polish title Interfejs do komunikacji człowieka z komputerem wykorzystujący ruchy i gesty wykonywane ustami

Journal International Journal of Computer Science and Applications

Volume 7

Number 3

Pages 124 - 139

Abstract The multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in the lower part of the face region. Its position is used to track lip movements that allows a user to control a screen cursor. Three lip gestures are recognized: mouth opening, sticking out the tongue and forming puckered lips. Lip gesture recognition is performed by an artificial neural network and utilizes various image features of the lip region. An accurate lip shape is obtained by the means of lip image segmentation using fuzzy clustering.

Streszczenie Artykuł przedstawia interfejs komputerowy nazwany Ustomysz, który pozwala użytkownikowi na posługiwanie się komputerem jedynie za pomocą ruchów i gestów wykonywanych ustami. Szczegółowo opisano algorytmy śledzenia ruchów ust i rozpoznawania gestów. Obraz twarzy użytkownika jest pozyskiwany za pomocą zwykłej kamerki internetowej. Detekcja twarzy bazuje na kaskadzie klasyfikatorów AdaBoost. Obszar ust lokalizowany jest w dolnej części twarzy. Jego pozycja wykorzystywana jest do śledzenia ruchów ust, co pozwala użytkownikowi na sterowanie kursorem ekranowym. Rozpoznawane są trzy gesty wykonywane ustami: otwarcie ust, wysunięcie języka oraz złożenie ust w dzióbek. Gesty ust są klasyfikowane przez sieć neuronową, która bazuje na deskryptorach obrazu ust. Dokładny kształt ust uzyskuje się poprzez aproksymację ich kształtu za pomocą elipsy, wykorzystując grupowanie rozmyte.

Entry No. 312

Entry type conference paper

Authors Ł. Kulasek, B. Kunka, A. Czyżewski

English title FACE RECOGNITION BY HUMANS WITH GAZE-TRACKING SYSTEM CYBER-EYE

Polish title BADANIE ROZPOZNAWANIA TWARZY PRZEZ CZŁOWIEKA Z WYKORZYSTANIEM SYSTEMU ŚLEDZENIA FIKSACJI WZROKU CYBER-OKO

Conference New Trends in Audio and Video (NTiAV) 2010

Preprint

Number

Volume

Pages

Conference site Szczecin, Polska

Conference date 14.10.2010- 16.10.2010

Bibliographic No. 9

Notes materiały konferencyjne dostępne na płycie CD

Abstract In order to understand the way humans memorize and recognize faces, we conducted research experiments employing a group of 20 people using the previously prepared gaze-tracking system Cyber-Eye. Cyber-Eye’s dedicated software coupled with infrared diodes and a camera allow tracking sight focus on the screen. Every individual participating in the experiment was presented a few videos containing face images. Those videos were made separately for the two different stages: face memorizing and face recognition. Then, Cyber-Eye system rendered them with heat maps that presented the position of sight focus at every moment. The analysis of the videos showed which face regions are significant in recognizing and memorizing faces and in which order they are processed. The results of this paper can help to improve face recognition algorithms running on machines.

Streszczenie W celu dokładniejszego zrozumienia sposobu rozpoznawania i zapamiętywania twarzy przez człowieka przeprowadzono doświadczenie na grupie 20 osób z wykorzystaniem wcześniej opracowanego systemu śledzenia fiksacji wzroku Cyber-Oko [3]. Wykorzystując diody i kamerę podczerwieni wraz z dedykowanym oprogramowaniem Cyber-Oko, które pozwala na śledzenie punktu skupienia wzroku na ekranie. Każdej osobie biorącej udział w doświadczeniu pokazano plik filmowy zbudowany w oparciu o zdjęcia twarzy osób. Filmy wideo zostały przygotowane oddzielnie dla etapów rozpoznawania i zapamiętywania twarzy. Następnie system Cyber-Oko umożliwił połączenie ich z mapami ciepła przedstawiającymi pozycję skupienia wzroku w danej chwili. Analizując otrzymane w ten sposób filmy wideo udało się zaobserwować które regiony twarzy są znaczące przy rozpoznawaniu i zapamiętywaniu twarzy przez człowieka, oraz w jakiej kolejności są analizowane. Wyniki niniejszej pracy mogą pozwolić na ulepszenie algorytmów rozpoznawania twarzy.

Entry No. 313

Entry type book

Authors P Żwan, P. Sobala, P. Szczuko, A. Czyżewski

English title Audio Content Analysis in the Urban Area Telemonitoring System

Polish title Inteligentne usługi multimedialne: Analiza dźwięku w monitoringu miejskim

Editor Springer-Verlag Berlin Heidelberg

Pages 227 - 239

Notes rozdział w książce G.A. Tsihrintzis et al. (Eds.): Multimedia Services in Inteligent Environments

Abstract The digital sound processing is commonly used in many application fields. It can also be applied in the domain of the modern monitoring systems, since the recorded audio stream can contain a valuable information for security purposes. The additional audio analysis can be important especially when the video cameras are not able to observe the scene due to the improper lighting circumstances or when a suspicious objects is hidden for the view of a camera. Similarly, as in the case of the video analysis, the problem is the amount of data needed to be analyzed. In the case of audio stream, is a particular problem, since one person can’t monitor multi audio streams. Some automatic detection methods must be introduced in order to perform the automatic recognition of events which are non typical and can be related to a danger for citizens. The chapter presents the algorithm which automatically detects events like a broken window, a gun-shot and a scream. The crucial part of the algorithm is the modified spectral subtraction performed in order to diminish the influence of the significant and not stable background noise for the system performance. The resulting differential signal is parameterized and automatically recognized by the linear decision system.

Streszczenie Artykuł przedstawia możliwości rozwinięcie monitoringu miejskiego o automatyczną analizę dźwięku. Przedstawiono metody parametryzacji dźwięku, które możliwe są do zastosowania w takim systemie oraz omówiono aspekty techniczne implementacji. W kolejnej części przedstawiono system decyzyjny oparty na drzewach zastosowany w systemie. System ten rozpoznaje dźwięki niebezpieczne (strzał, rozbita szyba, krzyk) wśród dźwięków zarejestrowanych w warunkach miejskich (ruch uliczny). W drugiej części artykułu przedstawiono metody lokalizacji przestrzennej badanych dźwieków.

Entry No. 314

Entry type conference paper

Authors B. Kunka, R. Rybacki, K. Łopatka, A. Czyżewski, B. Kostek

English title VIRTUAL KEYBOARD CONTROLLED BY EYE GAZE EMPLOYING SPEECH SYNTHESIS

Polish title WIRTUALNA KLAWIATURA STEROWANA WZROKIEM, WYKORZYSTUJĄCA SYNTEZĘ MOWY

Conference New Trends in Audio and Video (NTiAV) 2010

Preprint

Number

Volume

Pages

Conference site Szczecin, Poland

Conference date 14.10.2010- 16.10.2010

Bibliographic No. 11

Notes materiały konferencyjne dostępne na płycie CD

Abstract The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents an algorithm of concatenative speech synthesis used in the engineered solution. Both modules of the system described were created by the Multimedia Systems Department. The work of the entire system was verified in real conditions. Conclusions focusing on the usefulness of this approach are provided.

Streszczenie W artykule przedstawiono zastosowanie syntezy mowy w zintegrowanym w systemie śledzenia punktu fiksacji wzroku. Takie podejście w znaczący sposób może przyczynić się do poprawy jakości życia osób niepełnosprawnych fizycznie, które nie mają możliwości komunikowania się. Interfejsem umożliwiającym wprowadzanie do syntetyzera mowy tekstu jest wirtualna klawiatura z rozkładem klawiszy QWERTY. W pierwszej części artykułu przedstawiono sposób wyznaczania punktu fiksacji wzroku na monitorze komputerowym za pomocą stworzonego w Katedrze Systemów Multimedialnych systemu o nazwie Cyber-Oko. W drugiej części zaprezentowano algorytm syntezy mowy konkatenacyjnej, który jest wykorzystywany w zaproponowanym rozwiązaniu. Sprecyzowano odpowiednie wnioski na temat użyteczności takiego podejścia oraz zweryfikowano pracę systemu w warunkach rzeczywistych.

Entry No. 315

Entry type conference paper

Authors M. Szczodrak, P. Dalka, A. Czyżewski

English title Performance Evaulation of Video Object Tracking Algorithm in Autonomous Surveillance System

Polish title Ocena skuteczności działania algorytmu śledzenia ruchomych obiektu w autonomicznym systemie monitoringu

Conference 2nd International Conference on Information Technology ICIT'2010

Preprint

Number 8

Volume 18

Pages 43 - 48

Conference site Gdańsk, Polska

Conference date 28.6.2010- 30.6.2010

Abstract Results of performance evaluation of a video object tracking algorithm are presented. The method of moving objects detection and tracking is based on background modelling with mixtures of Gaussians and Kalman filters. An emphasis is put on algorithm’s efficiency with regards to its settings. Utilized methods of performance evaluation based on comparison of algorithm output to manually prepared reference data are introduced. The experiments aimed at examining the performance achieved with various object detection algorithm parameter settings are presented and discussed.

Entry No. 316

Entry type conference paper

Authors J. Kotus, K. Kopaczewski, M. Szczodrak, A. Czyżewski

English title

Polish title Analiza zachowań tłumu w multimedialnym systemie bezpieczeństwa

Conference VI Sympozjum "Techniki Przetwarzania Obrazu"

Preprint

Number

Volume

Pages - 7

Conference site Serock, Polska

Conference date 18.11.2010- 20.11.2010

Streszczenie W niniejszym referacie zawarto opis metody detekcji zachowań tłumu na podstawie analizy obrazu. Koncepcja docelowego wykorzystania to wspomaganie pracy operatorów w systemach monitoringu, w szczególności podczas imprez masowych, np. na stadionach wyposażonych w wiele kamer. Celem opracowanej metody jest wykrywanie normalnych oraz potencjalnie niebezpiecznych zachowań tłumu, takich jak: panika, kierunkowy ruch masy ludzi, czy gromadzenie się. Schemat blokowy metody przedstawiony jest na rysunku 1. Można zauważyć dwie główne ścieżki przetwarzania. Pierwszą jest przekazanie parametrów pozyskanych w czasie analizy obrazów do modelu zachowań tłumu wykorzystującego metodę sił społecznych, a następnie jego symulacji i predykcji. Drugą ścieżką jest klasyfikacja zachowań tłumu i decyzja o wszczęciu alarmu w sytuacji nietypowej. Opisywana metoda bazuje na analizie ruchu punktów szczególnych w obrazie określonych za pomocą algorytmu Sparse Optical Flow. Proponowana metoda służy do uzyskania z obrazu informacji o dynamice tłumu. Klasyfikacji zachowań tłumu jest realizowana za pomocą sztucznej sieci neuronowej. W opracowywaniu przedstawionej metody wykorzystano zbiór wysokiej jakości nagrań zachowań tłumu wykonanych przez autorów na terenie Politechniki Gdańskiej. W procesie treningu algorytmów decyzyjnych wyodrębniono zbiory treningowe i weryfikacyjne.

Entry No. 317

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title Controlling Computer by Lip Gestures Employing Neural Network

Polish title Sterowanie komputerem za pomocą gestów ust z wykorzystaniem sieci neuronowych

Conference 7th International Conference on Rough Sets and Current Trends in Computing (RSCTC 2010)

Preprint

Number

Volume

Pages 80 - 89

Conference site Warszawa, Polska

Conference date 28.6.2010- 30.6.2010

Notes ISBN-10 3-642-13528-5

Abstract Results of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features. Lip region extraction is based on a lip shape approximation calculated by the means of lip image segmentation using fuzzy clustering. ANN is fed with a feature vector describing lip region appearance. The descriptors used include a luminance histogram, statistical moments and co-occurrence matrices statistical parameters. ANN is able to recognize with a good accuracy three lip gestures: mouth opening, sticking out the tongue and forming puckered lips.

Streszczenie Artykuł prezentuje eksperymenty z zakresu rozpoznawania gestów wykonywanych ustami przez sztuczną sieć neuronową. Moduł ten stanowi główny element multimodalnego interfejsu komputerowego zwanego Ustomysz. Rozwiązanie to pozwala na posługiwanie się komputerem jedynie za pomocą ruchów i gestów wykonywanych ustami. Twarz użytkownika jest wykrywana w strumieniu wizyjnym ze zwykłej kamery za pomocą kaskady klasyfikatorów AdaBoost. Obszar ust wyznaczany jest za pomocą aproksymacji kształtu ust za pomocą elipsy algorytmem bazującym na grupowaniu rozmytym. Sztuczna sieć neuronowa działa w oparciu o wektor parametrów opisujących wygląd ust. W jego skład wchodzą histogram luminancji, momenty statystyczne obrazu oraz parametry statystyczne macierzy współwystępowania. Sieć neuronowa jest w stanie rozpoznać z wysoką skutecznością trzy gesty wykonywane ustami: otwarcie ust, wysunięcie języka oraz złożenie ust w dzióbek.

Entry No. 318

Entry type conference paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title Long-term comparative evaluation of an acoustic climate in selected schools before and after the acoustic treatment

Polish title

Conference Noise Control 2010

Preprint

Number

Volume

Pages

Conference site Wałbrzych, Polska

Conference date 6.6.2010- 9.6.2010

Abstract Results of the long-term continuous noise measurement in two schools are presented in the paper. Noise characteristics are measured continuously at selected locations for approximately 16 months. The autonomous noise monitoring stations, engineered at the Multimedia Systems Department of Gdansk University of Technology are used for this purpose. 8 months since the beginning of the measurements, the acoustic treatment of the corridors has been done. A comparative evaluation of acoustic climate in selected schools before and after the acoustic treatment is performed based on these two periods of the continuous measurements. Investigations of measured noise, particularly its influence on hearing, based on spectrum analysis in critical bands are discussed. Effects of occupational noise exposure, including the Temporary Threshold Shift simulation, are determined. The results of the above discussed measurements correlated with the instantaneous noise levels are also presented.

Entry No. 319

Entry type conference paper

Authors M. Szczodrak, P. Dalka, A. Czyżewski

English title Moving object tracking algorithm evaluation in autonomous surveillance system

Polish title

Conference MCSS 2010: IEEE International Conference on Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages 219 - 223

Conference site Kraków, Polska

Conference date 6.5.2010- 7.5.2010

Notes ISBN 978-83-88309-92-2

Abstract Results of evaluation of video object tracking algorithm being a part of an autonomous surveillance system are presented. The algorithm was investigated employing a set of benchmarks recorded locally. The precision of object detection, evaluated with such metrics as fragmentation, object area recall and object precision, is in focus. The experiments aimed at examining the detection quality using various object detection algorithm parameter settings are described. The analysis of results of carried out experiments is included.

Entry No. 320

Entry type journal paper

Authors M. Lech, B. Kostek, A. Czyżewski, P. Odya

English title Gesture-based Computer Control System

Polish title System Sterowania Komputerem za Pomocą Gestów

Journal Elektronika - Konstrukcje, Technologie, Zastosowania

Volume

Number 3/2010

Pages 49

Abstract In the paper a system for controlling computer applications by hand gestures is presented. First, selected methods used for gesture recognition are described. The System hardware and a way of controlling a computer by gestures is presented. The architecture of the software along with hand gesture recognition methods and algorithms used is described. The set of basic gestures and, consisting of them, complex gestures recognized by the system is given.

Streszczenie W artykule przedstawiono System Sterowania Komputerem za Pomocą Gestów Rąk. W pierwszej części dokonano przeglądu wybranych metod rozpoznawania gestów. Następnie zaprezentowano część sprzętową Systemu oraz metodykę sterowania. Opisano również architekturę oprogramowania wraz z metodami i algorytmami zastosowanymi przy rozpoznawaniu gestów rąk. W dalszej części udostępniono zestaw prostych gestów oraz bazujących na nich gestów złożonych, rozpoznawanych przez system.

Entry No. 321

Entry type journal paper

Authors B. Kunka, B. Kostek, M. Kulesza, P. Szczuko, A. Czyżewski

English title Gaze-Tracking Based Audio-Visual Correlation Analysis Employing Quality of Experience Methodology

Polish title System sledzenia punktu fiksacji wzroku w badaniach korelacji sluchowo-wzrokowych uwzgledniajacych metodyke QoE

Journal Intelligent Decision Technologies (IDT) Journal

Volume

Number ISSN 1872-4981/10

Pages 217 - 227

Bibliographic No. 32

Notes DOI 10.3233/IDT-2010-0082

Abstract This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective tests carried out at the MSD showed a strong dependency between video presented in the screen and the perceived audio. It has also been shown that the application of gaze-tracking to the audio-visual correlation analysis allows for the objectivization of results obtained in subjective tests. Therefore this research study concentrates on the possibility to apply this methodology to the area of Quality of Experience.

Streszczenie W niniejszym artykule przedstawiono nowe podejście do badań korelacji wzrokowo-słuchowych z wykorzystaniem systemu śledzenia wzroku, opracowanego w Katedrze Systemów Multimedialnych (KSM) Politechniki Gdańskiej. Technika śledzenia wzroku wywodzącą się z technik HCI (ang. Human-Computer interactions) zostaje wykorzystana w nowym obszarze, jakim jest dziedzina Quality of Experience (QoE). Wyniki testów subiektywnych przeprowadzonych w KSM wskazują na silną zależność pomiędzy obrazem wizyjnym prezentowanym na ekranie a percypowanym bodźcem dźwiękowym. Pokazano również, że zastosowanie techniki śledzenia wzroku w analizie korelacji wzrokowo-słuchowych prowadzi do obiektywizacji wyników uzyskanych podczas testów subiektywnych.

Entry No. 322

Entry type journal paper

Authors A. Kupryjanow, K. Kaszuba, A. Czyżewski

English title INFLUENCE OF ACCELEROMETER SIGNAL PRE-PROCESSING AND CLASSIFICATION METHOD ON HUMAN ACTIVITY RECOGNITION

Polish title Wpływ przetwarzania wstępnego i wyboru metody klasyfikacji na skuteczność rozpoznawania aktywności ruchowych

Journal ELEKTRONIKA - KONSTRUKCJE, TECHNOLOGIE, ZASTOSOWANIA

Volume

Number 3/2010

Pages 18 - 23

Abstract A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy. In the test four methods of classification were used: support vector machine, decision trees, neural network, k-nearest neighbor.

Streszczenie W artykule przedstawiono wpływ przetwarzania wstępnego sygnału przyspieszenia na skutecznością rozpoznawania aktywności ruchowych. Przeanalizowano zależność filtracji sygnałów oraz ilości zastosowanych czujników na skuteczność klasyfikacji. W badaniach wykorzystano cztery różne klasyfikatory: maszynę wektorów wsparcia, drzewa decyzyjne, sztuczne sieci neuronowe oraz klasyfikator najbliższego sąsiada.

Entry No. 323

Entry type book

Authors P. Dalka, G. Szwoch, P. Szczuko, A. Czyżewski

English title Video Content Analysis in the Urban Area Telemonitoring System

Polish title Analiza materiału wideo w miejskim systemie telemonitoringu

Editor Springer-Verlag Berlin Heidelberg

Pages 241 - 261

Notes rozdział w książce G.A. Tsihrintzis et al. (Eds.): Multimedia Services in Inteligent Environments

Abstract The task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects of video signals processing dedicated to detection and monitoring of threats in urban areas. First the video analysis methods are presented, then recognition algorithms are introduced for detection of important events, and finally obtained results are discussed. The section 2. is dedicated to basic video analysis algorithms aimed at detection, tracking and classification of moving object appearing in a camera field of view. First in section 2.1 moving object detection is presented. It employs background modelling and subtraction for determination of non-stationary objects detection. Then the problem of objects shadows is discussed, and finally a stage of image segmentation is presented. In Sec. 2.2 moving object tracking is described. This process entails a necessity of solving many problems regarding object occlusions. An approach utilizing changes in time of object state described by Kalman filter is discussed, which concludes low level processing of visual media. An outcome of low-level processing serves as a input to higher-level analysis, where it is determined whether they are actually dangerous situations. The high-level video processing is presented in Sec. 3. It starts with object classification (Sec 3.1), and then event recognition is performed (Sec. 3.2). Modern video surveillance systems contain multiple cameras, covering a wide area, some of which are able to pan, tilt and zoom view area (PTZ cameras). A technique for positioning of PTZ cameras is presented in Sec. 3.3, allowing tracking of moving object.

Streszczenie Ciągłe monitorowanie strumieni wideo z wielu kamer i przeglądanie nagrań w celu odnalezienia zadanego zdarzenia jest zadaniem wymagającym dla operatora systemu monitoringu. Rozwiązaniem może być system automatycznej analizy, rozpoznający predefiniowane zdarzenia. W rozdziale przedstawiony jest przegląd najważnejszych zagadnień z dziedziny analizy obrazu ruchomoego w celu wykrywania i monitorowania zagrożeń w aglomeracji miejskiej.

Entry No. 324

Entry type book

Authors A. Czyżewski, J. Kotus

English title Automatic localization and continous tracking of mobile sound source using passive acoustic radar

Polish title

Editor Military University of Technology

Pages 441 - 453

Notes Concepts and Implementations for Innovative Military Communications and Information Technologies, ISBN 978-83-61486-70-1

Abstract A concept, practical realization and applications of the passive acoustic radar for localization and continuous tracking of fixed and mobile sound sources such as: cars, trucks, aircrafts and sources of shooting, explosions were presented in the paper. The device consists of the new kind of multi-channel miniature three dimensional sound intensity sensors invented by the Microflown company and a group of digital signal processing algorithms developed in the Multimedia System Department, Gdansk University of Technology. Contrarily to active radars, the passive acoustic radar does not emit any scanning beam but “listens to” surrounding sounds and in result it provides information about the directions of arriving acoustical waves. Hence, monitoring of the acoustic field in this way remains unnoticeable. For the sound source localization the two independent 3D sound intensity probes and triangulation technique were used. In order to increase accuracy of the sound source localization an additional algorithm of the resonant narrow-band filtration of acoustic signals was applied. The practical examinations of the sensitivity and accuracy of the developed PAR were conducted in an anechoic chamber and in typical reverberant conditions. The functionality and acoustic properties of the passive radar were examined in details using three types of signals for given environmental conditions: broadband sounds, pure-tones and impulsive sounds. Taking the obtained results of the realized experiments into consideration it was ascertained that even the inconsiderable value of the signal to noise ratio was sufficient to localize sound source suitably. The obtained measurement results including real sounds samples can be remotely sent to the control station. The passive radar can be operated both automatically as a stand-alone unit and in manual mode. The proposed technology can provide the operator with many essential data representing the activity of objects and targets in a given area. Moreover, the automatic and continuous tracking of the selected sound source movement is also possible. Additional procedures such as: sound source classification module or automatic control of the digital PTZ (Pan Tilt Zoom) camera can be used to extend the usefulness of the presented device.

Entry No. 325

Entry type conference paper

Authors P. Dalka, A. Czyżewski

English title Vehicle Classification Based on Soft Computing Algorithms

Polish title Klasyfikacja typu pojazdu za pomocą inteligentnych algorytmów decyzyjnych

Conference 7th International Conference on Rough Sets and Current Trends in Computing (RSCTC 2010)

Preprint

Number

Volume

Pages 70 - 79

Conference site Warszawa, Polska

Conference date 28.6.2010- 30.6.2010

Notes ISBN-10 3-642-13528-5

Abstract Experiments and results regarding vehicle type classification are presented. Three classes of vehicles are recognized: sedans, vans and trucks. The system uses a non-calibrated traffic camera, therefore no direct vehicle dimensions are used. Various vehicle descriptors are tested, including those based on vehicle mask only and those based on vehicle images. The latter ones employ Speeded Up Robust Features (SURF) and gradient images convolved with Gabor filters. Vehicle type is recognized with various classifiers: artificial neural network, K-nearest neighbors algorithm, decision tree and random forest.

Streszczenie Artykuł przedstawia eksperymenty i ich wyniki z zakresu klasyfikacji typu pojazdów. Trzy rodzaje pojazdów są rozróżniane: osobowe, samochody dostawcze/busy oraz samochody ciężarowe. System bazuje na kamerze z nieskalibrowanym polem widzenia, przez co nie wykorzystano bezpośrednio rozmiarów pojazdów. Przetestowano wiele deskryptorów obrazu pojazdów, w tym bazujące na tylko na kształcie pojazdu oraz takie bazujące na ich obrazie. Te ostanie wykorzystują lokalne deskryptory SURF oraz splot gradientu obrazu pojazdów z filtrami Gabora. Typ pojazdu jest klasyfikowany za pomocą sztucznych sieci neuronowych, algorytmu K najbliższych sąsiadów, drzewa decyzyjnego oraz lasu losowego (Random Forest)

Entry No. 326

Entry type conference paper

Authors J. Kotus, A. Czyżewski

English title AUTOMATYCZNA LOKALIZACJA ŹRÓDŁA DŹWIĘKU W OBECNOŚCI ZAKŁÓCEŃ Z WYKORZYSTANIEM WEKTOROWYCH CZUJNIKÓW AKUSTYCZNYCH

Polish title AUTOMATIC SOUND SOURCE LOCALIZATION IN DISTURBING CONDITIONS USING ACOUSTIC VECTOR SENSORS

Conference NOWOŚCI W TECHNICE AUDIO I WIDEO

Preprint

Number

Volume

Pages

Conference site Szczecin, Polska

Conference date 14.10.2010- 16.10.2010

Abstract A concept, practical realization and applications of a passive acoustic radar to automatic localization and tracking of sound sources in disturbing conditions were presented in the paper. The device consists of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. The sensitivity of the realized acoustic radar was examined in free sound field. Several kinds of sound signals were used, such as: pure tone from 125 to 16000 Hz, one third octave band noise in the same frequency range and impulsive sounds. As results from experiments, in some cases even the small value of the signal to noise ratio was sufficient to localize sound source correctly. A video PTZ (Pan Tilt Zoom) camera can be pointed automatically to the spot the detected acoustical source is localized.

Streszczenie W referacie przedstawiono pomysł i praktyczną realizację pasywnego radaru akustycznego do automatycznego lokalizowania i śledzenia źródeł dźwięku w warunkach zakłóceń. Urządzenie składa się z nowego typu wielokanałowych miniaturowych czujników natężeniowych oraz algorytmów cyfrowego przetwarzania sygnałów. Czułość radaru akustycznego została zbadana w warunkach pola swobodnego. Użyto sygnałów testowych takich jak: sygnały tonalne z zakresu od 125 do 16 kHz, szumowe (w tym samym zakresie częstotliwości) oraz o charakterze impulsowym. Uzyskane wyniki pomiarów wskazują, że nawet niewielka wartość stosunku sygnału do szumu była wystarczająca do poprawnego zlokalizowania źródła dźwięku. Informacja o kierunku dobiegania dźwięku może być zastosowana do automatycznego sterowania cyfrową kamerą typu PTZ (Pan Tilt Zoom).

Entry No. 327

Entry type journal paper

Authors J. Kotus, A. Czyżewski

English title Acoustic radar employing particle velocity sensors

Polish title

Journal Advances in Multimedia and Network Information System Technologies

Volume 80

Number

Pages 92 - 103

Abstract A concept, practical realization and applications of a passive acoustic radar to automatic localization, tracking of sound sources were presented in the paper. The device consist of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. Contrary to active radars, it does not emit the scanning beam but after receiving surroundings sounds it provide information about the directions of incoming acoustical signals. Practical examinations of the sensitivity and accuracy of the developed radar were also presented and discussed. The sensitivity of the realized acoustic radar was examined in free sound field. Several kinds of sound signals were used, such as: pure tone from 125 to 16000 Hz, one third octave band noise in the same frequency range and impulsive sounds. The obtained results for every kind of signal groups were presented and discussed. As results from experiments, in some cases even the small value of the signal to noise ratio was sufficient to localize sound source correctly. A video camera can be pointed automatically to the place the detected acoustical source is localized. Hence, the information about the sound event direction can be used to automatic and remote control of the PTZ (Pan Tilt Zoom) cameras. The automatic and continuous tracking in real time of the selected sound source movement is also possible. The proposed solution can significantly improve the functionality of the traditional surveillance monitoring systems.

Entry No. 328

Entry type conference paper

Authors J. Kotus, B. Kunka, A. Czyżewski, P. Szczuko, P. Dalka, R. Rybacki

English title Gaze-tracking and acoustic vector sensors technologies for PTZ camera steering and acoustic event detection

Polish title

Conference 1st International Workshop: Interactive Multimodal Pattern Recognition in Embedded Systems (IMPRESS 2010)

Preprint

Number

Volume

Pages 276 - 280

Conference site Bilbao, Hiszpania

Conference date 1.9.2010- 1.9.2010

Notes DEXA 2010

Abstract An innovative application of gaze-tracking and acoustic vector sensors (AVS) technologies for guidance of moving pan- tilt-zoom (PTZ) monitoring camera is presented. Gaze-tracking is used to steer and to zoom the camera to the gaze focus area. Additionally, it is combined with audio processing in two scenarios. First is called “audio slave”: directional acoustic monitoring is adjusted automatically to the camera direction. Second is called “audio master”: automatic detection of sound events directions is performed to take priority over user control and steer the camera towards sound source. An approach to gaze tracking is presented, utilizing new algorithmic methods for both image processing and PTZ camera steering. Then application of AVS for directional filtering of sound, and for detection of acoustic events direction is discussed. The implemented application is described, and user experience is reported. Finally, future work is discussed.

Streszczenie W artykule przedstawione zostało innowacyjne zastosowanie techniki śledzenia wzroku i sondy akustycznej (AVS) w sterowaniu obrotowymi kamerami (PTZ) monitoringu wizyjnego. Śledzenie wzroku zostało wykorzystane do obrotu kamery i zmiany jej ogniskowej w obszarze skupienia wzroku. Dodatkowo, zastosowano dwa podejścia związane z przetwarzaniem dźwięku. Pierwsze, nazwane "audio slave" oznacza, że monitoring akustyczny jest automatycznie dostosowany do kierunku patrzenia kamery. Drugie, nazwane "audio master" oznacza, że wykrycie kierunków zdarzeń akustycznych prowadzi do automatycznego przejęcia kontroli nad kamerą i ukierunkowanie jej na źródło dźwięku. W artykule omówiono również zastosowania kierunkowej filtracji dźwięku oraz wykrywania zdarzeń akustycznych. Przedstawiono zaimplementowaną aplikację oraz opisano wrażania użytkowników systemu. W zakończeniu zaproponowano etapy przyszłych prac.

Entry No. 329

Entry type journal paper

Authors A. Kupryjanow, B. Kunka, A. Czyżewski

English title VIRTUAL TOUCHPAD – VIDEO-BASED MULTIMODAL INTERFACE

Polish title Wirtualny TouchPad - interfejs multimodalny oparty na przetwarzaniu obrazu wizyjnego

Journal Zeszyty Naukowe Wydziału ETI PG

Volume

Number 8/2010

Pages 219 - 224

Notes Technologie Informacyjne TOM 19

Abstract A new computer interface named Virtual-Touchpad (VTP) is presented. The Virtual-Touchpad provides a multimodal interface which enables controlling computer applications by hand gestures captured with a typical webcam. The video stream is processed in the software layer of the interface. Hitherto existing video-based interfaces analyzing frames of hand gestures are presented. Then, the hardware configuration and software features of the Virtual-Touchpad are described.

Streszczenie W referacie przedstawiono interfejs multimodalny o nazwie Wirtualny Touchpad. Umożliwia on sterowanie aplikacjami komputerowymi za pomocą gestów dłoni, wyekstrahowanych z obrazów przechwytywanych w czasie rzeczywistym z kamery wizyjnej. Opisano konfigurację sprzętową oraz warstwę oprogramowania interfejsu. Warstwa oprogramowania przetwarza strumień wizyjny, dokonuje detekcji i klasyfikacji określonych gestów oraz interpretuje je w celu wykonania odpowiednich akcji.

Entry No. 330

Entry type journal paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title Long-term comparative evaluation of an acoustic climate in selected schools before and after the acoustic treatment

Polish title

Journal Archives of Acoustics

Volume 35

Number 4

Pages 551 - 564

Abstract The results of long-term continuous noise measurements in two selected schools are presented in the paper. Noise characteristics were measured continuously there for approximately 16 months. Measurements started eight months prior to the acoustic treatment of the school corridors of both schools. An evaluation of the acoustic climates in both schools, before and after the acoustic treatment, was performed based on comparison of these two periods of continuous measurements. The autonomous noise monitoring stations, engineered at the Multimedia Systems Department of the Gdansk University of Technology were used for this purpose. Investigations of measured noise, especially its influence on hearing sense, assessed on ground of spectral analyses in critical bands, is discussed. Effects of occupational noise exposure, including the Temporary Threshold Shift simulation, are determined. The correlation of the above said measurement results with respective instantaneous noise levels is discussed, and concluding remarks are presented. Some additional indicators such as air pollution or video analysis aiming at the analysis of corridor occupancy are also measured. It should be remembered that excessive noise, or air pollution may be evidence of a dangerous event and may pose health risks.

Entry No. 331

Entry type

Authors A. Czyżewski, B. Kostek, J. Kotus

English title

Polish title System do identyfikacji i zwalczania szumów usznych

Notes Patent przyznany przez UPRP w dniu 02.08.2019

Streszczenie Sposób identyfikacji i zwalczania szumów usznych zawierający etap pomiaru charakterystyk słuchowych oraz parametrów odczuwanego szumu usznego za pomocą urządzenia komputerowego oraz etap odtwarzania sygnałów dźwiękowych w przenośnym urządzeniu fonicznym charakteryzuje się tym, że zmienia się płynnie parametry sygnału testowego w czasie rzeczywistym przesuwając znacznik w dwuwymiarowym układzie współrzędnych na ekranie dotykowym (ED) połączonym z urządzeniem komputerowym (K) aż do uzyskania dopasowania generowanego na tej podstawie sygnału testowego do odczuwanego przez jego odbiorcę (OD) szumu usznego i w oparciu o uzyskane parametry ustala się widmo sygnału tłumienia odpowiadającego danemu szumowi usznemu, po czym tworzy się filtr komplementarny do tego widma i zgodnie z jego charakterystyką tłumi się lub wzmacnia wybrane pasma częstotliwości w sygnałach plików dźwiękowych odtwarzanych w przenośnym urządzeniu fonicznym (1). System składa się z urządzenia komputerowego oraz połączonego z nim za pomocą interfejsu komunikacyjnego przenośnego urządzenia fonicznego i charakteryzuje się tym, że urządzenie komputerowe (K) posiada ekran dotykowy (ED) oraz aplikacje programowe (A) do regulowania za ich pomocą sygnałów dźwiękowych z generatorów i filtrów sygnałów testowych (G, F), a przenośne urządzenie foniczne (1) wyposażone jest w procesor sygnałowy (5) z programowalnymi środkami filtracji (6) w bloku dekodera plików dźwiękowych (3).

Entry No. 332

Entry type report

Authors B. Kunka, R. Rybacki, A. Czyżewski, B. Kostek

English title

Polish title Opracowanie aplikacji–przeglądarki wraz z nakładką umożliwiającą interakcję systemu śledzenia wzroku z treścią strony (wstępne koncepcje)

Report Number Raport 1

Streszczenie Rozwój systemów służących do analizy punktów fiksacji wzroku użytkownika otwiera szerokie perspektywy zastosowania takich systemów w typowych zadaniach użytkowych takich jak korzystanie z aplikacji internetowych. System do śledzenia punktu fiksacji wzroku mógłby stanowić typowe rozszerzenia aplikacji przeglądarki internetowej i w ten sposób umożliwiać interakcję wzrokową z treścią strony. Rangowanie materiałów przy wyszukiwaniu mogłoby opierać się nie tylko na tradycyjnych policzalnych informacjach, takich jak liczba uruchomień materiału, liczba kliknięć, liczba ściągnięć, ale również na danych zgromadzonych przez analizę interakcji wzrokowej. Wpływ na ocenę atrakcyjności danego materiału miałby również np. czas fiksacji wzroku użytkowników na odpowiednich elementach strony.

Entry No. 333

Entry type journal paper

Authors M. Kulesza, A. Czyżewski

English title Frequency based criterion for distinguishing tonal and noisy spectral components

Polish title Kryterium częstotliwościowe pozwalające na klasyfikację komponentów widmowych na tonalne i szumowe

Journal Signal Processing: An International Journal (SPIJ)

Volume 4

Number 1

Pages 1 - 16

Abstract A frequency-based criterion for distinguishing tonal and noisy spectral components is proposed. For considered spectral local maximum two instantaneous frequency estimates are determined and the difference between them is used in order to verify whether component is noisy or tonal. Since one of the estimators was invented specially for this application its properties are deeply examined. The proposed criterion is applied to the stationary and nonstationary sinusoids in order to examine its efficiency.

Entry No. 334

Entry type journal paper

Authors P. Odya, A. Czyżewski, A. Grabkowska, M. Grabkowski

English title Smart Pen - new multimodal computer control tool for graphomotorical therapy

Polish title Inteligentny długopis - nowe narzędzie do terapii zaburzeń grafomotorycznych

Journal Intelligent Decision Technologies Journal

Volume 4

Number 3

Pages 197 - 209

Bibliographic No. 1872-4981

Abstract Numerous researches indicate that dyslexia and dysgraphia are nowadays major problems in schools. Smart Pen is a tool for supporting the therapy of developmental dyslexia, with particular regard to dysgraphia. It comprises a display monitor equipped with a high-sensitivity touchpad and specially designed writing tool equipped with pressure sensors. The paper put emphasis on issues related to the design of the device and the development of software providing a vital part of the interface. The software allows monitoring some interface parameters that are important from the therapy point of view, such as pen grip or time taken to complete an activity being a part of an exercise. The interface designed allows interesting (play and learn) activities to be performed with kids (i.e. learning of proper handling writing tools, basic writing etc). Tests have been carried out to verify usability of the Smart Pen. The test results showed that children and therapists are keen on using the new tool. Furthermore, using Smart Pen it is possible to distinguish children without writing problems from those who have some motoric disruptions. A description of the tests carried out and of their results is included in the paper.

Streszczenie Liczne badania wskazują, że dysleksja i dysgrafia są poważnym problemem utrudniającym edukację dzieci. Inteligentny długopis jest narzędziem wspomagającym terapię dzieci z dysleksją rozwojową. Składa się z monitora wyposażonego w wysokiej czułości tablet oraz specjalnie zaprojektowanego piórka wyposażonego w czujniki ścisku. Artykuł skupia się na zagadnienia związanych z projektowaniem urządzenia i tworzeniem oprogramowania, które pozwala na prowadzenie ćwiczeń w sposób atrakcyjny dla ucznia. Oprogramowanie umożliwia jednocześnie monitorowanie parametrów ważnych z punktu widzenia terapii, takich jak ścisk długopisu lub czas niezbędny do wykonania ćwiczenia. W celu sprawdzenia przydatności Inteligentnego długopisu przeprowadzono testy z udziałem dzieci. Stworzony w ramach projektu interfejs spotkał się z bardzo przychylną opinią terapeutów i dzieci. Testy pokazały, że przy pomocy interfejsu możliwe jest rozróżnienie dzieci, które nie mają problemów z prawidłowym pisaniem od tych, które mają zaburzenia grafomotoryczne. Opis testów oraz ich wyników również zawarto w artykule.

Entry No. 335

Entry type conference paper

Authors Ł. Kosikowski, P. Dalka, A. Czyżewski

English title Multimedia Browser Controlled by Head Movements

Polish title Przeglądarka multimediów sterowana ruchami głowy

Conference SIGGRAPH

Preprint

Number

Volume

Pages 1

Conference site Los Angeles, USA

Conference date 25.7.2010- 29.7.2010

Abstract A contactless multimedia content browser for personal computers is presented, where the user browses data using movements of his/her head only. The presented solution supports browsing static images, videos and music clips can be browsed subsequently and zoomed on demand. Video clips can be viewed and paused. Additionally, a user may fast-forward or rewind the content. The same functionality applies to listening audio files. Multimedia files are arranged in a multi-level, hierarchical structure. A user navigates through the structure and displays an element by moving the head up, down, left and right. Keeping the head in a tilted position for the longer time is also recognized. An action executed in the system depends on the type of the content a user is viewing (e.g. moving the head to the right selects the next picture or allows for fast-forwarding audio files). The content for the multimedia browser is chosen and organized with a separate configuration application that was also developed within the framework of the project. The application is especially suitable to standalone, multimedia terminals where users may get acquainted with a company or a store offer in the fast and convenient way. The application may also be used by disabled people.

Streszczenie Prezentowane rozwiązanie wspomaga przeglądanie obrazów statycznych, filmów i plków dźwiękowych z użyciem interfejsu komputerowego sterowania komuterem za pomocą ruchów głowy. Aplikacja może zostać wykorzystana szczególnie w kioskach multimedialnych do prezentowania oferty sklepów z treścią multimedialną w przystępny dla użytkownika sposób. Z systemu mogą korzystać także osoby niepełnosprawne.

Entry No. 336

Entry type conference paper

Authors A. Czyżewski, K. Łopatka, B. Kunka, R. Rybacki, B. Kostek

English title Speech synthesis controlled by eye gazing

Polish title Synteza mowy sterowana ruchami gałki ocznej

Conference 129th Convention of the Audio Engineering Society

Preprint 8165

Number

Volume

Pages

Conference site San Francisco, USA

Conference date 4.11.2010- 7.11.2010

Abstract A method of communication based on eye gaze controlling is presented. Investigations of using gaze tracking have been carried out in various context applications. The solution proposed in the paper could be referred to as "talking by eyes" providing an innovative approach in the domain of speech synthesis. The application proposed is dedicated to disabled people, especially to persons in a so-called locked-in syndrome who cannot talk and move any part of their body. The paper describes a methodology of determining the fixation point on a computer screen. Then it presents an algorithm of concatenative speech synthesis used in the solution engineered. An analysis of working with the system is provided. Conclusions focusing on system characteristics are included.

Streszczenie Przedstawiono metodę komunikacji za pomocą ruchu gałek ocznych. Zaprezentowane rozwiązanie można rozumieć jako "mówienie oczami". Stanowi ono innowacyjne podejście do syntezy mowy. Zastosowanie jest przygotowane z myślą o ludziach niepełnosprawnych ruchowo, zwłaszcza dla osób z syndromem zamknięcia, którzy są niezdolni do mówienia i poruszania jakimikolwiek częściami ciał poza oczami. W referacie opisano metodę wyznaczania punktu fiksacji wzroku na ekranie komputera. Przy pomocy fiksacji wzroku wprowadzane są znaki, które są przekazywane modułowi syntezy mowy. Przedstawiono analizy jakości pracy z systemem. We wnioskach skupiono się na cechach systemu.

Entry No. 337

Entry type report

Authors P. Szczuko, A. Czyżewski, G. Szwoch, P. Dalka, . et al

English title Creation of event model in order to detect dangerous events

Polish title

Report Number INDECT/D7.2

Abstract In the Deliverable a parametric model of event is proposed suitable for automatic analysis. The model comprises detected objects parameters and detected actions and interactions between these objects. An extensive set of parameters is presented and discussed, followed by outline of a methodology of event detection employing conditional rules. Automatic acquisition of objects parameters is facilitated with audio and video processing algorithms, which are developed within WP7 as well as in cooperating WP1. Systematic description of audio and video algorithms is here omitted as this Deliverable serves as a general road map for future research: focuses Work Package Partners on particular issues, presenting a research topics rather than complete solutions. Next deliverables, namely: D7.3 Biometric features analysis component based on video and image information (M23), D7.4 Biometric features analysis component based on audio information (M30), D7.5 Prototype of automatic event detection system (M37), and D7.7 Prototype of complex multimodal biometric features detection system (M52), will extend thoroughly the topics discussed here.

Entry No. 338

Entry type journal paper

Authors P. Żwan, A. Czyżewski

English title Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger

Polish title Weryfikacja metod parametryzacji w kontekście automatycznego rozpoznawania dźwięków związanych z niebezpieczeństwem

Journal Journal of Digital Forensic Practice

Volume 3

Number 1

Pages 33 - 45

Abstract Digital signal processing of the sound is a domain with numerous applications in the telecommunications and informatics. These well developed algorithms of the analysis of the sound can be also applied in the field of security systems, where the traditional monitoring is still based mainly on video cameras. The commonly used monitoring cameras can be equipped with additional microphones and the audio content can be analyzed by a monitoring program running on a dedicated hardware. This application can automatically detect in the audio stream events like a broken window, gun-shot, explosion or scream. One of the main parts of this system is a parameterization block. In the paper two parameterization methods are proposed for this purpose. First of them is based on the frequency analysis of the examples of the sound events. Second one is based on using a standardized set of audio MPEG-7 and cepstral descriptors. The feature vectors calculated by these two methods have been used for the training of two intelligent classifiers: a Support Vector Machines classifier (SVM) and a Neural Networks Perceptron (NNP). The classifiers have been verified by using of the cross validation method. The results have been compared and conclusion derived. The application of the results in a system working in real conditions is presented and discussed at the end of the paper. The work has been done in the frame of international project: “INDECT” (Intelligent Information System Supporting Observation, Searching and Detection for Security of Citizens in Urban Environment).

Streszczenie W artykule opisano aplikację, która automatycznie wykrywa zdarzenia dźwiękowe takie jak: rozbita szyba, wystrzał, wybuch i krzyk. Opisany system składa się z bloku parametryzacji i klasyfikatora. W artykule dokonano porównania parametrów dedykowanych dla tego zastosowania oraz standardowych deskryptorów MPEG-7. Porównano też dwa klasyfikatory: Jeden oparty o Percetron (sieci neuronowe) i drugi oparty o Maszynę wektorów wspierających. Dokonano porównań wyników i przedyskutowano możliwości praktycznego zastosowania systemu.

Entry No. 339

Entry type conference paper

Authors J. Kotus, K. Łopatka, K. Kopaczewski, A. Czyżewski

English title Automatic Audio-Visual Threat Detection

Polish title Automatyczna akustyczno-wizyjna detekcja zagrożeń

Conference MCSS 2010: IEEE International Conference on Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages 140 - 144

Conference site Kraków, Polska

Conference date 6.5.2010- 7.5.2010

Abstract The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of new kind multichannel miniature sound intensity sensors, digital Pan Tilt Zoom and fixed cameras and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy of real-time automatic detection and classification of hazardous situations.

Streszczenie W referacie zostały przedstawione koncepcja, realizacja praktyczna i zastosowanie systemu do automatycznej detekcji i klasyfikacji niebezpiecznych sytuacji na podstawie multimodalnej analizy danych foniczno-wizyjnych. System składa się z nowego rodzaju czujnika akustycznego - wielokanałowej sondy natężeniowej, cyfrowej kamery PTZ i kamery stacjonarnej wraz z odpowiednimi algorytmami przetwarzania sygnałów. Jednoczesna analiza obu modalności pozwala na automatyczną detekcję i klasyfikację zagrożeń w czasie rzeczywistym.

Entry No. 340

Entry type

Authors A. Czyżewski, B. Kostek, J. Kotus

English title

Polish title Układ do identyfikacji szumów usznych

Streszczenie W zgłoszeniu ujawniono układ do identyfikacji szumów usznych. Uzupełniono zastrzeżenia patentowe.

Entry No. 341

Entry type

Authors A. Czyżewski, B. Kostek, J. Kotus

English title

Polish title Układ do zwalczania szumów usznych

Notes Zgłoszenie wydzielone ze zgłoszenia P.393167

Streszczenie W zgłoszeniu ujawniono układ do identyfikacji szumów usznych. Zgłoszenie wydzielone ze zgłoszenia P.393167. Uzupełniono zastrzeżenia patentowe.

Entry No. 342

Entry type conference paper

Authors K. Łopatka, P. Żwan, A. Czyżewski

English title Parametrization of sounds for recognizing hazarodus events

Polish title Parametryzacja dźwieków w celu wykrywania zdarzeń niebezpiecznych

Conference VIII Krajowa Konferencja Technologie Informacyjne

Preprint

Number

Volume 19

Pages 225 - 230

Conference site Gdańsk, Polska

Conference date 28.6.2010- 30.6.2010

Abstract Modern surveillance systems employ both acoustic and video signal analysis for dangerous event detection. Calculation of parameters is the first stage of a sound recognition algorithm. The key to efficient sound classification is to define parameters, which accurately reflect the differences between recognized classes. A method for parametrization of sounds for recognizing hazardous sound events is presented. A set of 28 parameters is described, which contains dedicated signal features and MPEG-7 descriptors chosen on the basis of experiments and statistical analysis.. Methods for calculation of features are presented. A classifier using the described parameters is tested, yielding high accuracy results.

Streszczenie Nowoczesne systemy monitoringu działają na zasadzie automatycznego wykrywania niebezpiecznych zdarzeń na podstawie analizy obrazu z kamer i dźwięku z mikrofonów. W niniejszej publikacji skupiono się na pierwszym etapie rozpoznawania zdarzeń dźwiękowych, jakim jest parametryzacja dźwięku. Podstawą do skutecznego działania systemu jest znalezienie parametrów, których zmienność najlepiej odzwierciedla cechy charakterystyczne dźwięku związane ze zdarzeniami niebezpiecznymi. W tym celu stworzono zbiór 28 parametrów, w którym znajdują się parametry opisane w standardzie MPEG-7 i parametry zdefiniowane specjalnie dla tego zastosowania. Przedstawiono metody obliczania parametrów z postaci czasowej lub widmowej sygnału. Następnie zbiór ten został sprawdzony poprzez badanie skuteczności klasyfikacji przykładowych próbek dźwiękowych przy pomocy klasyfikatora opartego o maszynę wektorów wspierających (SVM).

Entry No. 343

Entry type conference paper

Authors K. Łopatka, P. Żwan, A. Czyżewski

English title Dangerous sound event recognition using Support Vector Machine Classifiers

Polish title Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem klasyfikatora SVM

Conference 7th International Conference on Multimedia & Network Information Systems (MISSI’10, Wrocław)

Preprint

Number

Volume 80

Pages 49 - 57

Conference site Wrocław, Polska

Conference date 23.9.2010- 24.9.2010

Notes Publikacja w "Advances in intelligent and soft computing" (Springer)

Abstract A method of recognizing events connected to danger based on their acoustic representation through Support Vector Machine classification is presented. The method proposed is particularly useful in an automatic surveillance system. The set of 28 parameters used in the classifier consists of dedicated parameters and MPEG-7 features. Methods for parameter calculation are presented, as well as a design of SVM model used for classification. The performance of the classifier was tested on a set of 372 example sounds, yielding high accuracy

Streszczenie Referat przedstawia metodę rozpoznawania zdarzeń akustycznych związanych z zagrożeniem poprzez zastosowanie do klasyfikacji algorytmu maszyny wektorów wspierających (SVM). Metoda znajduje zastosowanie w systemie automatycznego nadzoru bezpieczeństwa. Do klasyfikacji wykorzystano wektor 28 parametrów. Zaprezentowano metody obliczania cech sygnału oraz parametry modelu SVM. Działanie klasyfikatora zostało sprawdzone z użyciem zbioru 372 nagrań zdarzeń dźwiękowych, osiągając wysoką skuteczność.

Entry No. 344

Entry type conference paper

Authors D. Ellwart, A. Czyżewski

English title Camera angle invariant shape recognition in surveillance systems

Polish title Rozpoznawanie kształtów w systemach monitoringu wizyjnego niezależne od kąta obserwacji sceny

Conference KES 2010, The 3rd International Symposium on Intelligent and Interactive Multimedia: Systems and Services

Preprint

Number

Volume 6

Pages 33 - 40

Conference site Baltimore, USA

Conference date 28.7.2010- 30.7.2010

Notes DOI: 10.1007/978-3-642-14619-0_4; rozdział w książce G.A. Tsihrintzis et al. (Eds.): Intelligent Interactie Multimedia Systems and Services

Abstract A method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are talked over. Shape description methods are introduced along with their main features. Utilized parameterization algorithm is presented. Classification problem, restricted to binary cases is discussed. Support vector machine classifier scores are shown and additional step for improving classification is introduced. Obtained results are discussed and further research directions are discussed.

Streszczenie W pracy opisano metodę rozpoznawania zachowania osób w obrazach z systemów monitoringu. Omówiono trudności napotkane w trakcie realizacji i przedstawiono proponowane rozwiązania. Krótko opisano istniejące metody opisu kształtu. Zaprezentowano wykorzystaną metodę deskrypcji. Przedstawiono zastosowane podejście do problemu klasyfikacji z wykorzystaniem Maszyny Wektorów Wspierających. Opisano przeprowadzone eksperymenty oraz omówiono otrzymane rezultaty. Przedstawiono dalsze możliwe kierunki prac w celu zwiększenia skuteczności proponowanej metody.

Entry No. 345

Entry type conference paper

Authors Ł. Kosikowski, A. Czyżewski

English title Binocular Vision Impairments Therapy Supported By Contactless Eye-gaze Tracking System

Polish title

Conference 12th International Conference on Computers Helping People with Special Needs

Preprint

Number

Volume

Pages 373 - 376

Conference site Wiedeń, Austria

Conference date 14.7.2010- 16.7.2010

Notes Referat na konferencji. Publikacja w Lecture Notes i Computer Science 6180. Computers Helping People with Special Needs.

Abstract Binocular vision impairments often result in partial or total loss of stereoscopic vision. The lack of binocular vision is a serious vision impairment that deserves more attention. Very important result of the binocular vision impairments is a binocular depth perception. This paper describes also a concept of a measurement and therapy system for the binocular vision impairments by using eye-gaze tracking system.

Streszczenie Częściowa lub całkowita utrata widzenia streoskopowego często jest spowodowana przez zaburzenie widzenia obuocznego. Ubytek widzenia obuocznego jest bardzo poważną dolegliwością i wymaga zwiększonej uwagi. Bardzo ważną konsekwencją prawidłowego widzenia obuocznego jest percepcja odległości. Publikacja opisuje koncepcję badania i terapii widzenia obuocznego z wykorzystaniem systemu śledzenia punktu fiksacji wzroku.

Entry No. 346

Entry type book

Authors A. Czyżewski, J. Kotus, M. Szczodrak, B. Kostek, P. Dalka

English title

Polish title Laureaci konkursu - Cudze chwalicie, swego nie znacie - Promocja osiągnięć nauki polskiej

Editor Innovatio Press

Pages 85 - 102

Notes Nagroda Kapituły Konkursu - Nauki Techniczne - Program Operacyjny Kapitał Ludzki, IV, 4.2

Streszczenie Celem projektu Multimedialny System Monitorowania Hałasu zrealizowanego w Politechnice Gdańskiej było opracowanie teleinformatycznego systemu monitorowania klimatu akustycznego, uwzględniając w szczególnym stopniu obrazowanie wpływu zagrożeń hałasowych na słuch. Rozwiązania wcześniej dostępne na rynku cechują wysokie koszty oraz ograniczone możliwości rozbudowy o nowe funkcje analizy sygnału akustycznego, ograniczenia technologiczne w zakresie transmisji danych, brak rozwiązań systemowych pozwalających na dynamiczne modelowanie hałasu na dużych obszarach. Obecny system pomiarowy został zaprojektowany w taki sposób, aby można było pzy jego zastosowaniu skompensować powyżej wspomniane niedobory oraz by zapewniał on maksymalną funkcjonalność przy stosunkowo niskich kosztach powielania.

Entry No. 347

Entry type journal paper

Authors K. Łopatka, A. Czyżewski

English title Text-to-speech synthesizer employing automatic prosodic modification

Polish title Syntetyzer mowy uwzględniający prozodię wypowiedzi

Journal Zeszyty naukowe WE PG

Volume

Number 28

Pages 89 - 92

Abstract The paper presents a Text-To-Speech synthesizer of Polish language employing automatic prosodic modification. The method used for synthesizing the speech signal is concatenative synthesis using constant-length segments – diphones. The subsequent modules of the synthesizer are introduced. Employed language analysis and signal processing techniques are described. The synthesized speech yields high intelligibility and naturalness, which is proved by auditory tests. The proposed system can be used in educational and therapeutic applications or multimodal interfaces for disabled people.

Streszczenie Przedstawiono system syntezy mowy polskiej uwzględniający w sposób automatyczny prozodię wypowiedzi. Zastosowano syntezę konkatenacyjną z wykorzystaniem jednostek o stałej długości – difonów. Opisano poszczególne moduły wchodzące w skład syntetyzera: przetwarzanie tekstu, bazę jednostek mowy oraz algorytmy związane z tworzeniem syntetyzowanego sygnału. Przeprowadzono testy subiektywne potwierdzające wysoką zrozumiałość generowanej mowy i skuteczność modyfikacji prozodycznych. Przedstawiono możliwość zastosowania opisanego systemu w aplikacjach edukacyjnych lub terapeutycznych oraz interfejsach multimodalnych przeznaczonych dla osób niepełnosprawnych.

Entry No. 348

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Performance of Watermarking-based DTD Algorithm Under Time-varying Echo Path Conditions

Polish title

Conference KES 2010, The 3rd International Symposium on Intelligent and Interactive Multimedia: Systems and Services

Preprint

Number

Volume

Pages 69 - 79

Conference site Baltimore, USA

Conference date 28.7.2010- 30.7.2010

Notes rozdział w książce G.A. Tsihrintzis et al. (Eds.): Intelligent Interactie Multimedia Systems and Services

Abstract A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of acoustic echo path is discussed and explanation as to why such conditions occur in practical situations is provided. The environment and the procedure used for simulation of test conditions and evaluation of DTD algorithms are presented. Results of comparing performance of the introduced watermarking DTD with the well-established Geigel DTD algorithm are presented.

Entry No. 349

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title

Polish title Metody i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez uczniów z pogorszoną rozdzielczością czasową słuchu

Conference Politechnika Gdariska - uniwersytet przedsigbiorczy XXI wieku

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 24.5.2010- 25.5.2010

Notes Plakat

Streszczenie Szacuje się, że co najmniej u połowy dzieci z rozpoznanymi trudnościami w uczeniu się, dysleksją, zespołem zaburzeń uwagi i zachowania występują zaburzenia przetwarzania słuchowego. Zaburzenia te często objawiają się poprzez problemy z percepcją szybkiej mowy. Jak pokazują badania, modyfikacja czasu trwania mowy powoduje wzrost zrozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu. Opracowywane w ramach prac nad rozprawą doktorską algorytmy modyfikacja sygnału mowy, dzięki niewielkiej złożoności obliczeniowej a jednocześnie wysokiej jakości przetwarzania sygnału, staną się integralną częścią urządzenia służącego do wspomagania rozumienia mowy przez uczniów cierpiących na zaburzenia przetwarzania mowy związane z pogorszoną rozdzielczością słuchu.

Entry No. 350

Entry type conference paper

Authors Ł. Kulasek, J. Wolski, A. Czyżewski

English title 3D Morphable Models Application for Expanding Face Database Limited to Single Frontal Face Per Person

Polish title Zastosowanie morfowalnych modeli 3D do rozszerzenia bazy wzorców składającej się z pojedynczych ujęć frontalnych

Conference Image Processing & Communications 2010

Preprint Springer-Verlag

Number

Volume 2

Pages

Conference site Bydgoszcz, Polska

Conference date 20.10.2010- 22.10.2010

Abstract 1. Publication dealed with research on expanding posesed face database (limited to one frontal image of the identity) with novel samples with variables angles of acquisition. These novel samples were created in process of reconstructing 3D head model for given identity, based on available 2D sample. Both face texture and web of the face fiducial points were layed on the average 3D head model. Next, angle of acquisition was simulated and 2D sample was made again. The method proved to be successful. 2. The conerence was held by University of Technology and Life Sciences in Bydgoszcz. 3. The article was published in: "Advances in Soft Computing Series book", vol.2, by Springer - Verlag.

Streszczenie 1. Zaprezentowany materiał dotyczył badań nad rozszerzeniem dysponowanej bazy wzorców wizerunków twarzy, o dodatkowe wzorce z wariacją w ustawieniu. Dodatkowe wzorce były usyskiwane poprzez przejście z wizerunku twarzy 2D na model 3D, zasymulowanie zadanego ustawienia i powrót do dziedziny 2D (poprzez rzutowanie 3D->2D). W fazie konstrukcji modelu 3D, z wizerunku 2D była ściągana zarówno tekstura twarzy jak i siatka punktów charakterystycznych. Do rekonstrukcji wykorzystano uśredniony model głowy. Przydatność opracowanej metody została zweryfikowana pozytywnie. 2. Konferencja była zorganizowana przez Uniwersytet Technologiczno - Przyrodniczy w Bydgoszczy. 3. Prezentowany materiał został włączony do publikacji: "Advances in Soft Computing Series book", vol.2, wydawnictwo Springer - Verlag.

Entry No. 351

Entry type book

Authors N. T Nguyen, A. Zgrzywa, A. Czyżewski

English title Advances in Multimedia and Network Information System Technologies

Polish title Postępy w Dziedzinie Technologii Multimedialnych i Sieciowych Systemów Informacyjnych

Editor Springer-Verlag Berlin Heidelberg

Pages

Notes ISBN 978-3-642-14988-7

Abstract Preparing this book we have asked for cooperation many European research teams. In effect the monograph is a collection of carefully selected and the most representative - in our opinion - investigations, solutions, and applications presented by different scientific groups from nine countries. Content of the book has been divided into five parts: 1. Multimedia information technology 2. Data processing in information systems 3. Information system applications 4. Web systems and network technologies 5. E-learning methodologies and platforms

Streszczenie Słowa kluczowe: technologie multimedialne; sieciowe systemy informacyjne; przetwarzanie danych W procesie przygotowywania tej książki zaproszono do współpracy przedstawicieli licznych europejskich zespołów badawczych. W efekcie, opracowana monografia zawiera reprezentatywne wyniki badań dziewięciu zespołów badawczych. Zawartość książki dotyczy następujących zagadnień: 1. Multimedialne technologie informacyjne 2. Przetwarzanie danych w systemach informacyjnych 3. Zastosowania dotyczące systemów informacyjnych 4. Technologie sieciowe 5. Metody i platformy zdalnego nauczania

Entry No. 352

Entry type conference paper

Authors M. Kulesza, A. Czyżewski

English title A novel tonality estimation method of spectral components for audio coding applications

Polish title Nowa metoda estymowania tonalności składowych widmowych do zastosowań w kodowaniu sygnałów fonicznych

Conference 2nd Pan-American/Iberian Meeting on Acoustics, Acoustical Soc. of America

Preprint

Number

Volume

Pages

Conference site Cancun, Mexico

Conference date 15.11.2010- 19.11.2010

Abstract A novel algorithm providing adequate tonality estimates for constant and modulated sinusoidal components was developed. The proposed algorithm wascombined with the MPEG psychoacoustic model 2 in order to verify whether replacingthe standard tonality estimator with the proposed one leads to a more reliableestimate of hearing threshold. It was verified whether the proposed tonalityestimation algorithm may be used as a basis for detecting the signal bandscontaining only a noiselike component that can be encoded according to theperceptual noise substitution technique. Subsequently, it was proved thatit is possible to estimate the tonality of unmodulated or frequencymodulatedsinusoidal components of audio signals through the comparison of their instantaneousfrequency variations determined employing both an estimator processing spectralamplitude samples and an estimator processing spectral phase samples. It wasalso revealed that distortions introduced during perceptual audio coding maybe effectively limited employing the proposed tonality estimation algorithm.In order to fully explore the benefits of the proposed method for coding applications,the listening tests were performed in accordance with the ITUT BS.1534 recommendation.[Research partially funded by the Polish Ministry of Science and Higher Educationwithin Grants Nos. PBZMNiSW02II2007 and No. N N517 378736.]

Streszczenie Słowa kluczowe: kodowanie sygnałów fonicznych; estymacja widma; tonalność widmowa W wyniku prac badawczych opracowano nowy algorytm estymacji tonalności komponentów widma do zastosowania w systemach perceptualnego kodowania sygnałów fonicznych. Zaproponowana metoda opera się o analizę widma amplitudowego oraz fazowego sygnałów fonicznych. W kolejnych widmach amplitudowych dokonywana jest detekcja ich maksimów lokalnych. Na podstawie określonych kryteriów maksima lokalne wykryte w trzech kolejnych widmach wykorzystane są do utworzenia trójelementowych ciągów, stanowiących kandydatów do ścieżek tonalnych. Dla każdego z utworzonych ciągów określane są zmiany częstotliwości chwilowych z wykorzystanie dwóch różnych estymatorów. Pierwszy z estymatorów zmian częstotliwości chwilowej bazuje na analizie widma amplitudowego. Drugi z estymatorów, specjalnie opracowany do zastosowania w proponowanym algorytmie, bazuje na trzech kolejnych widmach fazowych sygnału. Dla każdego maksimum widma wchodzącego w skład trójelementowego ciągu dokonywana jest najpierw estymacja częstotliwości chwilowej z wykorzystaniem metody parabolicznej. Na podstawie wyników tej analizy określana jest zmiana częstotliwości chwilowej danego kandydata do ścieżki tonalnej. Następnie ta sama zmiana częstotliwości chwilowej określana jest z wykorzystaniem metody bazującej na analizie widma fazowego.

Entry No. 353

Entry type conference paper

Authors P. Suchomski, P. Odya, J. Kotus, A. Czyżewski

English title An Approach to Determining Tinnitus Acoustical Characteristic

Polish title Próba określenia parametrów akustycznych szumów usznych

Conference International Conference on Man-Machine Interactions

Preprint

Number

Volume

Pages 221 - 228

Conference site Gliwice, Polska

Conference date 6.10.2011- 9.10.2011

Notes Referaty konferencyjne wydane w Advances in Intelligent and Soft Computing

Abstract For many treatment methods, accurate estimation of Tinnitus(ringing in ears) concerning sound type, level, and bandwidth or frequency is inevitable. The proposed way of obtaining Tinnitus parameters is described in this paper. The method employs sound synthesis, aimed at obtaining sound which is closest to perceived Tinnitus. The proposed method assumes running a designed application on a multimedia PC provided with a special graphical user interface to facilitate sound generation and identification. Emphasis is put on issues related to the implementation of the proposed diagnostic procedure. The method was verified during preliminary tests in which people suering from Tinnitus participated. The obtained results are presented and discussed in this paper.

Streszczenie W przypadku wielu metod terapii szumów usznych dokładne określenie ich parametrów, takich jak rodzaj, poziom, a także pasmo czy częstotliwość odgrywa bardzo ważną rolę. Zaproponowana metoda pozwala na oszacowanie tych parametrów z wykorzystaniem syntezy dźwięku. Syntetyzowany dźwięk ma być jak najbardziej zbliżony do odczuwanego przez pacjenta szumu usznego. Opracowana aplikacja wymaga do pracy multimedialnego komputera PC. Najważniejszym elementem aplikacji jest specjalnie zaprojektowany interfejs użytkownika pozwalający na generowanie dźwięku w intuicyjny sposób. W pracy położono nacisk na kwestie związane z implementacją zaproponowanej procedury diagnostycznej. Opracowana aplikacja była przetestowana z udziałem osób cierpiących na szumy uszne. W pracy zaprezentowano i przedyskutowano uzyskane wyniki.

Entry No. 354

Entry type journal paper

Authors K. Łopatka, P. Suchomski, A. Czyżewski

English title Automatic prosodic modification in a Text-To-Speech synthesizer of Polish language

Polish title Automatyczna modyfikacja prozodii w syntetyzerze mowy polskiej

Journal Elektronika

Volume 52

Number 5

Pages 106 - 110

Abstract A Text-To-Speech synthesizer of Polish language with automatic prosodic modification is presented. The methods for automatic determination of accent and intonation are introduced. The application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. The impact of these modifications on the naturalness of the synthesized signal is discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.

Streszczenie Przedstawiono system syntezy mowy polskiej z funkcją automatycznej modyfikacji prozodii wypowiedzi. Opisane zostały metody automatycznego wyznaczania akcentu i intonacji wypowiedzi. Przedstawiono zastosowanie algorytmów przetwarzania sygnału mowy w procesie kształtowania prozodii. Omówiono wpływ zastosowanych modyfikacji na naturalność brzmienia syntezowanego sygnału. Zastosowana metoda oparta jest na algorytmie TD-PSOLA. Opracowany system syntezy mowy znajduje zastosowanie w aplikacjach wykorzystujących multimodalne interfejsy komputerowe.

Entry No. 355

Entry type conference paper

Authors Ł. Kosikowski, P. Dalka, P. Odya, A. Czyżewski

English title Multimedia Interface Using Head Movements Tracking

Polish title Komputerowy interfejs multimedialny wykorzystujący śledzenie ruchów głowy

Conference International Conference on Man-Machine Interactions

Preprint

Number

Volume

Pages 41 - 47

Conference site Gliwice, Polska

Conference date 6.10.2011- 9.10.2011

Notes Referaty konferencyjne wydane w Advances in Intelligent and Soft Computing

Abstract The presented solution supports innovative ways of manipulating computer multimedia content, such as: static images, videos and music clips and others that can be browsed subsequently. The system requires a standard web camera that captures images of the user face. The core of the system is formed by a head movement analyzing algorithm that finds a user face and tracks head movements in real time. Head movements are tracked with a Finite State Machine. State transitions are triggered by various spatial and temporal conditions. Whenever a state of the machine changes, an event is sent to the GUI application supposed to react accordingly. The system is immune to the presence of many faces in a video stream; only one face is tracked. The application is especially suitable to standalone, multimedia terminals where users may get acquainted with a company profile, situation layout or a store offer in a fast and convenient way. The application may also be used by disabled people.

Streszczenie Przedstawione rozwiązanie dotyczy innowacyjnego sposobu przeglądania zasobów multimedialnych, takich jak: statyczne obrazy, filmy, pliki muzyczne i inne. System wymaga standardowej kamery, która rejestruje obrazy twarzy użytkownika.Sercem systemu jest algorytm analizujący obraz, w którym wyszukuje twarz użytkownika i śledzi ruchy głowy w czasie rzeczywistym. Aplikacja została opracowana z myślą o terminalach multimedialnych, przy pomocy których użytkownicy mogą zapoznać się z profilem firmy, lub ofertą sklepu w sposób szybki i wygodny. Opracowane oprogramowanie może być również stosowane przez osoby niepełnosprawne.

Entry No. 356

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

Polish title Ocena jakości nowatorskiego algorytmu DTD opartego na znakowaniu sygnałów dźwiękowych

Conference International Conference on Signal Processing and Multimedia Applications SIGMAP 2011

Preprint

Number

Volume

Pages

Conference site Seville, Hiszpania

Conference date 18.7.2011- 21.7.2011

Notes http://sigmap.icete.org/Abstracts/2011/SIGMAP_2011_Abstracts.htm

Abstract Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation system is presented. The comparison of the proposed algorithm with very common, but simple Geigel algorithm and representing current state-of-the-art Normalized Cross-Correlation algorithms is performed. Both objective (ROC) and subjective (listening tests) performance evaluation methods are employed to obtain exhaustive evaluation results in simulated real-world conditions. The evaluation results are presented and their relevance is discussed. An issue of algorithms’ computational complexity is emphasized and conclusions are drawn.

Streszczenie Algorytmy eliminacji echa zwykle wykorzystują blok detekcji mowy równoczesnej (DTD) w celu zapobieżenia "rozstrojeniu" się filtra adaptacyjnego w obecności mowy pochodzącej od mówcy bliskiego lub innych zakłóceń w sygnale pochodzącym z mikrofonu. Autorzy zaproponowali nowatorski algorytm DTD oparty na technice zbliżonej do znakowania sygnałów dźwiękowych. Przedstawiono zastosowanie algorytmu w systemie eliminacji echa akustycznego. Przeprowadzono porównanie algorytmu z prostym, ale popularnym algorytmem DTD Geigela oraz współczesnym algorytmem NCC. Przeprowadzono testy obiektywne (ROC) oraz subiektywne (odsłuchowe) w celu uzyskania wyczerpujących danych porównawczych w symulowanych warunkach rzeczywistych. Przedstawiono wyniki badań i przedyskutowano ich znaczenie. Podkreślono znaczenie złożoności obliczeniowej algorytmów i przedstawiono wnioski.

Entry No. 357

Entry type journal paper

Authors A. Ciarkowski, A. Czyżewski

English title Communication Platform for Evaluation of Transmitted Speech Quality

Polish title Platforma komunikacyjna do oceny jakości transmitowanej mowy.

Journal Journal of Telecommunications and Information Technology

Volume 3

Number

Pages 95 - 101

Abstract A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing PESQ or PSQM algorithms. The functionality for the simulation of distortions typical for voice communication over the Internet was implemented, making it possible to obtain reproducible, quantifiable results. An application of the presented platform for evaluation of acoustic echo canceler algorithm based on watermarking techniques, which was developed earlier is presented as an example of an effective deployment of the described technology.

Streszczenie Opisano system komunikacji głosowej, który został zaprojektowany i wdrożony. Celem prezentowanej platformy było umożliwienie serii eksperymentów związanych z oceną jakości algorytmów wykorzystywanych w kodowaniu i transmisji mowy. System został wyekwipowany w narzędzia umożliwiające rejestrację sygnałów na każdym etapie przetwarzania, pozwalając na poddanie ich ocenie jakości za pomocą testów subiektywnych lub obiektywnych. Zaimplementowano funkcjonalność służącą do symulacji zniekształceń typowych dla komunikacji głosowej w sieci Internet, umożliwiając uzyskanie powtarzalnych, mierzalnych wyników badań. Przedstawiono zastosowanie omawianej platformy do oceny algorytmu AEC, który został opracowany przez autorów, jako przykład efektywnego wykorzystania systemu.

Entry No. 358

Entry type journal paper

Authors M. Kulesza, A. Czyżewski

English title Frequency based criterion for distinguishing tonal and noisy spectral components.

Polish title Kryterium częstotliwościowe do celu rozróżniania tonalnych i szumowych składowych mowy

Journal International Journal of Computer Science and Security

Volume 4

Number 1

Pages 1 - 16

Abstract A frequency-based criterion for distinguishing tonal and noisy spectral components is proposed. For considered spectral local maximum two instantaneous frequency estimates are determined and the difference between them is used in order to verify whether component is noisy or tonal. Since one of the estimators was invented specially for this application its properties are deeply examined. The proposed criterion is applied to the stationary and non-stationary sinusoids in order to examine its efficiency.

Streszczenie Słowa kluczowe: sygnał mowy; składowe tonalne; składowe szumowe W artykule zaproponowano nowe kryterium częstotliwościowe do celu rozróżniania tonalnych i szumowych składowych mowy. Dla lokalnego maksimum w reprezentacji widmowej obliczane są dwie estymaty chwilowe, zaś różnica pomiędzy nimi jest wykorzystywna do określania, czy dana składowa ma charakter tonalny, czy szumowy. Ponieważ jeden z tych estymatorów został zaproponowany specjalnie dla potrzeb tego zatsosowania, to jego właściwości zostały przebadane szczególnie dokładnie. Zaproponowane kryterium było stosowane w odniesieniu do zarówno sygnałów harmonicznych stacjonarnych, jak i niestacjonarnych.

Entry No. 359

Entry type journal paper

Authors M. Kotarski, J. Smulko, A. Czyżewski, S. Melkonyan

English title Fluctuation-enhanced scent sensing using a single gas sensor.

Polish title Usprawnienie detekcji zapachów z zastosowaniem pojedynczego sensora gazu.

Journal Sensors and Actuators B: Chemical

Volume

Number

Pages 1 - 7

Notes Elsevier B.V; SNB-12962

Abstract Scent or aroma sensing during aromatherapy can be carried out by applying only a single resistance gas sensor (TGS – Taguchi Gas Sensors). This paper considers the efficiency of detection of essential oils by DC resistance and its fluctuations observed in TGS sensors. A detailed study has been conducted for scents emitted by five popular essential oils using three sensor types (TGS 2600, TGS 2602, TGS 823). The research was focused on the practical use in aromatherapy to assure the same intensity of scents which are sprayed by a glass nebulizer. The prepared system for scent emission and control of its intensity was presented as well.

Streszczenie Słowa kluczowe: detekcja gazów; pomiar szumów; sensory gazowe; aromaterapia Zapach może być wykrywany w trakcie prowadzenia aromaterapii przy wykorzystaniu pojedynczego sensora gazowego (TGS – Taguchi Gas Sensors). W artykule skoncentrowano się na zagadnieniu detekcji wykrywania śladowych ilości olejków aromatycznych w powietrzu na podstawie obserwowania fluktuacji zmian rezystancji sensowrów gazowych różnych typów (TGS 2600, TGS 2602, TGS 823). Badania były nakierowane na zastosowania opracowanego urządzenia w aromaterapii.

Entry No. 360

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title Intelligent Multimedia Solutions Supporting Special Education Needs.

Polish title Inteligentne aplikacje multimedialne w zastosowaniach wspomagających kształcenie osób ze specjalnymi potrzebami edukacyjnymi

Conference 19th International Symposium, ISMIS 2011, Foundations of Intelligent Systems, LNAI 6804

Preprint

Number

Volume

Pages 1 - 15

Conference site Warszawa, Polska

Conference date 28.6.2011- 30.6.2011

Notes ISBN 978-3-642-21915-3

Abstract The role of computers in school education is brieﬂy discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality. Intelligent and adaptive algorithms application to the developed multimodal interfaces is discussed.

Streszczenie Słowa kluczowe: komputeryzacja procesów dydaktycznych; interfejsy multimodalne; wirtualna tablica szkolna; spowalnianie mowy Artykuł w pierwszej części poświęcony jest dyskusji na temat roli komputerów w dydaktyce szkolnej. Następnie opisana jest krótko historia rozwoju interfejsów multimodalnych, w szczególności do zastosowań w edukacji osób niepełnosprawnych na przykłądzie m. in. opracowanej wirtualnej tablicy szkolnej, aplikacji do sterowania komputerami za pomocą gestów wykonywanych ustami, systemu do spowalniania mowy nauczyciela. Na zakończenie omówiona jest rola algorytmów adaptacyjnych i uczących się w rozwiązaniach opracowanych interfejsów multimodalnych.

Entry No. 361

Entry type journal paper

Authors A. Czyżewski, B. Kostek

English title Intelligent video and audio applications for learning enhancement.

Polish title Inteligentne aplikacje wideofoniczne do celu wspomagania procesu edukacyjnego

Journal Journ. of Intelligent Information Systems

Volume

Number

Pages 1 - 20

Notes Springer; ISSN 0925-9902

Abstract The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality. Intelligent and adaptive algorithms applications to the developed multimodal interfaces are discussed.

Streszczenie Słowa kluczowe: komputeryzacja procesów dydaktycznych; interfejsy multimodalne; wirtualna tablica szkolna; spowalnianie mowy Artykuł rozpoczyna krótka dyskusja na temat roli komputerów w procesie kształcenia,. Następnie przedmiotem rozważań jest historaia rozwoju technologii interfejsów multimodalnych i przykłady ich zastosowań w edukacji osób niepełnosprawnych oparte na zastosowaniu wirtualnej tablicy interaktywnej, interfesu do sterowania komputerem na podstawie analizy gestów wykonywanych ustami oraz interfejsu fonicznego, który umożliwia rozciąganie mowy w czasie rzeczywistym. Na zakończenie artykułu podkreśloni znaczenie algorytmów inteligentnych i uczących się w tego rodzaju zastosowaniach.

Entry No. 362

Entry type conference paper

Authors A. Czyżewski, H. Skarżyński, B. Kostek

English title Telemedical hearing and vision screening system employing iOS based devices.

Polish title Telemedyczne przesiewowe systemy badania słuchu i wzroku na platformie iOS

Conference 6th National Conference of the Audiology and Phoniatrics Sections of the Polish Society of Oto-Rhino-Laryngologists and Head and Neck Surgeons

Preprint

Number 1

Volume 1

Pages 139

Conference site Warszawa, Polska

Conference date 22.6.2011- 25.6.2011

Notes abstracts (U-38) - Journal of Hearing Science, 2011; ISSN 2083-389X

Abstract A design and implementation of the hearing and vision screening system dedicated for the popular iOS (iPhone/iPad/iPod Operating System) based devices is presented. The aim of the system is to promote hearing and vision screening tests internationally and to analyze collected results. The examination consists of speech in noise and tone audiometry tests, color vision and contrast differentiation tests. Whenever a test is completed the system automatically evaluates user's answers and generates results.

Streszczenie W referacie przedstawiono projekt i impementację przesiewowych systemów badania słuchu i wzroku przeznaczonego do wykorzystywania na popularnej platformie iOS (iPhone/iPad/iPod Operating System). Celem opracowanego systemu jest promowanie komputerowego badania zmysłów komunikacji. Badanie opiera się na testowaniu rozumienia mowy w szumie, postzregania barw i różnicowania kontrastu. Za kazdym razem, gdy test zostaje wykonany system automatycznie przetwarzania wyniki badań i podaje badanemu do wiadomości diagnozę.

Entry No. 363

Entry type conference paper

Authors M. Kurkowski, A. Czyżewski, H. Skarżyński, K. Kochanek

English title Real-time speech stretching for diagnosing and supporting speech and hearing impaired patients.

Polish title Rozciąganie sygnału mowy w czasie rzeczywistym w celu wspomagania osób z zaburzeniami mowy i słuchu.

Conference 6th National Conference of the Audiology and Phoniatrics Sections of the Polish Society of Oto-Rhino-Laryngologists and Head and Neck Surgeons

Preprint

Number 1

Volume 1

Pages 141

Conference site Warszawa, Polska

Conference date 22.6.2011- 25.6.2011

Notes abstracts (U-42) - Journal of Hearing Science, 2011; ISSN 2083-389X

Abstract Modifying speech signal by stretching or shrinking it in the time-domain finds many interesting applications to adiology and speech therapy. A study of real time-scale modification algorithms applied to diagnosis and therapy of speech and hearing impaired patients including children and youth is presented. A variety of signal processing algorithms was considered, namely: the overlap-and-add and the phase vocoder. Their effectiveness as well as real-time processing capabilities were examined. The developed algorithm including an additional speech microstructure analysis was implemented to stationary and to mobile computer platforms. The digital speech time transposer device was engineered. Based on the performed test results it was shown that time stretching can influence speech understanding positively in children with hearing impairments.

Streszczenie Modyfikacje czasowe sygnału mowy w czasie rzeczywistym znajduje interesujące zastosowania w terapii logopedycznej i audiologicznej. Referat dotyczy badania zastosowań opracowanego algorytmu modyfikowania sklali czasowej mowy w diagnozowaniu i terapii dzieic i młodzieży mających problemy ze słyszeniem. Wzięto przy tym pod uwagę zróżnicowane algorytmy cyfrowego przetwarzania mowy, w tym: algorytm overlap-and-add i wokoder fazowy. Badano efektywność tych algorytmów i możliwośći ich stosowania dla czasu rzeczywistego. OPrcaowany algorytm analizuje strukturę sygnału mowy - został o zaimplementowany na komputerowej platformie mobilnej. N atej podstawie opracowano urządzneie - cyfrowy transpozer mowy. W oparciu o przeprowadzone badania testowe stwierdzono, że spowalnianie mowy w czasie rzeczywistym może wpływać na rozumienie mowy u dzieci z zaburzeniami słuchu.

Entry No. 364

Entry type journal paper

Authors M. Szwarc, A. Czyżewski

English title New approach to railway noise modeling employing Genetic Algorithms

Polish title

Journal Applied Acoustics

Volume 72

Number 8

Pages 611 - 622

Notes Elsevier; ISSN 0003-682X

Abstract Main goal of this paper was to describe an innovative method of noise prediction based on Genetic Algorithms. First part of the paper addresses the problem of growing noise, mainly in the context of a unified method for measuring noise. Further, Genetic Algorithms are described with regards to their fundamental features. Further a description is provided as to how Genetic Algorithms were used in the area of noise modeling. Next chapter shows results achieved with prototype software created for the purpose of research experiments. Finally, a practical technical application of this method is presented.

Streszczenie Głównym celem artykułui jest prezentacja innowacyjnej metodyki predykcji hałasu kolejowego opartej na algorytmie genetycznym. Pierwsza część artykułu dotyczy problematyki narastających zagrożeń hałasowych w kotekście potrzeby stworzenia uniwersalnej metody pomiaru hałasu. W dalszej części zostały w skrócie zaprezentowane algorytmy genetyczne w zastsowoaniu do modelowania hałasu. Następnie artykuł prezentuje eksperymenty z użyciem prototypowego oprogramowania do modelowania i przewidywania hałasu z użyciem zaproponowanej metody, która ma wiele zastosowań praktycznych.

Entry No. 365

Entry type journal paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak

English title Creating Acoustic Maps Employing Supercomputing Cluster

Polish title Tworzenie Map Akustycznych z Zastosowaniem Superkomputera

Journal Archives of Acoustics

Volume 36

Number 3

Pages 1 - 24

Abstract The implemented online urban noise pollution monitoring system is presented with regard to its conceptual assumptions and technical realization. A concept of the noise source parameters dynamic assessment is introduced. The idea of noise modeling, based on noise emission characteristics and emission simulations, was developer and practically utilized in the system. Furthermore, the working system architecture and the data acquisition scheme are described. The method for increasing the speed of noise map calculation employing a supercomputer is explained. The practical implementation of noise maps generation and visualization system is presented, together with introduced improvements in the domain of continuous noise monitoring and acoustic maps creation. Some results of tests performed using the system prototype are shown. The main focus is put on assessing the efficiency of the acoustic maps created with the discussed system, in comparison to results obtained with traditional methods.

Streszczenie W artykule przedstawiono opracowanie koncepcyjne i realizację praktyczną systemu monitorowania hałasu środowiskowego. Przedsawiono również koncepcję tworzenia dynamicznych map hałasu. Podstawowa idea polegająca na połączeniu rzeczywistych wyników pomiarów hałasu z wynikami modelowania została opracowana w ramach jednego systemu. W artykule przedstawiono również opis architektury systemu oraz modułów akwizycji danych pomiarowych. Wyjaśniono również metody zwiększania szybkości tworzenia map akustycznych z wykorzystaniem superkomputera. Przedstawiono praktyczne aspekty implementacji modułu obliczającego wartości poziomów hałasu na danym obszarze oraz modułu wizualizacji wyników w połączeniu z systemem ciągłego monitorowania hałasu. Przedstawiono wyniki badań uzyskane za pomocą prototypowego systemu. Główny nacisk położono na ocenę dokładności stworzonej mapy hałasu w oparciu o porównanie z wynikami uzyskanymi za pomocą metod tradycyjnych.

Entry No. 366

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title COMPARISON OF VARIOUS SPEECH TIME-SCALE MODIFICARTION METHODS

Polish title PORÓWNANIE METOD MODYFIKACJI CZASU TRWANIA SYGNAŁU MOWY

Conference 14th International Symposium on Sound Engineering and Tonmeistering

Preprint

Number 2

Volume 36

Pages 488 - 489

Conference site Wrocław, Polska

Conference date 19.5.2011- 21.5.2011

Bibliographic No. 12

Abstract The objective of this work is to investigate the influence of the different time-scale modification (TSM) methods on the quality of the speech stretched up using the designed non-uniform real-time speech time-scale modification algorithm (NU-RTSM). The algorithm provides a combination of the typical TSM algorithm with the vowels, consonants, stutter, transients and silence detectors. Based on the information about the content and on the estimated value of the rate of speech (ROS), the algorithm adapts to the scaling factor value, and removes the redundant signal i.e. silence, stutter and transients. TSM algorithms named: SOLA, PSOLA and WSOLA were examined in order to assess the quality of the stretched speech, the complexity of the calculation and the possibility of their usage in the NU-RTSM algorithm. Subjective tests were performed in order to compare the quality of various time-scaling meth-ods.

Streszczenie Streszczenie. Celem pracy jest zbadanie wpływu różnych metod modyfikacji czasu trwania sygna-łu mowy (ang. TSM -Time Scale Modification), na jakość mowy spowolnionej za pomocą algorytmu nierównomiernego spowalniania sygnału mowy w czasie rzeczywistym. Opracowany algorytm nie-równomiernego spowalniania sygnału jest połączeniem typowego algorytmu TSM, z detektorami: mowy, samogłosek, zająknięć oraz transjentów (głosek wybuchowych). Na podstawie informacji do-tyczących zawartości przetwarzanego sygnału (dostarczanej przez detektory) oraz estymowanego tempa przetwarzanej wypowiedzi, algorytm dobiera współczynnik spowalniania oraz usuwa nadmia-rową informację z sygnału t.j.: ciszę i zająknięcia. W celu dokonania oceny jakości spowolnionej mowy, złożoności obliczeniowej algorytmów oraz możliwości ich wykorzystania do nierównomier-nego spowalniania mowy w czasie rzeczywistym, przebadano następujące algorytmy: SOLA (ang. Synchronous Overlap and Add), WSOLA (ang. Wave Similarity Overlap and Add) oraz PSOLA (ang. Pitch-synchronous Overlap and Add). Ocena jakości mowy spowolnionej z wykorzystaniem wymie-nionych algorytmów, została uzyskana w wyniku testów subiektywnych.

Entry No. 367

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title A NON-UNIFORM REAL-TIME SPEECH TIME-SCALE STRETCHING METHOD

Polish title Metoda nierównomiernej modyfikacji czasu trwania sygnału mowy działająca w czasie rzeczywistym

Conference SIGMAP

Preprint

Number

Volume

Pages 1 - 7

Conference site Sewilla, Hiszpania

Conference date 18.7.2011- 21.7.2011

Bibliographic No. 16

Abstract An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were analysed. Subjective tests were performed in order to compare the quality of the proposed method with the output of the standard SOLA algorithm. Accuracy of the ROS estimation was assessed to prove its robustness.

Streszczenie W referacie przedstawiono algorytm nierównomiernej modyfikacji czasu trwania sygnału mowy działający w czasie rzeczywistym. Powstał on poprzez połączenie algorytmu SOLA (Synchronous Overlap and Add ) z detektorami samogłosek, spółgłosek i ciszy. Na podstawie informacji dotyczących zawartości analizowanego sygnału oraz estymowanego tempa mowy, proponowany algorytm dopasowuje wartości współczynnika skali. W części eksperymentalnej referatu zbadano możliwość przetwarzania sygnału w czasie rzeczywistym oraz jakość spowolnionego sygnału mowy. Jakości spowolnionej mowy została oceniona poprzez porównanie wyników testów subiektywnych przeprowadzonych dla mowy modyfikowanej z wykorzystanie proponowanego algorytmu oraz algorytmu SOLA. Zbadano także skuteczność algorytmu estymacji tempa mowy co pozwoliło wykazać jego odporność.

Entry No. 368

Entry type conference paper

Authors A. Czyżewski, B. Kostek, A. Kupryjanow

English title AUTOMATIC SOUND RESTORATION SYSTEM -CONCEPTS AND DESIGN

Polish title System automatycznej rekonstrukcji dźwięku – koncepcja i projekt

Conference SIGMAP

Preprint

Number

Volume

Pages 1 - 5

Conference site Sewilla, Hiszpania

Conference date 19.7.2011- 21.7.2011

Bibliographic No. 13

Abstract A concept of a system for automatic audio recording reconstruction is described. It is supported by the video image reconstruction algorithm, focused on the video instability analysis. Sound restoration is performed focusing on noise and wow and flutter analysis. Presented algorithms are designed to be automatic and to reduce the human effort during the restoration process. A web service designed especially for automatic restoration process is envisioned as an integration platform for these algorithms and for repository of recordings.

Streszczenie W referacie przedstawiono koncepcję systemu automatycznej rekonstrukcji nagrań fonicznych. Proces rekonstrukcji dźwięku jest wspierany przez zastosowanie analizy obrazu filmowego ukierunkowaną na śledzenie stabilności klatek filmowych. Z sygnału dźwiękowego usuwane są następujące zakłócenia: szerokopasmowy szum, drżenie i kołysanie dźwięku. Przedstawione algorytmy rekonstrukcji zastały opracowane tak by zminimalizować udział człowieka w procesie poprawy jakości dźwięku. W celu udostępnienia opracowanych mechanizmów rekonstrukcji oraz nagrań zgromadzonych w repozytorium zaproponowano specjalny serwis internetowy zoptymalizowany pod kątem pracy z nagraniami fonicznymi.

Entry No. 369

Entry type conference paper

Authors M. Papaj, A. Czyżewski

English title Facial features extraction for color, frontal images.

Polish title Ekstrakcja cech twarzy z kolorowych frontalnych obrazów

Conference 4th Image Pprocessing and Communications Challenges 3

Preprint

Number

Volume 102

Pages 23 - 31

Conference site Bydgoszcz, Polska

Conference date 5.9.2011- 7.9.2011

Notes series: Advances in intelligent and Soft Computing; ISBN 978-3-642-23153-7; Springer-Verlag,2011

Abstract The problem of facial characteristic features extraction is discussed. Several methods of features extraction for color en--face photographs are discussed. The methods are based mainly on the colors features related to the specific regions of the human face. The usefulness of presented methods was tested on a database of en--face photographs consisting of 100 photographs.

Streszczenie W referacie omówiony jest problem ekstrakcji cech twarzy. Przeanalizowanych zostało kilka metod ekstrakcji cech twarzy z kolorowych frontalnych obrazów twarzy, bazujących w większości na cechach kolorystycznych związanych ze specyficznymi regionami ludzkiej twarzy. Użyteczność zaprezentowanych metod została przetestowana na bazie danych zawierającej 100 frontalnych obrazów twarzy.

Entry No. 370

Entry type conference paper

Authors R. Kosikowski, Ł. Kosikowski, P. Odya, A. Czyżewski

English title SENSES - WHAT U SEE? Vision screening system dedicated for iOS based devices. Development and screening results.

Polish title SENSES - WHAT U SEE? System do przesiewowego badania wzroku dedykowany dla urządzeń mobilnych z systemem operacyjnym iOS. Wytwarzania i wyniki badań.

Conference International Conference on Signal Processing and Multimedia Applications

Preprint

Number

Volume

Pages

Conference site Sewilla, Hiszpania

Conference date 18.7.2011- 21.7.2011

Notes Publikacja: SciTePress Digital Library, oraz CD

Abstract This paper describes a design and implementation of the vision screening system dedicated for iOS (iPhone/iPad/iPod Operating System) based devices. The aim of the system is to promote and popularize the vision tests, especially among children and youth. The examination consists of color vision and contrast differentiation tests. After the examination the system automatically evaluates users’ answers and generates the results. Test data are anonymously sent to the server allowing for a detailed analysis. The paper contains analysis of the results on the population of about 3800 people. Presented data show that vision problems concern about half of users. The analysis was divided into two age groups (pre-school children and older) and two types of eye disorders - vision acuity and perceptions of colors including Dalton testing. Test for the first age group has been adapted to examine people with special educational needs.

Streszczenie W artykule opisano projekt i wdrożenie systemu do badań przesiewowych wzroku dedykowanego dla systemu operacyjnego iOS (iPhone / iPad / iPod). Celem aplikacji jest promocja i popularyzacja badań wzroku, zwłaszcza wśród dzieci i młodzieży. Egzamin składa się z widzenie kolorów i testów różnicowania kontrastu. Po dokonaniu analizy system automatycznie ocenia odpowiedzi użytkowników i generuje wyniki. Dane testowe są anonimowo wysyłane do serwera, co pozwala na szczegółową analizę zgromadzonych danych. Artykuł zawiera analizę wyników na populacji około 3800 osób. Przedstawione dane wskazują, że problemy z widzeniem dotyczą połowy użytkowników. Analiza została podzielona na dwie grupy wiekowe (dzieci w wieku przedszkolnym i starszych) oraz dwa rodzaje zaburzeń wzroku - ostrości widzenia i postrzeganie kolorów. Test dla pierwszej grupy wiekowej został dostosowany do badania osób ze specjalnymi potrzebami edukacyjnymi.

Entry No. 371

Entry type journal paper

Authors M. Szwarc, B. Kostek, J. Kotus, M. Szczodrak, A. Czyżewski

English title Problems of Railway Noise—A Case Study

Polish title Problemy z hałasem szynowym – studium przypadku

Journal International Journal of Occupational Safety and Ergonomics

Volume 17

Number 3

Pages 309 - 325

Abstract Under Directive 2002/49/EC relating to the assessment and management of environmental noise, all European countries are obliged to model their environmental noise levels in heavily populated areas. Some countries have their own national method, to predict noise but most have not created one yet. The recommendation for countries that do not have their own model is to use an interim method. The Dutch SRM II scheme is suggested for railways. In addition to the Dutch model, this paper describes and discusses 3 other national methods. Moreover, discrepancies between the HARMONOISE and IMAGINE projects are analysed. The results of rail traffic noise measurements are compared with national methods.

Streszczenie Zgodnie z postanowieniami zawartymi w dyrektywnie 2002/49/EC związanej z oceną i zarządzaniem hałasem w środowisku wszystkie kraje należące do UE są zobligowane do opracowania planów akustycznych na obszarach gęsto zaludnionych. Niektóre kraje mają opracowane narodowe metody obliczeniowe umożliwiające modelowanie hałasu lecz większość krajów nie posiada własnych opracowań. Dla tych krajów zaleca się tymczasowe stosowanie metody holenderskiej SRM II w odniesieniu do źródeł hałasu szynowego. W artykule przedstawiono, oprócz modelu holenderskiego, jeszcze analizę porównawczą dla 3 innych modeli narodowych. Ponadto, w opracowaniu różnic pomiędzy modelami uwzględniono również metodę opracowaną w ramach projektów HARMONOISE oraz IMAGINE. Wyniki pomiarów hałasu szynowego porównano z wynikami otrzymanymi za pomocą poszczególnych modeli narodowych.

Entry No. 372

Entry type conference paper

Authors M. Szczodrak, K. Kopaczewski, A. Czyżewski, H. Krawczyk

English title Repository of benchmark data and urban monitoring system aiding algorithms

Polish title Repozytorium nagrań testowych i algorytmy wspomagania systemów monitoringu przestrzeni publicznej

Conference VI Krajowa Konferencja Naukowa „INFOBAZY 2011 – Nauka, Projekty Europejskie, Społeczeństwo Informacyjne”

Preprint

Number

Volume

Pages 104 - 110

Conference site Gdańsk, Polska

Conference date 5.9.2011- 7.9.2011

Abstract The concept and implementation of benchmark video recordings repository for the purpose of evaluating of the video analysis algorithms. The aim of the completed work is gathering of audiovisual data comprising different types of crowd behavior with description which serves for verification of video analysis algorithms. The content of the recordings which consist of both typical and untypical behavior is thoroughly described. Metadata description methodology is introduced. The outcomes of experiment to assess the algorithm of untypical crowd behavior detection are discussed.

Streszczenie W referacie przedstawiono założenia i realizację repozytorium nagrań testowych dla potrzeb oceny algorytmów analizy obrazu. Celem wykonanej pracy jest zgromadzenie materiałów audiowizualnych zawierających różne rodzaje zachowań tłumu wraz z opisem służących do weryfikacji algorytmów analizy obrazu. Omówiono szczegółowo treść nagrań wprowadzonych do repozytorium, wśród których wyróżnić można zachowania typowe jak i nietypowe. Przedstawiono sposób opisu zgromadzonych materiałów meta danymi. Zaprezentowano schemat użycia repozytorium w dedykowanej platformie programistycznej.

Entry No. 373

Entry type conference paper

Authors M. Szczodrak, J. Kotus, K. Kopaczewski, K. Łopatka, A. Czyżewski, H. Krawczyk

English title Behavior Analysis and Dynamic Crowd Management in Video Surveillance System

Polish title Analiza zachowań i dynamiczne zarządzanie tłumem w systemie nadzoru wizyjnego

Conference 22nd International Workshop on Database and Expert Systems Applications (DEXA)

Preprint

Number

Volume

Pages 371 - 375

Conference site Toulouse, Francja

Conference date 29.8.2011- 2.9.2011

Notes doi.ieeecomputersociety.org/10.1109/DEXA.2011.16

Abstract A concept and practical implementation of a crowd management system which acquires input data by the set of monitoring cameras is presented. Two leading threads are considered. First concerns the crowd behavior analysis. Second thread focuses on detection of a hold-ups in the doorway. The optical flow combined with soft computing methods (neural network) is employed to evaluate the type of crowd behavior, and fuzzy logic aids detection of the hold-ups. The experiments with the behavior classification algorithm were performed employing prepared repository of typical and untypical behavior recordings. The effectiveness of the analysis was assessed by comparing algorithmic processing results to a set of prepared reference data, which provides a description of behavior type occurring in each video frame. Application of parallel image processing and influence of parallelization on achieved performance is explained. Apart from the crowd management the behavior analysis may be used in automatic surveillance system deployed in a city area.

Streszczenie Przedstawiono koncepcję i praktyczną realizację systemu wspomagającego zarządzanie tłumem, który pobiera informacje z kamer monitoringu. W referacie wyróżniono dwa główne wątki. Pierwszy dotyczy analizy zachowań tłumu, drugi przedstawia wykrywanie sytuacji blokowania się obszarów przy przejściach. Do określania zachowań tłumu używana jest metoda przepływu optycznego wraz z inteligentami metodami obliczeniowymi (sztuczna sieć neuronowa), a wykrywanie sytuacji blokowania obszarów przy wyjściach wykorzystuje logikę rozmytą. Przeprowadzono eksperymenty z wykorzystaniem stworzonego wcześniej repozytorium nagrań zachowań typowych i nietypowych. Zbadano efektywność algorytmów analizy przez porównanie ich wyników z zestawem danych referencyjnych. Przedstawiono zastosowany sposób równoległej analizy obrazu oraz wpływ zrównoleglenia na uzyskaną wydajność przetwarzania.

Entry No. 374

Entry type journal paper

Authors A. Czyzewski, J. Kotus, M. Szczodrak

English title Online Urban Acoustic Noise Monitoring System

Polish title System aktywnego monitoringu hałasu w algomeracji miejskiej

Journal Noise Control Engineering Journal

Volume

Number

Pages

Notes w recenzji

Abstract Concepts and implementation of the Online Urban Noise Monitoring System are presented. Principles of proposed solution used for dynamic acoustical maps creating are discussed. The architecture of the system and the data acquisition scheme are described. The concept of noise mapping, based on noise source model and propagation simulations, was developed and employed in the system. Dynamic estimation of noise source parameters utilized in the system is introduced. The details of implementation of noise map computation and visualization are presented. Advances introduced by the developed solution in the continuous noise monitoring and acoustic maps creation is in focus. The results of measurements and simulations performed by the system prototype are depicted. Noise measurements results gathered by system and created acoustic maps are compared with some other solutions in order to investigate accuracy.

Streszczenie W artykule przedstawiono koncepcję i praktyczną implementację internetowego systemu monitorowania hałasu. Przedyskutowano możliwości zastosowania proponowanych rozwiązań do tworzenia dynamicznych map hałasu. Opisano architekturę systemu oraz moduły akwizycji wyników pomiarowych. Opracowana koncepcja tworzenia map hałasu opiera się na wykorzystaniu modeli wybranych źródeł hałasu oraz części propagacyjnej. W systemie wprowadzono funkcjonalność umożliwiającą dynamiczną zmianę parametrów opisujących źródło hałasu. Ponadto przedstawiono szczegóły realizacji obliczeń i wizualizacji map hałasu. W dalszej części pracy przedstawiono wyniki długookresowych pomiarów hałasu oraz mapy hałasu uzyskane za pomocą opracowanych narzędzi. Przedstawiono również analizę dokładności uzyskanych wyników na podstawie porównania wyników modelowania opracowanym narzędziem oraz komercyjna aplikacją do sporządzania map hałasu.

Entry No. 375

Entry type journal paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title Resolving conflicts in object tracking for automatic detection of events in video

Polish title Rozwiązywanie konfliktów w śledzeniu obiektów w celu automatycznej detekcji zdarzeń w obrazie z kamer

Journal Elektronika

Volume 52

Number 1

Pages 52 - 55

Abstract An algorithm for resolving conflicts in tracking of moving objects is presented. The proposed approach utilizes predicted states calculated by Kalman filters for estimation of trackers position, then it uses color and texture descriptors in order to match moving objects with trackers. Problematic situations, such as splitting objects, are addressed. Test results are presented and discussed. The algorithm may be used in the system for automatic detection of security threats.

Streszczenie Artykuł przedstawia algorytm rozwiązywania konfliktów w śledzeniu obiektów ruchomych. Proponowane podejście wykorzystuje predykcję stanu dokonywaną przez filtry Kalmana do estymacji pozycji obiektów oraz wartości deskryptorów koloru i tekstury do celu dopasowania trackerów do obiektów. Problematyczne sytuacje, takie jak dzielenie obiektów, są również brane pod uwagę. Przedstawiono i omówiono wyniki testów. Algorytm może być wykorzystany w systemie automatycznego wykrywania zagrożeń bezpieczeństwa.

Entry No. 376

Entry type book

Authors A. Czyżewski, G. Szwoch, P. Dalka, P. Szczuko, A. Ciarkowski, D. Ellwart, T. Merta, K. Łopatka, Ł. Kulasek, J. Wolski

English title Multi-stage video analysis framework

Polish title Wielopoziomowy system do analizy obrazu z kamery

Editor Intech

Pages 145 - 171

Notes Rozdział w książce Video Surveillance, Ed. Weiyao Lin, Chapter 9, ISBN 978-953-307-436-8

Abstract The authors designed a framework that analyses camera images on multiple levels, from basic detection of moving objects to advanced object recognition and automatic detection of important events. The proposed system has a flexible structure, with functional modules that may be selected so that the system suits the need of a particular application. These modules are based on algorithms proposed by various authors, adapted to the needs of the presented framework and enhanced by the authors in order to provide an efficient solution for automatic detection of important security threats in video monitoring systems.

Streszczenie Autorzy: Andrzej Czyżewski, Grzegorz Szwoch, Piotr Dalka, Piotr Szczuko, Andrzej Ciarkowski, Damian Ellwart, Tomasz Merta, Kuba Łopatka, Łukasz Kulasek, Jędrzej Wolski WWW: http://www.intechopen.com/articles/show/title/multi-stage-video-analysis-framework Autorzy opisali system do wielopoziomowej analizy obrazu z kamery, od podstawowej detekcji obiektów ruchomych do zaawansowanego rozpoznawania obiektów i automatycznego wykrywania istotnych zdarzeń. Proponowany system cechuje się elastyczną strukturą, moduły funkcjonalne mogą być dobierane pod kątem danego zastosowania. Moduły są oparte na algorytmach proponowanych przez innych autorów, zaadaptowanych do potrzeb proponowanego systemu i rozwiniętych przez autorów w celu dostarczenia wydajnego rozwiązania do automatycznego wykrywania zagrożeń bezpieczeństwa w systemach monitoringu wizyjnego.

Entry No. 377

Entry type conference paper

Authors D. Ellwart, A. Czyżewski

English title Viewpoint independent shape-based object classification for video surveillance

Polish title Niezależna od kąta obserwacji sceny klasyfikacja obiektów w oparciu o ich kształt

Conference 12th International Workshop on Image Analysis for Multimedia Interactive Services

Preprint

Number

Volume

Pages

Conference site Delft, Holandia

Conference date 13.4.2011- 15.4.2011

Bibliographic No. 8

Notes ISBN: 978-94-90818-00-5

Abstract A method for shape based object classification is presented. Unlike object dimension based methods it does not require any system calibration techniques. A number of 3D object models are utilized as a source of training dataset for a specified camera orientation. Usage of the 3D models allows to perform the dataset creation process semiautomatically. The background subtraction method is used for the purpose of detecting moving objects and Kalman filters based method is utilized for object tracking. Detected objects are parameterized and then classified using a set of SVM classifiers. Probability of each classification attempt is calculated and averaged over object lifetime resulting in effectiveness improvement. The method classification efficiency is tested during experiments for two various camera angles and for two various feature vector lengths.

Streszczenie W niniejszej pracy zaprezentowano metodę klasyfikacji obiektów w oparciu o ich kształt. W przeciwieństwie do metod wykorzystujących rzeczywiste rozmiary obiektów, przedstawione podejście nie wymaga przeprowadzania kalibracji. W celu przygotowania danych treningowych dla opracowanego klasyfikatora, zbudowano zestaw modeli 3D. Na potrzeby detekcji obiektów w obrazach kamer zastosowano metodę odejmowania tła. Natomiast w celu śledzenia obiektów wykorzystano filtrację Kalmana. Wykryte w obrazie obiekty poddawane są parametryzacji, a następnie przeprowadzany zostaje proces klasyfikacji, w którym posłużono się Maszyną Wektorów Wspierających (SVM). W celu poprawy skuteczności zastosowano uśredniane wyników klasyfikacji i odpowiadających im prawdopodobieństw, w czasie istnienia obiektu w obserwowanym kadrze. Przeprowadzono eksperymenty dla różnych długości wektora parametrów opisujących kształt oraz dla dwóch różnych kątów elewacji kamery. Omówiono otrzymane rezultaty.

Entry No. 378

Entry type conference paper

Authors D. Ellwart, P. Szczuko, A. Czyżewski

English title Camera sabotage detection for surveillance systems

Polish title

Conference International Joint Conference Security and Intelligent Information Systems

Preprint

Number

Volume 7053

Pages

Conference site Warszawa, Polska

Conference date 13.7.2011- 14.7.2011

Bibliographic No. 6

Notes Druk w Lecture Notes in Computer Science Vol. 7053 - w chwili obecnej jeszcze niedostępny (15.12.2011)

Abstract Camera dysfunction detection algorithms and their utilization in realtime video surveillance systems are described. The purpose of using the proposed analysis is explained. Regarding image tampering three algorithms for focus loss, scene obstruction and camera displacement detection are implemented and presented. Features of each module are described and certain scenarios for best performance are depicted. Implemented solutions are evaluated as independent events and final results are discussed. A detection efficiency improvement method is proposed.

Streszczenie W dokumencie opisano algorytmy detekcji nieprawidłowości pracy kamer i możliwości ich wykorzystania w systemach monitoringu. Przedstawiono powody konieczności stosowania takich metod. Zaprezentowano trzy podstawowe algorytmy, detekcji utraty ostrości, przesłonięcia kadru oraz zmiany pola patrzenia kamery. Opisano cechy każdego z tych modułów oraz omówiono optymalne warunki pracy poszczególnych algorytmów. Przedstawiono i skomentowano wyniki otrzymane w trakcie eksperymentach. Zaproponowano metody zwiększenia skuteczności detekcji w ramach opisanych modułów.

Entry No. 379

Entry type conference paper

Authors K. Łopatka, A. Czyżewski, H. Krawczyk

English title Automatic recognition of events in audio data using supercomputer cluster

Polish title Automatyczne rozpoznawanie zdarzeń w strumieniu fonicznym z wykorzystaniem klastra superkomputerowego

Conference 130th Convention of the AES

Preprint 8377

Number

Volume

Pages

Conference site Londyn, Wielka Brytania

Conference date 13.5.2011- 16.5.2011

Abstract Dangerous events automatic recognition by audio analysis employing parallel processing on a supercomputer cluster is described in the paper. Sound files recorded by microphones operating in a security surveillance system are processed by a sound event detection and classification algorithm. Because of the large amount of data, parallel computation is employed to speed up the analysis. The sound file recorded by the surveillance system is divided into chunks and processed by separate threads or processes. Several strategies for such parallel computation are introduced and discussed. Results obtained in tests using a supercomputer cluster are presented.

Streszczenie W referacie przedstawiono metodę automatycznego rozpoznawania niebezpiecznych sytuacji na podstawie analizy dźwięku z wykorzystaniem klastra superkomputerowego. Dane dźwiękowe zarejestrowane przy użyciu mikrofonów środowiskowych są przetwarzane algorytmami detekcji i klasyfikacji zdarzeń dźwiękowych. Z powodu dużej ilości danych do przetworzenia zastosowano równoległe przetwarzanie strumieni danych. Dane są dzielone na porcje i poddawane równoległej analizie na klastrze. Omówiono kilka strategii zrównoleglenia obliczeń. Strategie te zostały porównane i poddane ocenie.

Entry No. 380

Entry type conference paper

Authors K. Łopatka, J. Kotus, A. Czyżewski

English title Monitoring of public events audience employing acoustic vector sensors

Polish title Nadzorowanie publiczności wydarzeń masowych z wykorzystaniem wektorowych czujników akustycznych

Conference 14th International Symposium on Sound Engineering and Tonmeistering

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 19.5.2011- 21.5.2011

Notes materiały zamieszczone na płycie CD, rozszerzony referat opublikowano w "Archives of Acoustics"

Abstract A method for localization of sounds in the audience of public events is presented. Acoustic vector sensors, which provide multichannel output signals of acoustic pressure and particle velocity are employed. The methods for detection of acoustic events are introduced. The algorithm for localizing the sound event in the audience is presented. The system set up in a lecture hall, which serves as a demonstration of the proposed technology, is described. The accuracy of the proposed method is evaluated by the described measurements. The analysis of the results is followed by conclusions about the usability of the system. The concept of the multimodal audio-visual detection of events in the audience is also introduced.

Streszczenie Przedstawiono metodę lokalizacji źródeł dźwięku na widowni masowych wydarzeń. Zastosowano wektorowe czujniki akustyczne, dostarczające wielokanałowego sygnału ciśnienia i prędkości cząsteczek powietrza. Opisano algorytm lokalizacji źródeł dźwięku w pomieszczeniu. Jako demonstrator proponowanej technologii przedstawiono system działający w sali wykładowej. Dokładność systemu zbadano, przeprowadzając serię pomiarów. Referat podsumowuje analiza wyników i dyskusja nad przyczynami niedokładności systemu. Przedstawiono również koncepcję multimodalnej wizyjno-akustycznej detekcji zdarzeń na widowni.

Entry No. 381

Entry type conference paper

Authors K. Łopatka, J. Kotus, M. Szczodrak, P. Marcinkowski, A. Korzeniewski, A. Czyżewski

English title Multimodal Audio-Visual Recognition of Traffic Events

Polish title Multimodalne wizyjno-akustyczne rozpoznawanie zdarzeń w ruchu drogowym

Conference 22nd International Workshop on Database and Expert Systems Applications (DEXA)

Preprint

Number

Volume

Pages 376 - 380

Conference site Tuluza, Francja

Conference date 29.8.2011- 2.9.2011

Abstract A demonstrator of traffic events detector based on the multimodal analysis of audio and video signals is described. The subsystem is a part of smart surveillance applications. It uses surveillance cameras and microphones as the data source. The algorithms employed to the analysis of data - sound event recognition and video analytics, are explained. Results of the multimodal analysis of recordings of dangerous and passive traffic situations are presented and discussed.

Streszczenie Przedstawiono demonstrator systemu wykrywania niebezpiecznych zdarzeń w ruchu drogowym oparty na jednoczesnej analizie danych wizyjnych i akustycznych. System jest częścią systemu automatycznego nadzoru bezpieczeństwa. Wykorzystuje on kamery i mikrofony jako źródła danych. Przedstawiono wykorzystane algorytmy - algorytmy rozpoznawania zdarzeń dźwiękowych oraz analizy obrazu. Zaprezentowano wyniki działania algorytmów na przykładzie zarejestrowanych symulowanych zdarzeń.

Entry No. 382

Entry type conference paper

Authors D. Ellwart, A. Czyżewski, P. Darminio

English title Automatic Data relevancy Discrimination for a PRIVacy-sensitive video surveillance

Polish title Automatyczna dyskryminacja danych prywatnych w systemach monitoringu wizyjnego

Conference Summit on the new technologies for Urban Security EuroMed

Preprint

Number

Volume

Pages

Conference site Genua, Włochy

Conference date 20.5.2011- 21.5.2011

Notes Plakat

Abstract The poster presents ADDPRIV project overview. Main goals and assumptions of the reasearch are introduced. The developed system logical design is described and the expected results are shown.

Streszczenie Na plakacie przedstawiono opis zagadnień projektu ADDPRIV. Pokazano główne założenia i cele badawcze. Zaprezentowano logiczny schemat działania opracowywanego systemu oraz przedstawiono spodziewane rezultaty projektu.

Entry No. 383

Entry type journal paper

Authors M. Lech, B. Kostek, A. Czyżewski

English title Recognition of Dynamic and Static Hand Gestures Applied to Computer Application Controlling

Polish title Rozpoznawanie dynamicznych i statycznych gestów rąk w zastosowaniu do sterowania aplikacjami komputerowymi

Journal Zeszyty Naukowe Wydziału ETI PG

Volume 1

Number 10

Pages 177 - 186

Notes Seria: Wytwarzanie Gier Komputerowych.

Abstract In the paper an interface, methods and algorithms of controlling a computer by dynamic and static hand gestures have been presented. The solution consists of a PC on which engineered software is installed, a webcam and a multimedia projector. Gestures are recognized based on analysis of a video stream obtained from the webcam attached to the multimedia projector and on analysis of video stream displayed by the projector (retrieved from the computer). For the purpose of dynamic gestures recognition motion trajectory has been modeled by fuzzy rules. Static gestures are recognized using Support Vector Machines. In the paper results of efficiency examination of the interface engineered have been given. In conclusions, further plans to extend the system with algorithms enabling to work with the camera placed front-faced have been presented.

Streszczenie W referacie przedstawiono interfejs, metody oraz algorytmy sterowania komputerem za pomocą dynamicznych i statycznych gestów rąk. Komponentami opracowanego rozwiązania są komputer klasy PC wraz z opracowanym interfejsem i oprogramowaniem, kamera internetowa oraz projektor multimedialny. Gesty rozpoznawane są w procesie analizy obrazu wizyjnego pozyskanego z kamery internetowej przymocowanej do projektora oraz analizy obrazu wyświetlanego przez projektor (pozyskanego z komputera). Do rozpoznawania gestów dynamicznych zastosowano modelowanie trajektorii ruchu rąk za pomocą reguł logiki rozmytej. Gesty statyczne rozpoznawane są za pomocą maszyn wektorów nośnych (SVM). W referacie przedstawiono wyniki badania wydajności opracowanego systemu. We wnioskach przedstawiono również plany dotyczące rozbudowy systemu o algorytmy umożliwiające pracę z kamerą umieszczoną przed użytkownikiem.

Entry No. 384

Entry type conference paper

Authors B. Kostek, B. Kunka, A. Czyżewski

English title Analysis of Video Accompanied by Audio Employing Gaze Tracking

Polish title Analiza treści wizyjnej powiązanej z dźwiękiem z wykorzystaniem techniki śledzenia wzroku

Conference Seventh Symposium on Computational Aesthetics in Graphics, Visualization, and Imaging, 2011

Preprint

Number

Volume ACM Press. ISBN 978-

Pages

Conference site Vancouver, Kanada

Conference date 5.8.2011- 7.8.2011

Notes Plakat

Abstract The objectivization process of carrying out correlation tests in the audio-visual domains employing gaze-tracking system was outlined. The reliability of tested subjects was checked with the statistical analysis of test results. Comparing outcomes of the dynamic heat maps generated by the gaze tracking system with the associated movie samples, it was observed that the localization of the fixation point representing view direction is directly related to the localization of the virtual sound source in the stereo phantom basis. Experiments performed show that visual objects attract the viewers’ attention, thus sound sources perceived seem to be localized closer, centrally to the screen. It was also possible to analyze whether the subject’s attention remains stable.

Streszczenie W referacie przedstawiono proces obiektywizacji badań w dziedzinie korelacji wzrokowo-słuchowych z wykorzystaniem systemu śledzenia punktu fiksacji wzroku. Uzyskane wyniki świadczą o możliwości prowadzenia tego typu testów w oparciu o system śledzenia punktu fiksacji wzroku. System ten pozwala na badanie uwagi osób testowanych, a także odpowiada na pytanie, w jaki sposób wyświetlane obiekty wizualne wpływają na percepcję dźwięku towarzyszącemu obrazowi. Dziedzina: percepcja, psychoakustyka

Entry No. 385

Entry type journal paper

Authors J. Kotus, A. Czyżewski

English title AUTOMATIC SOUND SOURCE LOCALIZATION IN DISTURBING CONDITIONS USING ACOUSTIC VECTOR SENSORS

Polish title AUTOMATYCZNA LOKALIZACJA ŹRÓDŁA DŹWIĘKU W OBECNOŚCI ZAKŁÓCEŃ Z WYKORZYSTANIEM WEKTOROWYCH CZUJNIKÓW AKUSTYCZNYCH

Journal Elektronika

Volume

Number 1

Pages 36 - 38

Abstract A concept, practical realization and applications of a passive acoustic radar to automatic localization and tracking of sound sources in disturbing conditions were presented in the paper. The device consists of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. The sensitivity of the realized acoustic radar was examined in free sound field. Several kinds of sound signals were used, such as: pure tone from 125 to 16000 Hz, one third octave band noise in the same frequency range and impulsive sounds. As results from experiments, in some cases even the small value of the signal to noise ratio was sufficient to localize sound source correctly. A video PTZ (Pan Tilt Zoom) camera can be pointed automatically to the spot the detected acoustical source is localized.

Streszczenie W referacie przedstawiono pomysł i praktyczną realizację pasywnego radaru akustycznego do automatycznego lokalizowania i śledzenia źródeł dźwięku w warunkach zakłóceń. Urządzenie składa się z nowego typu wielokanałowych miniaturowych czujników natężeniowych oraz algorytmów cyfrowego przetwarzania sygnałów. Czułość radaru akustycznego została zbadana w warunkach pola swobodnego. Użyto sygnałów testowych takich jak: sygnały tonalne z zakresu od 125 do 16 kHz, szumowe (w tym samym zakresie częstotliwości) oraz o charakterze impulsowym. Uzyskane wyniki pomiarów wskazują, że nawet niewielka wartość stosunku sygnału do szumu była wystarczająca do poprawnego zlokalizowania źródła dźwięku. Informacja o kierunku dobiegania dźwięku może być zastosowana do automatycznego sterowania cyfrową kamerą typu PTZ (Pan Tilt Zoom).

Entry No. 386

Entry type conference paper

Authors J. Kotus, K. Łopatka, A. Czyżewski

English title Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications

Polish title Detekcja i lokalizacja wybranych zdarzeń akustycznych w polu akustycznym dla zastosowań w systemach monitorowania bezpieczeństwa

Conference Multimedia Communications, Services and Security, Communications in Computer and Information Science

Preprint

Number 149

Volume

Pages 55 - 63

Conference site Kraków, Polska

Conference date 2.6.2011- 3.6.2011

Abstract A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals from the multichannel acoustic vector probe is performed upon the detection. The described algorithms can be useful in a surveillance systems to monitor the behavior of participants of public events. The results can be used to detect the position of sound source in real time or to calculate the spatial distribution of sounds in the environment. Moreover, spatial filtration can be performed to separate the sounds coming from the chosen direction.

Streszczenie W artykule przedstawiono metodę automatycznego określania położenia źródła dźwięku w przestrzeni 3D w oparciu o analizę wybranych zdarzeń akustycznych takich jak mowa lub sygnały o charakterze impulsowym. Źródło dźwięku jest lokalizowane w rzeczywistych warunkach akustycznych (liczne odbicia) w oparciu o analizę zdarzenia akustycznego za pomocą wektorowego czujnika akustycznego. Głos człowieka i impulsowe dźwięki są wykrywane za pomocą adaptacyjnie przestrajanych detektorów, których działanie opiera się na analizie różnić w pikach widma oraz poziomu dźwięku. Moduł lokalizacji źródła dźwięku działa w oparciu o analizę wielokanałowego sygnału pozyskanego za pomocą czujnika wektorowego. Jedynie fragmenty z wykrytym zdarzeniem akustycznym są analizowane pod kątem określenia pozycji źródła dźwięku. Przedstawione algorytmy mogą znaleźć zastosowanie w systemach monitorowania bezpieczeństwa podczas imprez masowych. Wyniki działania opracowanych algorytmów mogą być wykorzystywane do wykrywania pozycji źródła dźwięku w czasie rzeczywistym lub do obliczania przestrzennego rozkładu energii dźwięku w środowisku. Ponadto, możliwe jest wyodrębnienie dźwięku dobiegającego z wybranego kierunku w oparciu o filtrację przestrzenną.

Entry No. 387

Entry type conference paper

Authors J. Kotus, K. Łopatka, A. Czyżewski, H. Krawczyk

English title

Polish title Multimedialny system wspomagania wykładowcy i prelegenta

Conference VI Krajowa Konferencja Naukowa „INFOBAZY 2011 – Nauka, Projekty Europejskie, Społeczeństwo Informacyjne”

Preprint

Number

Volume

Pages 80 - 86

Conference site Gdańsk – Sopot, Polska

Conference date 5.9.2011- 7.9.2011

Abstract A multimedia system for assisting the lecturer is presented in the paper. The demonstration system is installed in chosen lecture halls in the building of the Faculty of Electronics, Telecommunication and Informatics, Gdansk University of Technology. The system comprises of acoustic vector sensors, fixed cameras and Pan-Tilt Zoom cameras (PTZ). The multimodal audio-video system is a part of the MAYDAY EURO 2012 project infrastructure. The streams from acoustic sensors and cameras are transmitted to the supercomputer cluster. The KASKADA framework enables parallel analysis of multimedia streams in real time. The algorithms for analysis of multimedia streams are described. The algorithm for detection of acoustic events is presented. The methods for calculation of the position of the sound source are introduced. The results are used to aim the camera in the direction of the sound source. The calibration procedure is described and the results of detection and localization of sounds are discussed. The presented technology can be used to assist the lecturer by detecting such activities as taking part in the discussion or disturbing sounds. The system can also be used in smart surveillance to monitor public events. The combined analysis of audio and video stream can lead to a significant increase in the efficiency of the system.

Streszczenie W referacie przedstawiono multimedialny system wspomagania wykładowcy i prelegenta, zainstalowany w wybranych salach audytoryjnych w nowym gmachu Wydziału Elektroniki Telekomunikacji i Informatyki Politechniki Gdańskiej. System ten tworzą: wektorowe czujniki akustyczne, kamery stacjonarne oraz kamery obrotowe z możliwością regulacji ogniskowej. Opracowywany system akustyczno wizyjny stanowi część infrastruktury technicznej budowanej w ramach realizacji projektu MAYDAY EURO 2012. Sygnały pozyskiwane z poszczególnych urządzeń są w sposób ciągły przekazywane do systemu KASKADA, umożliwiającego przetwarzanie strumieni multimedialnych na klastrze superkomputerowym. W dalszej części referatu opisano algorytmy analizy strumieni w czasie rzeczywistym. Poszczególne modalności, tj. sygnał akustyczny i sygnał wizyjny podlegają współbieżnej analizie z uwzględnieniem synchronizacji czasowej. Przedstawiony algorytm przetwarzania sygnałów akustycznych umożliwia automatyczną detekcję wybranych zdarzeń akustycznych o charakterze impulsowym lub ustalonym (trzask, mowa). Dla wykrytych zdarzeń akustycznych określana jest pozycja źródła dźwięku. W oparciu o uzyskane dane możliwe jest nakierowanie kamery na wykryte źródło dźwięku. W ostatniej części referatu przedstawiono procedurę kalibracji systemu akustyczno-wizyjnego oraz wyniki detekcji i lokalizacji wybranych źródeł dźwięku. Opracowany system może stanowić istotną pomoc dydaktyczną dla wykładowcy lub prelegenta, zwiększając interakcję pomiędzy wykładowcą a słuchaczami obecnymi na sali. Opracowana technologia może również znaleźć zastosowanie w systemach monitorowania obiektów w trakcie trwania imprez masowych. Połączenie analizy akustycznej i wizyjnej może w istotny sposób większych skuteczność wykrywania zdarzeń niebezpiecznych.

Entry No. 388

Entry type conference paper

Authors A. Czyżewski, J. Kotus, P. Szczuko, K. Łopatka, P. Dalka

English title Passive Acoustic Radar

Polish title Pasywny Radar Akustyczny

Conference Brussels Innova 60th Edition

Preprint

Number

Volume

Pages

Conference site Bruksela, Belgia

Conference date 17.11.2011- 19.11.2011

Abstract Passive acoustic radar is a solution utilizing sound intensity probe (AVS – Acoustic Vector Sensor) for automatic detection, classification, localization and continuous tracking of sound sources such as: screams, gunshots, explosions and breaking glass. The invention comprises: sound intensity probe, module conditioning the signal for processing, computer with signal processing application and moveable pan-tilt-zoom camera. Multichannel output signal from the sensor is used in calculation of acoustic direction of arrival and identification of sound event type. That information is used to control the camera so as it points in the direction of incoming sound. The invention can be used with automatic surveillance systems (acoustic monitoring), in monitoring during mass events, and in military applications - to identify positions of shooters and artillery. The innovation consists in utilization, instead of commonly used microphone matrices or dummy heads, of a single multi-directional miniature passive (operating without sending a beam) sensor which enables identification and tracing of sound sources with high angular accuracy.

Streszczenie Pasywny Radar Akustyczny jest urządzeniem, które umożliwia automatyczną detekcję, klasyfikację, lokalizację i śledzenie źródeł dźwięku, takich jak: krzyki, wystrzały, wybuchy, tłuczone szkło. Składa się z nowego rodzaju wielokanałowych, miniaturowych wektorowych czujników akustycznych oraz algorytmów cyfrowego przetwarzania sygnałów. Umożliwia sterowanie kamerami PTZ oraz odsłuch dźwięku z wybranego kierunku.

Entry No. 389

Entry type journal paper

Authors K. Łopatka, J. Kotus, A. Czyżewski

English title Application of vector sensors to acoustic surveillance of a public interior space

Polish title Zastosowanie wektorowych czujników akustycznych do nadzoru wnętrz w przestrzeni publicznej

Journal Archives of Acoustics

Volume 36

Number 4

Pages 851 - 860

Abstract A method for precise sound sources detection and localization in interiors is presented. Acoustic vector sensors, which provide multichannel output signals of acoustic pressure and particle velocity were employed. Methods for detecting acoustic events are introduced. The algorithm for localizing sound events in the audience is presented. The system set up in a lecture hall, which serves as a demonstrator of the proposed technology, is described. The accurracy of the proposed method is evaluated by the described measurement results. The analysis of the results is followed by conclusions pertaining the usability of the proposed system. The concept of the multimodal audio-visual detection of events in the audience is also introduced.

Streszczenie Przedstawiono metodę precyzyjnej detekcji i lokalizacji źródeł dźwięku w pomieszczeniach. Wykorzystano wektorowe czujniki akustyczne, dostarczające sygnałów ciśnienia akustycznego i prędkości cząsteczek powietrza. Zaprezentowano metodę lokalizacji źródeł dźwięku na widowni wydarzenia publicznego. Przedstawiono demonstracyjny system zainstalowany w sali wykładowej. System poddano ocenie dokładności na podstawie przeprowadzonych pomiarów. Przedstawiono analizę wyników i dyskusję nad poprawą jakości działania systemu.

Entry No. 390

Entry type journal paper

Authors G. Szwoch, P. Dalka, A. Ciarkowski, P. Szczuko, A. Czyzewski

English title Visual Object Tracking System Employing Fixed and PTZ Cameras

Polish title System śledzenia obiektów ruchomych z wykorzystaniem kamer stacjonarnych i PTZ

Journal Journal of Intelligent Decision Technologies

Volume 5

Number 2

Pages 177 - 188

Notes http://iospress.metapress.com/content/m5060n24tk125406/?p=2aa903da834b4371955e56c56b058b6b&pi=5

Abstract The paper presents a video monitoring system utilizing fixed and PTZ cameras for tracking of moving objects. First type of camera provides image for background modelling, being employed for foreground objects localization. Estimated objects locations are then utilised for steering of PTZ cameras when observing targeted objects with high close-ups. Objects are classified into several classes, then basic event detection is being performed. Event type, object localisation and images acquired by the cameras are presented visually in a "live map" system. In the paper details related to detection of moving objects are presented. Next, camera calibration procedure and geopositioning coordinates way of processing are discussed. Event detection is described. Finally an experiment is presented, organised in order to verify the camera tracking system accuracy.

Streszczenie W artykule przedstawiono system monitoringu wizyjnego z wykorzystaniem kamer stacjonarnych i PTZ do celów śledzenia obiektów ruchomych. Obraz z kamery stacjonarnej jest wykorzystywany do detekcji i śledzenia obiektów ruchomych. Estymaty pozycji obiektów są używane do nakierowania kamery PTZ na obserwowany obiekt. Po klasyfikacji obiektów następuje detekcja zdarzeń. Dane o wykrytych zdarzeniach są prezentowane na mapie. W artykule opisano metody detekcji obiektów, kalibracji kamer i estymacji położenia geograficznego obiektu oraz detekcji zdarzeń. Opisano eksperyment przeprowadzony w celu weryfikacji dokładności systemu.

Entry No. 391

Entry type conference paper

Authors P. Marcinkowski, A. Korzeniewski, A. Czyżewski

English title Human Tracking in Multi-camera Visual Surveillance System

Polish title Śledzenie osób w wielokamerowym systemie kamer

Conference 4th International Conference Multimedia Communications, Services and Security (MCSS)

Preprint

Number

Volume 149

Pages 277 - 285

Conference site Kraków, Polska

Conference date 2.6.2011- 3.6.2011

Notes A. Dziech, A. Czyżewski: Multimedia Communications, Services and Security. Springer 2011, ISBN 978-3-642-21511-7

Abstract A short survey of visual human tracking technologies used in intelligent surveillance systems is presented. Face recognition algorithms combined with human tracking systems are not meant to identify human face and personality. There is no database with persons’ biometric features employed, thus in this case there is no problem with violating privacy policy. The concept of combining human tracking technology with face recognition techniques, in order to increase efficiency, has been described. The paper also includes the description of KASKADA - hardware and software supercomputer platform for development of multimodal (audio and video) algorithms, including object and person tracking video monitoring systems. Face recognition algorithm on the KASKADA platform was proposed. Method of implementation of the proposed algorithm was described.

Streszczenie Artykuł prezentuje krótkie podsumowanie wykorzystywanych technologii z dziedziny śledzenia osób z wykorzystaniem inteligentnych systemów bezpieczeństwa. Opisane w niniejszym opracowaniu systemy rozpoznawania twarzy, w połączeniu ze śledzeniem osób, nie mają na celu rozpoznawania tożsamości osób. Nie powstaje żadna baza danych łącząca cechy biometryczne z konkretnymi osobami, co sprawia że przestrzegane jest prawo w zakresie ochrony danych osobowych. W niniejszym opracowaniu opisano koncepcję wykorzystania algorytmów rozpoznawania twarzy w celu poprawienia skuteczności rozpoznawania osób w obrazie z kamer. Dokument ten zawiera również opis platformy KASKADA - superkomputerowej platformy kontekstowej analizy strumieni danych multimedialnych do identyfikacji wyspecyfikowanych obiektów lub niebezpiecznych zdarzeń. Zaproponowano wykonanie algorytmu rozpoznawania twarzy z wykorzystaniem platformy KASKADA, oraz przedstawiono sposób implementacji właściwego algorytmu.

Entry No. 392

Entry type conference paper

Authors A. Czyżewski, B. Kostek, A. Kupryjanow

English title Online Sound Restoration for Digital Library Applications

Polish title Sieciowa rekonstrukcja dźwięku przeznaczona dla cyfrowych bibliotek

Conference SYNAT Workshop - Post-Conference Event by The International Symposium on Methodologies for Intelligent Systems (ISMIS 2011)

Preprint

Number

Volume 390

Pages 227 - 242

Conference site Warszawa, Polska

Conference date 1.7.2011- 1.7.2011

Bibliographic No. 29

Abstract A system for sound restoration having the following features was conceived and engineered: no special sound restoration software is needed to perform audio restoration; the process of online restoration employs automatic reduction of noise, wow and impulse distortions; no skills in digital signal processing are required from the user. The principles of the created system and its features as well as hitherto achieved results are discussed in the paper.

Streszczenie W referacie przedstawiono system rekonstrukcji dźwięku posiadający następujące własności: nie wymaga specjalistycznego oprogramowania służącego do rekonstrukcji sygnałów fonicznych; proces sieciowej rekonstrukcji dźwięku pozwala na automatyczne usunięcie z nagrań dźwiękowych szerokopasmowego szumu, zniekształceń impulsowych oraz drżenia i kołysania dźwięku; użytkownicy systemu nie muszą posiadać wiedzy związanej z cyfrowym przetwarzaniem sygnałów. W referacie przedstawiono zasadę działania systemu jego funkcjonalność oraz dotychczas osiągnięte wyniki.

Entry No. 393

Entry type conference paper

Authors J. Cichowski, A. Czyżewski

English title Reversible Video Stream Anonymization for Video Surveillance Systems Based on Pixels Relocation and Watermarking

Polish title Odwracalna Anonimizacja Strumieni Wizyjnych w Systemach Monitoringu Wykorzystująca Relokację Pikseli oraz Znakowanie Wodne

Conference 13th IEEE International Conference on Computer Vision - ICCV2011

Preprint

Number

Volume

Pages 1971 - 1977

Conference site Barcelona, Hiszpania

Conference date 6.11.2011- 13.11.2011

Notes Plakat, WS22: The Eleventh IEEE International Workshop on Visual Surveillance. Wersja na CD, materiały konferencyjne zostaną wydrukowane później przez IEEE.

Abstract A method of reversible video image regions of interest anonymization for applications in video surveillance systems is described. A short introduction to the anonymization procedures is presented together with the explanation of its relation to visual surveillance. A short review of state of the art of sensitive data protection in media is included. An approach to reversible Region of Interest (ROI) hiding in video is presented, utilizing a new relocation algorithm for hashing and a watermarking technique for extra data embedding. Implemented application is described, and results obtained using it are reported. Future work and possible improvements to introduced algorithms are discussed.

Streszczenie Opisano metodę odwracalnej anonimizacji obszarów zainteresowań w obrazie wideo dla zastosowań w wizyjnych systemach nadzoru. Krótkie wprowadzenie do procedur anonimizacji zostało zaprezentowane wraz z wyjaśnieniem ich powiązania z systemami monitoringu. Umieszczono również przegląd aktualnego stanu techniki dotyczącego ochrony danych delikatnych w mediach. Zaprezentowano podejście do odwracalnego ukrywania obszarów w strumieniu wideo, wykorzystujące nowy algorytm relokacji do zamazywania treści oraz technikę cyfrowego znakowania wodnego w celu osadzania dodatkowych danych w obrazie. Opisano stworzoną aplikację oraz omówiono uzyskane dzięki niej wyniki badań. Przedyskutowano przyszły kierunek rozwoju i możliwe udoskonalenia zastosowanych algorytmów.

Entry No. 394

Entry type

Authors A. Czyżewski, A. Kupryjanow

English title

Polish title Sposób i system wspomagania rozumienia mowy.

Notes zgłoszenie patentowe

Streszczenie System wspomagania charakteryzuje się tym, że w jednostce przetwarzania znajdują się połączone ze sobą moduły: detektor mowy, detektor samogłosek, moduł redukcji nadmiarowości i synchronizacji, dyskryminator tempa mowy oraz moduł transpozycji czasowej, którego wyjście stanowi źródło wyjściowego sygnału fonicznego dla odbiorcy

Entry No. 395

Entry type

Authors A. Czyżewski, A. Kupryjanow

English title

Polish title System wspomagania rozumienia mowy.

Notes zgłoszenie patentowe

Streszczenie System wspomagania rozumienia mowy charakteryzuje się według wynalazku tym, że jednostka przetwarzania wbudowana jest do implantu ślimakowego wszczepionego odbiorcy zwłaszcza z pogorszoną rozdzielczością czasową słuchu.

Entry No. 396

Entry type

Authors A. Czyżewski, A. Kupryjanow

English title

Polish title Sposób i system wspomagania rozumienia mowy

Notes patent nadany w 2014 r.

Abstract Istotą wynalazku jest sposób transpozycji czasowej mowy nie naruszającej jej zrozumiałości.

Entry No. 397

Entry type

Authors M. Lech, B. Kostek, A. Czyżewski

English title

Polish title Układ do miksowania dźwięku

Notes Prawo wyłączne, data publikacji WUP: 2016-11-30

Streszczenie wykaz towarów Sposób miksowania dźwięku polegający na zmianie parametrów i sterowaniu parametrami sygnału zapisanego na poszczególnych ścieżkach dźwiękowych składających się na końcowy sygnał foniczny za pomocą aplikacji komputerowej udostępniającej operacje miksowania dźwięku charakteryzuje się tym, że określone operacje miksowania wybiera się i wykonuje bezkontaktowo za pomocą gestów obiektów sterujących (OS) odbieranych przez moduł akwizycji gestów (K), które po ich przetworzeniu metodami cyfrowymi w urządzeniu sterującym (U) współpracującym z komputerem (C) wykorzystuje się do generowania sygnałów elektronicznych sterujących wyborem operacji miksowania dla aplikacji komputerowej udostępniającej operacje miksowania dźwięku, przy czym użytkownik dowolnie określa i modyfikuje powiązania gestów z poszczególnymi operacjami miksowania. System miksowania dźwięku zawiera zespół głośników (G) współpracujących z komputerem (C) wyposażonym w aplikację komputerową (AM) udostępniającą operacje miksowania dźwięku i wyposażony jest w urządzenie sterujące (U) sprzężone z komputerem (C) i posiadające moduł akwizycji gestów (K) sprzężony bezkontaktowo z obiektami sterującymi (OS).

Entry No. 398

Entry type book

Authors A. Dziech, A. Czyżewski

English title Multimedia Communications, Services and Security

Polish title Multimedialna komunikacja, usługi i ich bezpieczeństwo

Editor Springer, Communications in Computer and Information Science 287

Pages

Notes Redakcja wydania książkowego

Abstract Proceedings of 5th International Conference, MCSS 2012 Krakow, Poland, May/June 2012

Streszczenie Prace V Miedzynarodowej konferencji MCSS 2012 Kraków, Polska, maj-czerwiec 2012

Entry No. 399

Entry type journal paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak

English title Online Urban Acoustic Noise Monitoring System

Polish title System aktywnego monitoringu hałasu w algomeracji miejskiej

Journal Noise Control Engineering Journal

Volume 60

Number 1

Pages 69 - 84

Abstract Concepts and implementation of the Online Urban Noise Monitoring System are presented. Principles of proposed solution used for dynamic acoustical maps creating are discussed. The architecture of the system and the data acquisition scheme are described. The concept of noise mapping, based on noise source model and propagation simulations, was developed and employed in the system. Dynamic estimation of noise source parameters utilized in the system is introduced. The details of implementation of noise map computation and visualization are presented. Advances introduced by the developed solution in the continuous noise monitoring and acoustic maps creation is in focus. The results of measurements and simulations performed by the system prototype are depicted. Noise measurements results gathered by system and created acoustic maps are compared with some other solutions in order to investigate accuracy.

Streszczenie W artykule przedstawiono koncepcję i praktyczną implementację internetowego systemu monitorowania hałasu. Przedyskutowano możliwości zastosowania proponowanych rozwiązań do tworzenia dynamicznych map hałasu. Opisano architekturę systemu oraz moduły akwizycji wyników pomiarowych. Opracowana koncepcja tworzenia map hałasu opiera się na wykorzystaniu modeli wybranych źródeł hałasu oraz części propagacyjnej. W systemie wprowadzono funkcjonalność umożliwiającą dynamiczną zmianę parametrów opisujących źródło hałasu. Ponadto przedstawiono szczegóły realizacji obliczeń i wizualizacji map hałasu. W dalszej części pracy przedstawiono wyniki długookresowych pomiarów hałasu oraz mapy hałasu uzyskane za pomocą opracowanych narzędzi. Przedstawiono również analizę dokładności uzyskanych wyników na podstawie porównania wyników modelowania opracowanym narzędziem oraz komercyjna aplikacją do sporządzania map hałasu.

Entry No. 400

Entry type conference paper

Authors J. Cichowski, A. Czyżewski

English title Sensitive Audio Data Encryption for Multimodal Surveillance Systems

Polish title Szyfrowanie danych wrażliwych w ścieżce audio dla wielomodalnych systemów monitoringu

Conference 132nd Audio Engineering Society Convention 2012

Preprint

Number 8605

Volume

Pages

Conference site Budapeszt, Węgry

Conference date 26.4.2012- 29.4.2012

Bibliographic No. 8605

Notes http://www.aes.org/e-lib/browse.cfm?elib=16243

Abstract Novel algorithms for data processing in audiovisual surveillance systems were developed allowing for a better personal data protection. The solution merging the image and audio encryption for privacy-sensitive data protection employing audio stream is described. The main objectives of this research study including motivation and the state of the art are presented with a comprehensive explanation of audio stream relation to the surveillance. The invertible encryption methodology for privacy preserving using audio container is applied. The experiments are described and obtained results are reported including prospects for future improvements.

Streszczenie Nowy algorytm przetwarzania danych audiowizualnych w systemach monitoringu pozwalający na lepszą ochronę prywatności został opracowany. Podejście łączące dane wizyjne i foniczne w celu szyfrowania danych wrażliwych zostało opisane. Głównym celem przedstawionej pracy było wyjaśnienie motywacji, przegląd stanu techniki oraz opisanie relacji pomiędzy danymi wizyjnymi i fonicznymi w systemach monitoringu. Odwracalna metoda szyfrowania stosowana do ochrony prywatności bazująca na wykorzystaniu kontenera audio została zaimplementowana. opisane zostały eksperymenty, uzyskane wyniki oraz możliwe ulepszenia metody.

Entry No. 401

Entry type book

Authors A. Czyżewski

English title Department of Multimedia Systems

Polish title Katedra Systemów Multimedialnych

Editor Gdansk University of Technology

Pages 135 - 140

Notes rozdział w książce wydanej z okazji 60-lecia WETI

Abstract History, the curriculum, the range of research, offer for industry, awards associated with the achievements have been presented as a chapter in a book published to commemorate the 60th anniversary of the Faculty of Electronics, Telecommunication and Informatics of the Gdansk University of Technology.

Streszczenie Historia, program nauczania, zakres badań, oferta dla przemysłu i nagrody związane z osiągnięciami zostały zaprezentowane jako rozdział w książce wydanej dla upamietnienia 60-lecia Wydziału ETI PG.

Entry No. 402

Entry type conference paper

Authors A. Czyżewski, Ł. Kosikowski, B. Kostek, J. Kotus, P. Suchomski

English title New Tools for Hearing Loss Screening and Tinnitus Diagnosing

Polish title

Conference AES 47th international conference

Preprint

Number

Volume

Pages

Conference site Chicago, USA

Conference date 20.6.2012- 22.6.2012

Notes Dostępne w materiałach konferencyjnych - pamięć usb

Abstract A theoretical model of Tinnitus (ringing in ears) based on the existence of a parasitic quantization, that accompanies hearing loss has been formulated in the previous work presented at the 120th AES Convention, linking hearing loss, dithering and Tinnitus. Accurate estimation of the Tinnitus characteristic concerning sound type, level, bandwidth or frequency is inevitable for many treatment methods. The proposed way of obtaining Tinnitus characteristic is described in the paper, preceded by a description of developed applications for screening hearing testing.

Streszczenie W publikacji zaproponowano sposób uzyskania charakterystyki szumów usznych oraz opis opracowanych aplikacji służących do przesiewowego badania słuchu.

Entry No. 403

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak, B. Kostek

English title Employing Supercomputing Cluster to Acoustic Noise Map Creation

Polish title Zastosowanie gridu superkomputerowego do tworzenia map hałasu

Conference Audio Engineering Society Convention 133

Preprint 8775

Number

Volume

Pages 1 - 7

Conference site San Francisco, Stany Zjednoczone Ameryki

Conference date 26.10.2012- 29.10.2012

Abstract A system is presented for determining acoustic noise distribution and assessing its adverse effects in short time periods inside large urban areas owing to the employment of a supercomputing cluster. A unique feature of the system is the psychoacoustic noise dosimetry implemented to inform interested citizens about predicted auditory fatigue effects which may be caused by the exposure to excessive noise. The noise level computing is based on the engineered Noise Prediction Model (NPM) stemmed from the Harmonoise model. Sound level distribution in the urban area can be viewed by users over the prepared www service. An example of a map is presented in consecutive time periods to show the capability of the supercomputing cluster to update noise level maps frequently.

Streszczenie W referacie przedstawiono system do wyznaczania rozkładu poziomu hałasu i określanie jego niekorzystnego wpływu w krótkich okresach czasu w dużych obszarach miejskich z zastosowaniem klastrów superkomputerowych. Unikalną właściwością systemu jest psychoakustyczna dozymetria hałasowa zaimplementowana w celu dostarczenia informacji zainteresowanym mieszkańcom o przewidywanych efektach zmęczenia słuchu, które mogą być spowodowane przez ekspozycję na nadmierny hałas. Obliczanie poziomu hałasu bazuje na opracowanym Modelu Prognozowania Hałasu (MPH) wzorowanego na modelu Harmonoise. Rozkład poziomu dźwięku w obszarze aglomeracji może być obserwowany przez użytkowników poprzez przygotowaną stronę www. Zaprezentowano przykład mapy hałasu w różnych okresach doby w celu pokazania możliwości klastrów superkomputerowych do częstego odświeżania map poziomu hałasu.

Entry No. 404

Entry type journal paper

Authors B. Kunka, A. Czyżewski, A. Kwiatkowska

English title AWARENESS EVALUATION OF PATIENTS IN VEGETATIVE STATE EMPLOYING EYE-GAZE TRACKING SYSTEM

Polish title Ocena świadomości pacjentów w stanie wegetatywnym z wykorzystaniem systemu śledzenia punktu fiksacji wzroku

Journal International Journal on Artificial Intelligence Tools (IJAIT)

Volume 21

Number 2

Pages 1 - 11

Bibliographic No. 25

Notes DOI No: 10.1142/S0218213012400076

Abstract Application of eye-gaze tracking system to awareness evaluation is demonstrated. Hitherto awareness evaluation methods are presented. The assumptions of proposed method based on analysis of visual activity of patients in vegetative state are demonstrated. The eye-gaze tracking system “Cyber-Eye” developed at the Multimedia Systems Department employed to conducted experiments is presented. Research described in the paper indicates that awareness level of 13 of 15 tested patients was misdiagnosed before the new method of awareness evaluation is introduced.

Streszczenie W niniejszym artykule przedstawiono zastosowanie systemu śledzenia punktu fiksacji wzroku w określaniu stopnia świadomości. Przedstawiono dotychczas stosowane metody określania stopnia świadomości. Założenia zaproponowanej przez Autorów metody są oparte na analizie aktywności wzrokowej pacjentów przebywających w stanie wegetatywnym. W artykule przedstawiono również system śledzenia punktu fiksacji wzroku "Cyber-Oko" opracowany w Katedrze Systemów Multimedialnych, który został wykorzystany w przeprowadzonych eksperymentach. Badania prezentowane w niniejszym artykule wskazują na to, że 13 spośród 15 pacjentów, poddanych badaniu z wykorzystaniem zaproponowanej metody, zostało błędnie zdiagnozowanych.

Entry No. 405

Entry type journal paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title Distributed System For Noise Threat Evaluation Based On Psychoacoustic Measurements

Polish title Rozproszony system do ewaluacji zagrożeń hałasem bazujący na pomiarach psychoakustycznych

Journal Metrology And Measurement Systems

Volume XIX

Number 2

Pages 219 - 230

Abstract An innovative system designed for the continuous monitoring of acoustic climate of urban areas was presented in the paper. The assessment of environmental threats is performed using online data, acquired through a grid of engineered monitoring stations collecting comprehensive information about acoustic climate of urban areas. The grid of proposed devices provides valuable data for the purpose of long and short time acoustic climate analysis. Dynamic estimation of noise source parameters and real measurement results of emission data are utilized to create dynamic noise maps accessible to the general public. This operation is performed through the noise source prediction employing a propagation model being optimized for a computer cluster implementation requirements. It enables the system to generate noise maps in a reasonable time and to publish regularly map updates in the Internet. Moreover, the functionality of the system was extended with new techniques for assessing noise-induced harmful effects on the human hearing system. The principle of working of the dosimeter is based on a modified psychoacoustic model of hearing and on the results of research performed with participation of volunteers concerning the impact of noise on hearing. The primary function of the dosimeter is to estimate, in real time, auditory effects which are caused by exposure to noise. The results of measurements and simulations performed by the system prototype are depicted and analyzed. Several cases of long-term and short-term measurements of noise originating from various sources were considered in details. Presented outcomes of predicted degree of the hearing threshold shift induced during the noise exposure can increase awareness of harmfulness of excessive sound levels.

Streszczenie W artykule przedstawiono innowacyjny system do ciągłego monitorowania klimatu akustycznego w obszarach miejskich. Szacowanie zagrożeń środowiskowych jest wykonywane z użyciem danych zbieranych w czasie rzeczywistym z użyciem sieci opracowanych stacji monitorujących, pozyskujących kompletne dane o klimacie akustycznym w obszarze miejskim. Sieć proponowanych urządzeń dostarcza cennych danych do celów długo- i krótkookresowej analizy klimatu akustycznego. Dynamiczne oszacowanie parametrów źródła hałasu oraz wyniki pomiarów są używane do tworzenia dynamicznych map hałasu dostępnych publicznie. Ta operacja jest przeprowadzana z zastosowaniem modeli źródła i propagacji hałasu zoptymalizowanych do działania na klastrze komputerowym. W ten sposób system umożliwia tworzenie map hałasu w akceptowalnym czasie i publikowanie regularnie uaktualnianych map w Internecie. Dodatkowo, funkcjonalność systemu została rozszerzona o nowe techniki określania szkodliwych efektów dla słuchu wywołanych przez hałas. Zasada działania dozymetru jest oparta o zmodyfikowany model psychoakustyczny słuchu i na wynikach badań przeprowadzonych z udziałem ochotników dotyczących wpływu hałasu na słuch. Podstawową funkcją dozymetru jest szacowanie w czasie rzeczywistym efektów słuchowych, które spowodowane są ekspozycją na hałas. Wyniki pomiarów i symulacji przeprowadzonych za pomocą prototypu systemu są przedstawione i przedyskutowane. Prezentowane wyniki przewidywanego stopnia podniesienia progu słyszenia wywołanego przez narażenie na hałas mogą przyczynić się do wzrostu świadomości o szkodliwości nadmiernych poziomów dźwięku.

Entry No. 406

Entry type conference paper

Authors M. Szczodrak, J. Kotus, A. Czyżewski, B. Kostek

English title Application of Grid Infrastructure to Noise Map Calculation of Large City Areas

Polish title Zastosowanie infrastruktury gridowej do obliczania map hałasu dużych obszarów miejskich

Conference Cracow Grid Workshop 2012

Preprint

Number

Volume

Pages 2

Conference site Kraków, Polska

Conference date 22.10.2012- 24.10.2012

Abstract Concept and implementation of the system for creating dynamic noise maps in PlGrid infrastructure are presented. The methodology of dynamic acoustical maps creating is introduced. The concept of noise mapping, based on noise source and propagation models, was developed and employed in the system. The details of incorporation of the system to the PlGrid infrastructure are presented. The results of simulations performed by the system prototype are depicted. The results in form of noise maps obtained by system are compared with some other solutions in order to investigate accuracy.

Streszczenie W referacie przedstawiono koncepcję i implementację systemu do tworzenia dynamicznych map hałasu w infrastrukturze PLGrid. Omówiono metodologię tworzenia map akustycznych odświeżanych dynamicznie. Zastosowano i zaimplementowano w systemie metodę bazującą na koncepcji zastosowania modeli źródła i propagacji dźwięku. Przedstawiono szczegółowe informacje o zintegrowaniu systemu z infrastrukturą PLGrid. Pokazano wyniki symulacji przeprowadzonych za pomocą systemu. Wyniki w postaci map hałasu są porównane z otrzymanymi za pomocą innych narzędzi w celu zbadania dokładności opracowanego systemu.

Entry No. 407

Entry type conference paper

Authors K. Kopaczewski, M. Szczodrak, A. Czyżewski, H. Krawczyk

English title Application of virtual gate for counting people participating in large public events

Polish title Zastosowanie wirtualnej bramki do liczenia osób biorących udział w imprezach masowych

Conference Multimedia Communications, Services and Security

Preprint

Number

Volume 287

Pages 316 - 327

Conference site Kraków, Polska

Conference date 31.5.2012- 1.6.2012

Abstract The concept and practical application of the developed algorithm for people counting in crowded scene is presented. The aim of the work is to estimate the number of people passing towards entrances of a large sport hall. The details of implemented the Virtual Gate algorithm are presented. The video signal from the camera installed in the building constituted the input for the algorithm. The most challenging problem was the unpredicted behavior of people while entering the building. A series of experiments during real sport events and concerts was made. The case of improved organization of people passing is described and the influence on the counting results is shown. The results of the studies are shown and achieved outcomes are discussed.

Streszczenie Przedstawiono koncepcję i praktyczne wykorzystanie opracowanego algorytmu zliczania osób w tłumie. Celem pracy jest oszacowanie liczby osób przechodzących przez wejścia do dużej hali widowiskowo-sportowej. Omówiono szczegóły zaimplementowanego algorytmu Wirtualnej Bramki. Dane wejściowe algorytmu stanowiły strumienie wizyjne z kamer zainstalowanych w budynku. Największy problem stanowiło nieprzewidywalne przemieszczanie się osób podczas wchodzenia do obiektu. Przeprowadzono serię eksperymentów podczas rozgrywanych meczy oraz koncertów. Opisano eksperyment polegający na ulepszeniu organizacji ruchu ludzi wchodzących do budynku i jego wpływ na osiągnięte wyniki. Przedstawiono i przedyskutowano otrzymane wyniki.

Entry No. 408

Entry type conference paper

Authors B. Kostek, J. Kotus, A. Czyżewski

English title Noise Monitoring System Employing Psychoacoustic Noise Dosimetry

Polish title System monitorowania hałasu wykorzystujący psychoakustyczny dozymetr hałasowy

Conference AES 47th International Conference

Preprint 13

Number

Volume

Pages 1 - 12

Conference site Chicago, Stany Zjednoczone Ameryki

Conference date 20.6.2012- 22.6.2012

Abstract New ways of assessing noise-induced harmful effects on human hearing system were presented at the 126th AES Convention. They resulted from a long-term study allowing authors to define new indicators that were proposed on the basis of hearing examination done in the real noise exposure situations. However, it seems now that the topic was raised prematurely at that time (in 2009), because it did not entail any discussion on this matter in the AES community. Meanwhile, the authors continued their work on this subject. Consequently, the mentioned new ideas were disseminated in some following papers and an advanced distributed noise monitoring system was implemented employing the conceived psychoacoustic noise dosimetry. The proposed noise exposure indicators are reviewed in the present paper. The practical applicability of the proposed indicators were confirmed experimentally using hearing testing with real noise exposures and also on the basis of simulation results employing some standard test signals.

Entry No. 409

Entry type journal paper

Authors A. Czyżewski, Ł. Kosikowski, B. Kunka, A. Kupryjanow, M. Lech, P. Odya

English title Series of multimodal computer interfaces

Polish title Typoszereg komputerowych interfejsów multimodalnych

Journal Przegląd Telekomunikacyjny

Volume

Number 8-9

Pages 1292 - 1303

Bibliographic No. 12

Notes dostępne na płycie

Abstract Multimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people.

Streszczenie W referacie opisano opracowywane w ramach realizowanego projektu, multimodalne interfejsy multimodalne, ułatwiające użytkowanie urządzeń komputerowych, w tym również terminali mobilnych. Przedstawiono zasady działania poszczególnych interfejsów oraz dotychczasowo uzyskane rezultaty. Wyniki uzyskane zostały drogą prób i eksperymentów z udziałem grup użytkowników docelowych, obejmujących zarówno użytkowników standardowych, jak również dzieci, pacjentów, osoby sparaliżowane i in. W podsumowaniu przedstawiono wnioski i wstępną ocenę przydatności rozwijanych rozwiązań.

Entry No. 410

Entry type conference paper

Authors P. Czyżyk, J. Cichowski, B. Kostek, A. Czyżewski

English title Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection

Polish title Analiza wpływu stratnej kompresji dźwięku na odporność znaku wodnego osadzonego w dziedzinie DWT w celu ochrony praw autorskich typu non-blind

Conference 5th International Conference on Multimedia Communications, Services and Security, MCSS'12

Preprint

Number

Volume

Pages 36 - 46

Conference site Kraków, Polska

Conference date 31.5.2012- 1.6.2012

Bibliographic No. 4

Notes Communications in Computer and Information Science 287

Abstract A methodology of non-blind watermarking of the audio content is proposed. The outline of audio copyright problem and motivation for practical applications are discussed. The algorithmic theory pertaining watermarking techniques is briefly introduced. The system architecture together with employed workflows for embedding and extracting the watermarks are described. The implemented approach is described and obtained results are reported. The possible attacks on the embedded watermark are described and the procedure of simulating the attacks is explained. The research is focused on the influence of lossy compression on the embedded watermark degradation. The peak signal to noise ratio and bit error rate are analyzed and compared. Advantages and disadvantages of the proposed approach are discussed. Future work and some possible improvements to the introduced methodology are explained.

Streszczenie Zaproponowana metodologia związana jest ze znakowaniem wodnym materiału fonicznego typu non-blind. Przedyskutowano problem ochrony praw autorskich nagrań fonicznych oraz motywację do praktycznego zastosowania proponowanej metody. Przedstawiono podstawy teoretyczne dotyczące technik znakowania wodnego. Zaprezentowano architekturę systemu wraz z przepływem danych wykorzystywanych do realizacji procedur osadzania i ekstrakcji znaków wodnych. Wyniki rzeczywistej implementacji systemu zostały opisane. Hipotetyczne ataki skierowane na znak wodny zostały zaprezentowane, wyjaśniono procedury symulacji ataków. Przeprowadzone badania skoncentrowane zostały pod kątem analizy wpływu stratnej kompresji dźwięku na degradację osadzonego znaku wodnego. Stosunek sygnału do szumu oraz bitowa stopa błędów zostały użyte do porównania otrzymanych wyników. Zalety i wady proponowanego podejścia zostały omówione. Przyszłe prace oraz możliwe usprawnienia wprowadzonej metodologii zostały wyjaśnione.

Entry No. 411

Entry type journal paper

Authors M. Lech, B. Kostek, A. Czyżewski

English title Virtual Whiteboard: A gesture-controlled pen-free tool emulating school whiteboard

Polish title

Journal Intelligent Decision Technologies

Volume 6

Number 2/2012

Pages 161 - 169

Abstract In the paper the so-called Virtual Whiteboard is presented which may be an alternative solution for modern electronic whiteboards based on electronic pens and sensors. The presented tool enables the user to write, draw and handle whiteboard contents using his/her hands only. An additional equipment such as infrared diodes, infrared cameras or cyber gloves is not needed. The user’s interaction with the Virtual Whiteboard computer application is based on dynamic hand gesture recognition. Gestures are recognized in the process of analyzing video stream obtained from a webcam coupled with a multimedia projector displaying whiteboard contents. The tracking positions of hands in the image is supported by Kalman filtering. In the paper the hardware and software of the Virtual Whiteboard is presented with a special focus on utilizing Kalman filters for prediction of consecutive hand positions. For the gestures applied to handle whiteboard contents, examined efficacy of Kalman filter supported recognition and the efficacy without using the filtering is given. In addition, the results of system efficiency tests are provided.

Entry No. 412

Entry type book

Authors A. Czyżewski, A. Kupryjanow, B. Kostek

English title Online Sound Restoration for Digital Library Applications

Polish title Sieciowa rekonstrukcja dźwięku przeznaczona dla cyfrowych bibliotek

Editor Springer-Verlag

Pages 227 - 242

Bibliographic No. 29

Notes rozdział w książce

Abstract A system for sound restoration having the following features was conceived and engineered: no special sound restoration software is needed to perform audio restoration; the process of online restoration employs automatic reduction of noise, wow and impulse distortions; no skills in digital signal processing are required from the user. The principles of the created system and its features as well as hitherto achieved results are discussed in the paper.

Streszczenie W referacie przedstawiono system rekonstrukcji dźwięku posiadający następujące własności: nie wymaga specjalistycznego oprogramowania służącego do rekonstrukcji sygnałów fonicznych; proces sieciowej rekonstrukcji dźwięku pozwala na automatyczne usunięcie z nagrań dźwiękowych szerokopasmowego szumu, zniekształceń impulsowych oraz drżenia i kołysania dźwięku; użytkownicy systemu nie muszą posiadać wiedzy związanej z cyfrowym przetwarzaniem sygnałów. W referacie przedstawiono zasadę działania systemu jego funkcjonalność oraz dotychczas osiągnięte wyniki.

Entry No. 413

Entry type conference paper

Authors K. Przyłucka, B. Kostek, A. Czyzewski

English title Testing audio restoration algorithms

Polish title

Conference 27th Tonmeistertagung – VDT International Convention

Preprint

Number

Volume

Pages 1 - 10

Conference site Kolonia, Niemcy

Conference date 22.11.2012- 25.11.2012

Abstract Nowadays audio material stored on analog carriers is being increasingly digitized. It often is corrupted by distortions, e.g. impulse, white noise, or clipping. There are many methods to reduce these types of distortion. The aim of this paper is to provide guidelines for making automatic decisions about the sequence of specific procedures that will bring the best results in terms of audio restoration. Moreover, an optimization process based on informal listening tests is performed to determine the best restoration algorithm settings. For this purpose restoration algorithms are developed and implemented at the Multimedia Systems Department of Gdansk University of Technology. They are shortly recalled in the paper. Test procedures, and the description of the reconstructed excerpts and results of tests are presented in the paper.

Streszczenie Celem badań było przeprowadzenie testów subiektywnych, mających na celu efektywność opracowanych algorytmów rekonstrukcji nagrań fonicznych. W referacie przedstawiono pokrótce algorytmy rekonstrukcji nagrań fonicznych, a następnie wyniki testów odsłuchowych. Zawarto wnioski dotyczace rozwoju prowadzonych badań.

Entry No. 414

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak, B. Kostek

English title System for creating maps of noise threatening hearing with grid computing on supercomputing platforms

Polish title System do tworzenia map zagrożeń hałasem z zastosowaniem obliczeń gridowych na platformach superkomputerowych

Conference 61 Brussels Innova 2012

Preprint

Number

Volume

Pages

Conference site Bruksela, Belgia

Conference date 15.11.2012- 17.11.2012

Abstract Doveloped system allows determination of noise level and noise impact on hearing in metropolian environments. Results are visualized in the form of noise map and the map of permanet threshold shift in people exposed to noise. Based on the emission model of noise sources (rail, road) and sound propagation algorithms immission noise maps for given area are generated. Generated noise maps are presented via Internet website, hence the target audience is practically unlimited. The use of grid computing, allowing free of charge access to computing resources, subject to grants awarded, allows researchers to develop their own simulations without the need for purchasing specialised software.

Streszczenie Opracowany system pozwala na określenie poziomu oraz wpływu na słuch hałasu występującego w środowisku aglomeracji miejskich. Wynik przedstawiany jest w postaci mapy hałasu oraz mapy przesunięcia progu słyszenia człowieka narażonego na hałas. W oparciu o emisyjny model źródła hałasu (drogowego, kolejowego) oraz algorytmy propagacji dźwięku w środowisku wyznaczane są immisyjne mapy hałasu dla danego obszaru. Prezentacja przygotowanych map hałasu jest realizowana poprzez serwis internetowy, dzięki czemu grono odbiorców jest praktycznie nieograniczone. Obok możliwości wykorzystania dynamicznie odświeżanych map hałasu przez mieszkańców, administrację, służby ochrony środowiska, służby odpowiedzialne za monitorowanie ruchu drogowego - opracowane rozwiązania mogą służyć prowadzeniu eksperymentów badawczych. Zastosowanie obliczeń gridowych, umożliwiających bezpłatny dostęp na zasobów obliczeniowych na podstawie przyznawanych grantów, umożliwia wykonanie własnych symulacji przez zainteresowanych badaczy, bez konieczności zakupu specjalistycznego oprogramowania.

Entry No. 415

Entry type journal paper

Authors A. Czyżewski, J. Kotus

English title ACOUSTIC

Polish title AKUSTYKA

Journal Newsletter PLGrid Plus

Volume

Number 2

Pages 3 - 6

Notes http://www.plgrid.pl/projekty/plus/materialy_promocyjne/newsletter/Newsletter_PLGrid_Plus-wrzesien_2012.pdf

Streszczenie W artykule przedstawiono zadania realizowane w ramach projektu PL GRID Plus przez zespół wykonawców Katedry Systemów Multimedialnych. Zadanie te obejmują przygotowanie zestawu usług umożliwiających wykonywanie obliczeń map hałasu i wpływu hałasu na słuch z wykorzystaniem infrastruktury PL GRID.

Entry No. 416

Entry type conference paper

Authors Ł. Kosikowski, A. Czyżewski, M. Kurkowski, P. Odya, H. Skarżyński, A. Kupryjanow

English title Audiovisual stymulator of attention

Polish title Audio-wizualny stymulator uwagi

Conference Otwarcie światowego centrum słuchu

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 10.5.2012- 11.5.2012

Notes Plakat

Abstract Na plakacie przedstawiono koncepcje stymulacji sluchowo-wzrokowej do treningu lateralizacji.

Streszczenie In the proposed solution vision and hearing senses are induced. Sense of hearing is stimulated using the time scale modification (TSM) combined with amplitude modification of the speech used during the training. Sight is stimulated by the text of the spoken speech. During the training, text is modified i.e. part of the text that is currently heard in the headphone is marked using various technics (e.g. zooming, coloring, etc.).

Entry No. 417

Entry type conference paper

Authors J. Kotus, A. Czyżewski, B. Kostek, M. Szczodrak, H. Skarżyński

English title Creating maps of noise threatening hearing with supercomputing grids

Polish title Tworzenie map zagrożeń hałasem z zastosowaniem klastrów obliczeniowych

Conference Otwarcie Światowego Centrum Słuchu

Preprint

Number

Volume

Pages

Conference site Kajetany, Polska

Conference date 10.5.2012- 11.5.2012

Notes Prezentacja plakatowa

Streszczenie Na plakacie przedstawiono przykładowe wyniki symulacji wpływu hałasu na słuch podczas koncertu plenerowego uzyskane za pomocą opracowanego w Katedrze Systemów Multimedialnych systemu do tworzenia map zagrożeń hałasem z zastosowaniem klastrów obliczeniowych.

Entry No. 418

Entry type

Authors A. Czyżewski, Ł. Kosikowski, P. Odya, A. Kupryjanow, P. Suchomski

English title

Polish title Sposób prowadzenia treningu słuchowo-wzrokowego, zwłaszcza u osób z zaburzeniami lateralnymi

Notes Zgłoszenie patentowe

Abstract The patent describes a method of auditory-visual training, especially in patients with abnormal lateralisation. The patent covers the use of the eye tracking system as an objective source of information about the visual fixation point while reading the modified text on the computer screen in conjunction with the heard voiceover.

Streszczenie W patencie opisano sposób prowadzenia treningu słuchowo-wzrokowego zwłaszcza u osób z zaburzeniami lateralnymi. Patent dotyczy wykorzystania systemu śledzenia punktu fiksacji wzroku jako obiektywnego źródła informacji na temat punktu fiksacji wzroku podczas czytania zmodyfikowanego graficznie tekstu na ekranie monitora komputerowego w połączeniu ze spowalnianiem mowy.

Entry No. 419

Entry type conference paper

Authors M. Łukaszewicz, A. Czyżewski

English title BIOMEMS TECHNOLOGY - REVIEW oF APPLICATIONS

Polish title TECHNOLOGIA BIOMEMS - PRZEGLĄD ZASTOSOWAŃ

Conference ICT Young

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 26.5.2012- 27.5.2012

Abstract The paper reviews the issues related to technology, MEMS (Micro-Electro Systems-Mechanical) in the context of applications in biotechnology and biomedical systems. Systems of this type, with applications in biology and medicine, have the name of our common BioMEMS systems. This pertains to a wide range of technology BioMEMS. The aim of this study is to determine future medical MEMS microphones with particular emphasis on their application in the treatment of ear (middle ear implants).

Streszczenie W referacie dokonano przeglądu zagadnień związanych z technologią MEMS (Systemy Micro-Electro-Mechaniczne) w kontekście zastosowań w systemach biomedycznych i w biotechnologii. Systemy tego typu, mające zastosowania w biologii i medycynie naszą wspólną nazwę systemów BioMEMS. Niniejszy przegląd do szeroko pojętej technologii BioMEMS, stanowi wstęp teoretyczny do prac projektowych w tej dziedzinie - w pierwszej kolejności do badań nad mikrofonami MEMS realizowanych w ramach pracy inżynierskiej M. Łukaszewicz. Celem tej pracy jest określenie przyszłych zastosowań medycznych mikrofonów MEMS ze szczególnym uwzględnieniem ich aplikacji w leczeniu narządu słuchu (implanty ucha środkowego).

Entry No. 420

Entry type journal paper

Authors A. Czyżewski

English title Multimedia Systems Department

Polish title Katedra Systemów Multimedialnych

Journal Przegląd Telekomunikacyjny

Volume

Number 5

Pages

Abstract The article provides a historical overview, discuss the issues of teaching and research, and presents the current offer of the Department to industry. This article was written on the occasion of the 60th anniversary of the Faculty of Electronics, Telecommunication and Informatics of Gdansk University of Technology.

Streszczenie W artykule zamieszczono rys historyczny, omówienie programu dydaktycznego i tematyki badań naukowych oraz aktualną ofertę katerdy dla przemysłu. Artykuł powstał z okazji 60-lecia Wydziału Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej.

Entry No. 421

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title Cartographic representation of route reconstruction results in video surveillance system

Polish title Reprezentacja kartograficzna wyników rekonstrukcji ścieżki obiektu w systemie monitoringu

Conference Multimedia & Network Information Systems 2012

Preprint

Number

Volume 183

Pages 35 - 44

Conference site Wrocław, Polska

Conference date 19.9.2012- 21.9.2012

Abstract The video streams available in a surveillance system distributed on the wide area may be accompanied by metadata are obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the perception of events. Methods of augmenting cartographic data are proposed in this chapter. Making it possible to merge and to present a large amount of useful data on a single screen of surveillance.

Streszczenie Systemy monitoringu rozproszone na dużym obszarze mogą dostarczać dużej ilości różnorodnych danych. W celu zapewnienia podglądu w czasie rzeczywistym strumienie wideo są dostarczane do operatora. Ponad to uzyskiwane są metadane jako wynik działania algorytmów zaimplementowanych w danym systemie. Wiele metod tj. wykrywanie lub śledzenie obiektu są mocno powiązane ze zlokalizowaniem danego obiektu. Dlatego naniesienie tych informacji na plan budynku lub mapę miasta może ułatwić ich percepcję. W tym referacie metody wzbogacania danych kartograficznych są przedstawione. Zastosowanie takiego rozwiązania umożliwia połączenie dużej ilości istotnych danych na jednym ekranie.

Entry No. 422

Entry type journal paper

Authors A. Kupryjanow, A. Czyżewski

English title Improved method for real-time speech stretching

Polish title Udoskonalona metoda spowalniania mowy w czasie rzeczywistym

Journal Inteligent Decision Technologies

Volume 6

Number 2

Pages 177 - 185

Bibliographic No. 15

Abstract An algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal in real-time with high quality and provides “global” synchronization of the input and output signals. Finally, the effectiveness of the engineered algorithms as well as the quality of the processed speech are discussed.

Streszczenie W referacie przedstawiono algorytm przeznaczony do spowalniania sygnału mowy w czasie rzeczywistym. Został on opracowany taki sposób by modyfikował czas trwania nagrania zależnie od jego zawartości. Jest on połączenie algorytmów analizy sygnału takich jak detektor mowy, samogłosek, zająknięć, estymator tempa wypowiedzi oraz algorytmu modyfikacji czasu trwania sygnału opartego na metodzie SOLA (ang. Synchronous-Overlap-and-Add). Takie podejście umożliwia spowalnianie sygnału mowy w czasie rzeczywistym. W artykule przedstawiono wyniki oceny jakości mowy zmodyfikowanej za pomocą opracowanej motody.

Entry No. 423

Entry type book

Authors A. Czyżewski

English title Intelligent control of spectral subtraction algorithm for noise removal from audio

Polish title Inteligentne sterowanie algorytmem redukcji szumów w nagraniach fonicznych

Editor Springer

Pages

Notes rozdział w książce

Abstract In this paper ‘soft computing’ algorithms for audio signal restoration are considered in regard to a practical digital sound library application. The methods presented are designed to reduce empty channel noise, being applicable to the restoration of noisy audio recordings. The audio signal is processed iteratively by a noise-reduction algorithm based on an intelligent comparator, improving the signal-to-noise ratio slightly at each iteration. At each time step, a fuzzy reasoning algorithm processes two values representing spectral power density estimates considered as linguistic variables. We describe a comparator module based on a neural network which approximates the distribution representing a non-linear function of spectral power density estimates. We demonstrate experimentally that the methods examined may produce meaningful noise reduction results without degrading the original sound fidelity. They have been applied to a practical Internet-based sound library (http://www.youarchive.net).

Streszczenie W rozdziale przedstawiono zastosowania inteligentnych algorytmów przetwarzania dźwięku, mających na celu poprawę subiektywnej jakości brzmienia w zastosowaniach do bibliotek cyfrowych. Sygnał foniczny jest przetwarzany na drodze iteracyjnej redukcji szumu z zastosowaniem algorytmu opartego na inteligentnym komparatorze, który poprawia stosunek sygnału do szumu w każdej iteracji. Rozmyty algorytm decyzyjny przetwarza dwa parametry reprezentujące wartości widmowe gęstości mocy, traktując je jako zmienne lingwistyczne. Opisano moduł komparatora działającego z użyciem sztucznej sieci neuronowej, który wykorzystuje rozkład widmowej gęstości mocy. Wykazano eksperymentalnie, że badane metody dostarczają wymiernych rezultatów redukcji szumu, bez pogarszania wierności oryginalnego dźwięku. Zostały one zastosowane w praktycznym internetowym systemie rekostruowania nagrań archiwalnych, dostępnym pod adresem: http://www.youarchive.net.

Entry No. 424

Entry type conference paper

Authors A. Czyżewski

English title New Applications of Multimodal Human-Computer Interfaces

Polish title Nowe zastosowania interfejsów multimodalnych człowiek-komputer

Conference NTAV/SPA 2012 - 2012 Joint Conference New Trends in Audio & Video and Signal Processing: Algorithms, Architectures, Arrangements, and Applications

Preprint

Number

Volume

Pages 19 - 22

Conference site Łódź, Polska

Conference date 27.9.2012- 29.9.2012

Notes referat plenarny na zaproszenie organizatorów konferencji

Abstract Multimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness evaluation is demonstrated. The proposed method assumes analysis of visual activity of patients remaining in vegetative state. The scent emitting multimodal computer interface is an important supplement of the polysensoric stimulation process, playing an essential role in education and therapy of children with developmental disorders. A new approach to diagnosing Parkinson’s disease is shown. The progression of the disease can be measured by the UPDRS (Unified Parkinson Disease Rating Scale) scale which is used for evaluating motor and behavioral symptoms of Parkinson’s disease, employing the multimodal interface called Virtual-Touchpad (VTP) to support medical diagnosis. The paper is concluded with some general remarks concerning the role of multimodal computer interfaces applied to learning, therapy and everyday usage of computerized devices.

Streszczenie Zaprezentowano opracowane interfejsy komputerowe oraz przykłady ich zastosowań do oprogramowania edukacyjnego oraz dla osób niepełnosprawnych. Proponowane interfejsy obejmują interaktywną elektroniczą tablicę działającą na podstawie analizy obrazu wideo, aplikację do sterowania komputerami za pomocą gestów i interfejs audio dla mowy transponowanej czasowo dla niesłyszących i osób jąkających się. Zastosowanie systemu śledzenia wzroku do oceny świadomości zostało przedstawione w kontekście uzyskanych wyników eksperymentalnych.Zaproponowana metoda zakłada analizę aktywności wzrokowej pacjentów pozostających w stanie wegetatywnym. Emitujący zapach multimodalny interfejs komputerowy jest ważnym uzupełnieniem procesu stymulacji polisensorycznje, odgrywając istotną rolę w edukacji i terapii dzieci z zaburzeniami rozwojowymi. Zademonstrowano nowe podejście do diagnozowania choroby Parkinsona, w którym postęp choroby może być mierzony z wykorzystaniem skali UPDRS (Unified Parkinson Disease Rating Scale), która jest używana do oceny nasilenia objawów choroby Parkinsona, wykorzystując multimodalny interfejs o nazwie Virtual-Touchpad (VTP) do wspierania diagnozy medycznej.Podsumowaniem referatu są zawarte w nim ogólne uwagi, dotyczące roli multimodalnych interfejsów komputerowych w zastosowaniach w edukacji, terapii i codziennego użytkowania urządzeń komputerowych.

Entry No. 425

Entry type journal paper

Authors A. Kupryjanow, A. Czyżewski

English title Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit

Polish title Metoda wspomagania rozumienia mowy u osób z pogorszoną rozdzielczością czasową słuchu

Journal Diangnostic Pathology

Volume 7

Number 129

Pages 1 - 17

Notes Artykuł Open Access

Abstract Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based on the non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing impaired children and elderly listeners. It was shown that for the speech with average rate equal to or higher than 6.48 vowels/s, all of the proposed methods have statistically significant impact on the improvement of speech intelligibility for hearing impaired children with reduced hearing resolution and one of the proposed methods significantly improves comprehension of speech in the group of elderly listeners with reduced hearing resolution.

Streszczenie W referacie przedstawiono metody modyfikacji czasu trwania sygnału działające w czasie rzeczywistym. Zostały on oparte na nierównomiernym, zależnym od tempa mowy algorytmie SOLA (Synchronous Overlap and Add). Zbadano wpływ opracowanych metod na rozumienie mowy w dwóch grupach słuchaczy tj. dzieci głuchych oraz osób starszych. Zostało wykazane, że w przypadku mowy której tempo jest większe bądź równe 6,48 samogłosek/s wszystkie zaproponowane metody powodują statystycznie istotny wzrost jej rozumienia w grupie dzieci z obniżoną rozdzielczością czasową słuchu a jedna z opracowanych metod pozwoliła uzyskać statystycznie istotny wzrost rozumienia mowy u osób starszych z pogorszoną rozdzielczością czasową słuchu.

Entry No. 426

Entry type conference paper

Authors A. Kupryjanow, P. Suchomski, P. Odya, A. Czyzewski

English title System supporting speech perception in special educational needs schoolchildren

Polish title System wspomagający rozumienie mowy przez dzieci o specjalnych wymaganiach edukacyjnych

Conference ICCHP 2012

Preprint

Number 2

Volume 7383

Pages 133 - 136

Conference site Linz, Austria

Conference date 11.7.2012- 13.7.2012

Abstract The system supporting speech perception during the classes is presented in the paper. The system is a combination of portable device, which enables real-time speech stretching, with the workstation designed in order to perform hearing tests. System was designed to help children suffering from Central Auditory Processing Disorders.

Streszczenie W referacie przedstawiono system wspomagający rozumienie mowy przez dzieci biorące udział w zajęciach lekcyjnych. Jest on połączeniem mobilnego urządzenia spowalniającego mowę w czasie rzeczywistym oraz stacji roboczej pozwalającej na przeprowadzenie badania słuchu. Opracowany system został stworzony z myślą o dzieciach z zaburzeniami centralnego układu nerwowego.

Entry No. 427

Entry type journal paper

Authors A. Kupryjanow, A. Czyżewski

English title A Method of Real-Time Non-uniform Speech Stretching

Polish title Metoda nierównomiernego spowalniania sygnału mowy operująca w czasie rzeczywistym

Journal Communications in Computer and Information Science

Volume 314

Number

Pages 362 - 373

Abstract Developed method of real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by the adjustment of time scaling factor values in accordance with the signal content. Dependently on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor are selected. This provides as low as possible difference in the duration of the input and output signal and high naturalness and quality of the modified speech. In the experimental part of the paper accuracy of the proposed ROS estimator is examined. Quality of the speech stretched using the proposed method is assessed in the subjective tests.

Streszczenie W referacie przedstawiono algorytm nierównomiernej modyfikacji czasu trwania sygnału mowy działający w czasie rzeczywistym. Powstał on poprzez połączenie algorytmu SOLA (Synchronous Overlap and Add ) z detektorami samogłosek, spółgłosek i ciszy. Na podstawie informacji dotyczących zawartości analizowanego sygnału oraz estymowanego tempa mowy, proponowany algorytm dopasowuje wartości współczynnika skali. W części eksperymentalnej referatu zbadano możliwość przetwarzania sygnału w czasie rzeczywistym oraz jakość spowolnionego sygnału mowy. Jakości spowolnionej mowy została oceniona poprzez porównanie wyników testów subiektywnych przeprowadzonych dla mowy modyfikowanej z wykorzystanie proponowanego algorytmu oraz algorytmu SOLA. Zbadano także skuteczność algorytmu estymacji tempa mowy co pozwoliło wykazać jego odporność.

Entry No. 428

Entry type conference paper

Authors A. Kupryjanow, A. Czyżewski

English title System for Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit

Polish title System wspomagający rozumienie mowy przez osoby z pogorszoną rozdzielczością czasową słuchu

Conference 6th International Forum on Innovative Technologies for Medicine ITMED 2012

Preprint

Number

Volume

Pages 18 - 19

Conference site Białystok, Polska

Conference date 21.11.2012- 23.11.2012

Bibliographic No. 7

Abstract We have developed system which is devoted to the people with the hearing resolution deficit (they could not understand fast spoken speech). System was designed based on the assumption that Time Scale Modification (TSM) namely time stretching of the speech signal could improve the speech intelligibility. Therefore, algorithm which could stretches speech signal captured by the wireless microphones and reproduce it on the headphone was designed.

Streszczenie Opracowano system przeznaczony dla osób z pogorszoną rozdzielczością czasową słuchu (osoby takie nie rozumieją mowy wypowiadanej w szybkim tempie). System został oparty na założeniu, że modyfikacja tempa mowy (spowalnianie) pozwoli zwiększyć rozumienie wypowiedzi u tej grupy osób. W tym celu opracowano algorytm umożliwiający spowalnianie w czasie rzeczywistym mowy rejestrowane przez mikrofon.

Entry No. 429

Entry type journal paper

Authors J. Kotus, K. Łopatka, A. Czyżewski

English title Detection and localization of selected acoustic events in acoustic field for smart surveillance applications

Polish title Detekcja i lokalizacja wybranych zdarzeń akustycznych w polu akustycznym dla zastosowań w systemach monitorowania bezpieczeństwa

Journal Multimedia Tools and Applications

Volume

Number

Pages

Notes opublikowano online http://www.springerlink.com/content/b430j4820v573421/

Abstract A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The evens are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals from the multichannel acoustic vector probe is performed upon the detection. The described algorithms can be employed in surveillance systems to monitor behavior of public events participants. The results can be used to detect sound source position in real time or to calculate the spatial distribution of sound energy in the environment. Moreover, the spatial filtration can be performed to separate sounds arriving from a chosen direction.

Streszczenie W artykule przedstawiono metodę automatycznego określania położenia źródła dźwięku w przestrzeni 3D w oparciu o analizę wybranych zdarzeń akustycznych takich jak mowa lub sygnały o charakterze impulsowym. Źródło dźwięku jest lokalizowane w rzeczywistych warunkach akustycznych (liczne odbicia) w oparciu o analizę zdarzenia akustycznego za pomocą wektorowego czujnika akustycznego. Głos człowieka i impulsowe dźwięki są wykrywane za pomocą adaptacyjnie przestrajanych detektorów, których działanie opiera się na analizie różnić w pikach widma oraz poziomu dźwięku. Moduł lokalizacji źródła dźwięku działa w oparciu o analizę wielokanałowego sygnału pozyskanego za pomocą czujnika wektorowego. Jedynie fragmenty z wykrytym zdarzeniem akustycznym są analizowane pod kątem określenia pozycji źródła dźwięku. Przedstawione algorytmy mogą znaleźć zastosowanie w systemach monitorowania bezpieczeństwa podczas imprez masowych. Wyniki działania opracowanych algorytmów mogą być wykorzystywane do wykrywania pozycji źródła dźwięku w czasie rzeczywistym lub do obliczania przestrzennego rozkładu energii dźwięku w środowisku. Ponadto, możliwe jest wyodrębnienie dźwięku dobiegającego z wybranego kierunku w oparciu o filtrację przestrzenną.

Entry No. 430

Entry type conference paper

Authors K. Łopatka, A. Czyżewski

English title Automatic Regular Voice, Raised Voice, and Scream Recognition Employing Fuzzy Logic

Polish title Automatyczne rozpoznawanie zwykłego głosu, podniesionego głosu i krzyku z wykorzystaniem logiki rozmytej

Conference 132nd AES Convention

Preprint 8636

Number

Volume

Pages

Conference site Budapeszt, Węgry

Conference date 26.4.2012- 29.4.2012

Abstract A method of automatic recognition of regular voice, raised voice, and scream used in an audio surveillance system is presented. The algorithm for detection of voice activity in a noisy environment is discussed. Signal features used for sound classification, based on energy, spectral shape, and tonality are introduced. Sound feature vectors are processed by a fuzzy classifier. The method is employed in an audio surveillance system working in real-time both in an indoor and outdoor environment. Achieved results of classifying real signals are presented and discussed.

Streszczenie Przedstawiono metodę automatycznego rozpoznawania zwykłego głosu, podniesionego głosu i krzyku wykorzystywaną w fonicznym systemie nadzoru bezpieczeństwa. Omówiono algorytm detekcji aktywności głosowej w obecności zakłóceń. Przedstawiono parametry wykorzystywane w klasyfikacji sygnałów fonicznych oparte na energii, funkcji widma gęstości mocy i tonalności. Do rozróżnienia sygnałów zastosowano klasyfikator oparty na logice rozmytej. Metodę zaimplementowano w systemie nadzoru bezpieczeństwa pracującym wewnątrz i na zewnątrz budynków. Działanie algorytmu sprawdzono na przykładzie rzeczywistych sygnałów.

Entry No. 431

Entry type conference paper

Authors A. Ciarkowski, A. Czyżewski

English title Open Standards-based Communication Subsystem for Distributed Intelligent Surveillance Solution

Polish title Oparty na otwarych standardach podsystem komunikacji dla rozproszonego systemu inteligentnego monitoringu

Conference 24th International Conference on Advanced Information Systems Engineering (CAiSE'12)

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 25.6.2012- 26.6.2012

Notes w recenzjach, informacje bibliograficzne zostaną uzupełnione po wydaniu proceedings

Abstract The paper focuses on the design and implementation of de- veloped open standards-based communication subsystem providing an element of distributed surveillance solution. The paradigm of “intelligent”surveillance approach is introduced. Requirements analysis toward the design of communication subsystem architecture is presented. The application of XMPP protocol to the stated problem is described. Special attention is paid to the multimedia streaming functionality of presented solution, which is based on XMPP/Jingle extension; issues related to NAT and firewall traversal of multimedia streams in the open Internet are discussed.

Streszczenie Dokument koncentruje się na projekcie i wdrożeniu opartego na otwartych standardach podsystemu komunikacji będącego elementem rozproszonego systemu monitoringu. Przedstawiono paradygmat "inteligentnego" monitoringu. Dokonano analizy wymagań wobec projektu architektury podsystemu komunikacji. Omówiono zastosowanie protokołu XMPP do przedstawionego problemu. Szczególną uwagę zwrócono na funkcjonalność streamingu multimediów, która jest oparta na protokołach XMPP/Jingle; omówiono kwestie związane z trawersacją urządzeń typu NAT i firewall przez strumienie multimedialne w otwartej sieci Internet.

Entry No. 432

Entry type

Authors A. Czyżewski, B. Kostek, R. Rybacki

English title The Manner of Ranging Items on the Computer Monitor Screen Surface, Especially Keywords for the Requirements of WEB Browsers Users

Polish title Sposób wartościowania obiektów na powierzchni ekranu monitora komputerowego, zwłaszcza słów-kluczy dla potrzeb użytkowników przeglądarki internetowej

Notes zgłoszenie patentowe w USA do zgł. nr P. 395764 w Polsce

Entry No. 433

Entry type conference paper

Authors A. Czyżewski, J. Kotus, M. Szczodrak, B. Kostek

English title System for creating maps of noise threatening hearing with grid computing on supercomputing platforms

Polish title System do tworzenia map zagrożeń hałasem z zastosowaniem obliczeń gridowych na platformach superkomputerowych

Conference VI Międzynarodowa Warszawska Wystawa Wynalazków IWIS 2012

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 16.10.2012- 19.10.2012

Abstract Engineered system allows for obtaining level and influence on hearing of environmental noise in urban areas. The outcome is presented in dedicated website. Moreover developed solutions can provide a base for scientific research. Grid computing allows for obtaining free of charge access (given as a grant) to the computational resources, therefore researchers interested in the problem can conduct own experiments without need of purchasing specialized software.

Streszczenie Opracowany system pozwala na określenie poziomu oraz wpływu na słuch hałasu występującego w środowisku aglomeracji miejskich. Wyniki przedstawiane są poprzez serwis internetowy. Ponadto opracowane rozwiązania mogą służyć prowadzeniu eksperymentów badawczych. Zastosowanie obliczeń gridowych, umożliwiających bezpłatny dostęp do zasobów na podstawie grantów, pozwala na wykonywanie symulacji przez badaczy, bez konieczności zakupu specjalistycznego oprogramowania.

Entry No. 434

Entry type journal paper

Authors M. Szczodrak, J. Kotus, B. Kostek, A. Czyżewski

English title Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure

Polish title

Journal Archives of Acoustics

Volume 38

Number 2

Pages 235 - 242

Abstract The paper presents functionality and operation results of a system for creating dynamic maps of acoustic noise employing the PL-Grid infrastructure extended with a distributed sensor network. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project for measuring, modeling and rendering data related to noise level distribution in city agglomerations. Specific computational environments, the so-called domain grids, are developed in the mentioned project. For particular domain grids, specialized IT solutions are prepared, i.e. software implementation and hardware (infrastructure adaptation), dedicated for particular researcher groups demands, including acoustics (the domain grid “Acoustics”). The infrastructure and the software developed can be utilized mainly for research and education purposes, however it can also help in urban planning. The engineered software is intended for creating maps of noise threat for road, railways and industrial sources. Integration of the software services with the distributed sensor network enables automatic updating noise maps for a specific time period. The unique feature of the developed software is a possibility of evaluating auditory effects which are caused by the exposure to excessive noise. The estimation of auditory effects is based on calculated noise levels in a given exposure period. The outcomes of this research study are presented in a form of the cumulative noise dose and the characteristics of the temporary threshold shift.

Entry No. 435

Entry type journal paper

Authors P. Dalka, A. Ciarkowski, P. Szczuko, A. Czyżewski

English title Open standards-based communication system for distributed intelligent surveillance solution

Polish title System komunikacyjny bazujący na otwartych standardach przeznaczony do rozproszonych, inteligentnych systemów monitoringu

Journal Przegląd Telekomunikacyjny

Volume

Number 8-9

Pages 777 - 785

Bibliographic No. 13

Notes XXIX Krajowe Sympozjum Telekomunikacji i Teleinformatyki

Abstract The paper presents an open standards-based communication system being a part of a distributed surveillance solution. The paradigm of “intelligent” surveillance approach is introduced, and employed video processing is discussed briefly. Requirements analysis toward the design of communication subsystem architecture is presented. Special attention is paid to the multimedia streaming functionality of presented solution, which is based on XMPP/Jingle extension. Results of long-term exploitation of the system are commented.

Streszczenie Artykuł prezentuje system komunikacyjny bazujący na otwartych standardach, który stanowi część rozproszonego systemu monitoringu. Wprowadzono pojęcie "inteligentnego" monitoringu i krótko przedstawiono wykorzystywane metody przetwarzania obrazu. Przedstawiono analizę wymagań na potrzeby projektowania architektury podsystemu komunikacyjnego systemu. Szczególny nacisk położono do zaimplementowanej w prezentowanym rozwiązaniu funkcjonalności strumieniowania multimediów, która bazuje na rozszerzeniach XMPP/Jingle. Przedstawiono i skomentowano wyniki długo-okresowej eksploatacji systemu.

Entry No. 436

Entry type conference paper

Authors K. Łopatka, A. Czyżewski

English title Measurements of acoustic crosstalk cancellation efficiency in mobile listening conditions

Polish title Pomiary skuteczności usuwania akustycznego przesłuchu międzykanałowego w mobilnych warunkach odsłuchu

Conference IEEE Conf. on Signal Processing Algorithms, Architectures, Arrangements and Applications

Preprint

Number

Volume

Pages 215 - 219

Conference site Poznań, Polska

Conference date 26.9.2013- 29.9.2013

Abstract The cancellation of acoustic crosstalk is employed to enhance the stereo image in mobile listening conditions. The implementation of the crosstalk cancellation algorithm in Matlab is introduced. The measurement signals and equipment are described. A practical setup employing a mobile computer and a head and torso simulator is employed. The results of the measurements provided conclusions regarding the employment of acoustic crosstalk cancellation in mobile computers.

Streszczenie Przedstawiono wykorzystanie usuwania akustycznego przesłuchu międzykanałowego dla poprawy obrazowania stereofonicznego w mobilnych warunkach odsłuchu. Opisano implementację algorytmu usuwania przesłuchu w środowisku Matlab. Opisano wykorzystane sygnały pomiarowe oraz metodologię przeprowadzenia eksperymentu. Wykorzystano sztuczną głowę oraz typowy komputer przenośny w celu symulacji praktycznych warunków. Wyniki pomiarów pozwalają na wyciągniecie wniosków dotyczących możliwości wykorzystania algorytmów usuwania przesłuchu akustycznego w mobilnych warunkach.

Entry No. 437

Entry type conference paper

Authors T. Sanner, A. Czyżewski

English title Experimental Analysis of Connection Between Object-Oriented Metrics and Software Changeability

Polish title

Conference ICTYoung 2013

Preprint

Number

Volume

Pages 133 - 139

Conference site Gdańsk, Polska

Conference date 24.5.2013- 25.5.2013

Notes ISBN 978-83-60779-21-7

Abstract For the purpose of video surveillance software quality assessment in this work the ISO/IEC-9126 norm was used with a particular focus on maintainability of the software system. The paper presents a study on the connection between software metrics derived from the static analysis of the source code and changeability of the video surveillance software system. It is shown that meeting requirements of software quality metrics may result in reducing the time needed to introduce changes to the software demanded because of some changes in environmental requirements and specifications.

Entry No. 438

Entry type conference paper

Authors T. Sanner, A. Czyżewski

English title Steady State Visually Evoked Potentials for Brain Computer Interface

Polish title

Conference ICTYoung2013

Preprint

Number

Volume

Pages 279 - 286

Conference site Gdańsk, Polska

Conference date 24.5.2013- 25.5.2013

Notes ISBN 978-83-60779-21-7

Abstract An experiment conducted to validate a possibility of use a single active electrode EEG device for detecting Steady State Visually Evoked Potentials (SSVEP) is shown. A LED stimulator was applied to stimulate patients with two different frequencies - 13 Hz and 17 Hz. First, EEG signals were recorded and pre-processed using MATLAB software. In the next step recordings were analysed and classified employing the WEKA software. As indicated by the results, there is a possibility to use the examined device as a Brain Computer Interface (BCI) based on the SSVEP paradigm.

Entry No. 439

Entry type conference paper

Authors T. Sanner, K. Łopatka, A. Czyżewski

English title Evaluation of Sound Enhancement in Mobile Device using Virtual Bass Algorithm

Polish title

Conference ISSET 2013

Preprint

Number

Volume

Pages 1 - 12

Conference site Kraków, Polska

Conference date 27.6.2013- 29.6.2013

Abstract An experiment conducted to validate possibility of use virtual bass synthesis (VBS) algorithm in a portable computer is presented. The subjective listening tests based on the procedure of pairwise comparison between VBS, based on the so-called missing fundamental phenomenon, and standard bass boost technique are employed. The evaluation was carried out in two types of conditions: in a professional listening room and employing an ultrabook to play back the sounds. As it is indicated by the results, the proposed technique proved the possibility of rendering bass-related components in audio signals in a better way than the standard bass boost technique.

Entry No. 440

Entry type conference paper

Authors B. Kunka, A. Czyżewski, A. Kwiatkowska

English title Interaction with post-comatose patients employing video-based eye-gaze tracking system

Polish title Interakcja z pacjentemi wybudzonymi ze śpiączki z zaburzeniami świadomości wykorzystująca system śledzenia wzroku

Conference XVIII Krajowa Konferencja Biocybernetyki i Inżynierii Biomedycznej

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 10.10.2013- 12.10.2013

Bibliographic No. 5

Notes mat. konferencyjne w wersji elektronicznej (pendrive)

Abstract Post-comatose patients who did not recover to full consciousness are often incorrectly regarded as patients in vegetative state. In order to verify this assumption we have employed a video-based eye-gaze tracking system to diagnosis and therapy process of those people. Usually, physically handicapped patients can move their eyes. Thus, patients who retained their awareness to some extent, especially locked-in syndrome and minimally conscious patients, may interact with environment by gazing. In the paper we present two programming tools – the pictograms application and the virtual keyboard – developed for those patients. Moreover, first observations related to interaction with post-comatose patients utilizing the eye-gaze tracking computer interface are summarized.

Streszczenie Pacjenci, którzy wybudzili się ze śpiączki, ale nie odzyskali pełnej świadomości są często uważani za osoby w stanie wegetatywnym. W celu zweryfikowania tego założenia przeprowadziliśmy badania z udziałem takich pacjentów wykorzystując system śledzenia wzroku. Pacjenci unieruchomieni, tak jak np. pacjenci w tzw. stanie wegetatywnym, zazwyczaj mogą poruszać oczami. Zatem pacjenci, którzy zachowali świadomość, w szczególności pacjenci w zespole zamknięcia oraz w stanie minimalnej świadomości, mogą nawiązywać kontakt z otoczeniem za pomocą wzroku. W referacie przedstawiono dwie aplikacje komputerowe – program z piktogramami oraz wirtualną klawiaturę – opracowane dla takich pacjentów. Ponadto, przedstawiono także wnioski z pierwszych obserwacji związanych z komunikacją z pacjentami za pomocą systemu śledzenia wzroku.

Entry No. 441

Entry type journal paper

Authors A. Czyżewski, K. Lisowski

English title Adaptive Method of Adjusting Flowgraph for Route Reconstruction in Video Surveillance Systems

Polish title

Journal Fundamenta Informaticae

Volume 127

Number

Pages 561 - 576

Abstract Pawlak’s ﬂowgraph has been applied as a suitable data structure for description and anal- ysis of human behaviour in the area supervised with multicamera video surveillance system. Infor- mation contained in the ﬂowgraph can be easily used to predict consecutive movements of a partic- ular object. Moreover, utilization of the ﬂowgraph can support reconstructing object route from the past video images. However, such a ﬂowgraph with its accumulative nature needs a certain period of time for adaptation to changes in behaviour of objects which can be caused, e.g. by closing a door or placing other obstacle forcing people to pass it by. In this paper a method for reduction of time needed for ﬂowgraph adaptation is presented. Additionally, distance measure between ﬂowgraphs is also introduced in order to determine if carrying out the adaptation process is needed.

Entry No. 442

Entry type report

Authors A. Czyżewski, J. Cichowski, W. Moskwa, B. Hermanowicz, M. Lech

English title H/W High Level Design: Generator of Ultrasonic Haptic Feedback

Polish title Projekt Warstwy Wysokiego Poziomu: Generator Ultradźwiękowego Sprzężenia Zwrotnego

Report Number SAMS05/03/13

Notes Projekt: Innotech 020666 (Samsung)

Abstract A technology described in this document presents a new type of HCI. The SGRS system will enable users to control mobile devices without touching their surface, e. g. keyboards. According to project assumptions, gestures executed in front of mobile devices are detected and classified employing advanced background segmentation algorithms. Gestures interpretation and embedded artificial intelligence translate gestures into specific actions realized on mobile devices, such as cursor movement, button click etc.

Entry No. 443

Entry type journal paper

Authors J. Cichowski, P. Czyżyk, B. Kostek, A. Czyżewski

English title Low-Level Music Feature Vectors Embedded as Watermarks

Polish title

Journal Intelligent Tools for Building a Scientific Information Platform

Volume 467

Number

Pages 453 - 473

Notes http://link.springer.com/chapter/10.1007%2F978-3-642-35647-6_27

Abstract In this paper a method consisting in embedding low-level music feature vectors as watermarks into a musical signal is proposed. First, a review of some recent watermarking techniques and the main goals of development of digital watermarking research are provided. Then, a short overview of parameterization employed in the area of Music Information Retrieval is given. A methodology of non-blind watermarking applied to music-content description is presented. The system architecture for the embedding and recovery of the watermarks, along with the algorithms implemented, are described. The robustness of the watermark implemented is tested against audio file processing, such as re-sampling, filtration, time warping, cropping and lossy compression. Procedures for simulating musical signal alteration are explained with a focus on the influence of lossy compression on the degradation of the embedded watermark. The advantages and disadvantages of the proposed approach are discussed. An outline of future applications of the methodology introduced is also included.

Entry No. 444

Entry type journal paper

Authors J. Cichowski, A. Czyżewski, B. Kostek

English title Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

Polish title

Journal MULTIMEDIA TOOLS AND APPLICATIONS

Volume

Number

Pages 1 - 21

Notes http://link.springer.com/article/10.1007%2Fs11042-013-1636-0

Abstract The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed, and the robustness of the proposed approach is evaluated in the context of the influence of lossy compression on the watermark degradation. Subjective and objective analyses are performed for the algorithm proposed by the authors and compared with the Audio Watermarking Tools (AWT) encoder. Finally, the advantages and drawbacks of the proposed approach are debated followed by the conclusion section outlining possible improvements to the proposed method.

Entry No. 445

Entry type conference paper

Authors J. Cichowski, A. Czyżewski, B. Kostek

English title Visual Data Encryption for Privacy Enhancement in Surveillance Systems

Polish title

Conference Advanced Concepts for Intelligent Vision Systems,

Preprint

Number

Volume 8192

Pages 13 - 24

Conference site Poznań, Polska

Conference date 28.10.2013- 31.10.2013

Notes http://link.springer.com/chapter/10.1007/978-3-319-02895-8_2

Abstract In this paper a methodology for employing reversible visual encryption of data is proposed. The developed algorithms are focused on privacy enhancement in distributed surveillance architectures. First, motivation of the study performed and a short review of preexisting methods of privacy enhancement are presented. The algorithmic background, system architecture along with a solution for anonymization of sensitive regions of interest are described. An analysis of efficiency of the developed encryption approach with respect to visual stream resolution and the number of protected objects is performed. Experimental procedures related to stream processing on a single core, single node and multiple nodes of the supercomputer platform are also provided. The obtained results are presented and discussed. Moreover, possible future improvements of the methodology are suggested.

Entry No. 446

Entry type conference paper

Authors B. Kostek, J. Cichowski, A. Czyżewski

English title Testing Watermark Robustness against Application of Audio Restoration Algorithms

Polish title

Conference 135th International Audio Engineering Society Convention

Preprint 126

Number

Volume

Pages

Conference site New York, USA

Conference date 17.10.2013- 20.10.2013

Notes http://www.aes.org/e-lib/browse.cfm?elib=16961

Abstract The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features embedded as watermarks using the non-blind approach. After applying restoration algorithms, the watermark is extracted from the audio track. It was shown in experiments, that a watermark “attacked” by the restoration procedures may still be detected. However in some cases it is possible to retrieve only a binary information about the watermark presence in the audio carrier.

Entry No. 447

Entry type conference paper

Authors J. Kotus, K. Łopatka, A. Czyżewski, G. Bogdanis

English title Audio-visual surveillance system for application in bank operating room

Polish title Foniczno-wizyjny system nadzoru sali operacyjnej w banku

Conference 6th International Conference on Multimedia, Communications, Services and Security

Preprint

Number

Volume

Pages 107 - 120

Conference site

Conference date 6.6.2013- 7.6.2013

Abstract n audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic events. The methods for calculating the direction of coming sound employing an acoustic vector sensor are presented. The localization is achieved by calculating the DOA (Direction of Arrival) histogram. The evaluation of the system based on experiments conducted in a real bank operating room is given. Results of sound event detection, classification and localization are given and discussed. The system proves efficient for the task of automatic surveillance of the bank operating room

Streszczenie Przedstawiono system foniczno-wizyjnego nadzoru sali bankowej zdolny do detekcji i lokalizacji zdarzeń dźwiękowych w sali operacyjnej. Opisano algorytmy detekcji i klasyfikacji niepokojących zdarzeń dźwiękowych takich jak krzyki i wystrzały z broni. Wykorzystano dwa typy detektorów: do wykrywania zdarzeń impulsowych oraz aktywności głosowej. Zaimplementowano klasyfikator SVM w celu odróżnienia od siebie poszczególnych klas niebezpiecznych zdarzeń. Lokalizacja źródła dźwięku wyznaczana jest z pomocą akustycznego czujnika wektorowego. Do obliczenia lokalizacji wykorzystano histogram kierunków nadejścia fali dźwiękowej (DoA). Przedstawiono wyniki oceny skuteczności systemu w rzeczywistym środowisku sali bankowej. System osiąga wystarczającą skutecznosć do zadania automatycznego nadzoru bezpieczeństwa w sali bankowej.

Entry No. 448

Entry type report

Authors A. Czyżewski, J. Cichowski, W. Moskwa, B. Hermanowicz

English title H/W Low Level Design, Generator of Ultrasonic Haptic Feedback

Polish title Projekt konstrukcyjny urządzenia do wytwarzania skoncentrowanej wiązki ultradźwiękowej

Report Number INNOTECH-SAMS12/08/1

Notes Projekt: Innotech 020666 (Samsung)

Abstract This document summarizes the research carried out since 2013-04-01 till 2013-08-31. It is deliverable of the task 4. The main objective of the mentioned task was the design of the device to produce a concentrated beam of ultrasound. The following sections present the electrical diagrams for device’s subcomponents. The list and functionality of the specific parts is briefly presented in the Section entitled functional block diagram.

Entry No. 449

Entry type journal paper

Authors A. Ciarkowski, A. Czyżewski

English title

Polish title System komunikacji operacyjnej i dostępu do strumieni multimedialnych dla terminali mobilnych

Journal Nowoczesne systemy łaczności i transmisji danych na rzecz bezpieczeństwa. Szanse i zagrożenia

Volume

Number

Pages 328 - 341

Notes red. A. R. Pach, Z. Rau, M. Wągrowski; wyd. Wolters Kluwer Polska

Streszczenie Przedstawiono opracowany system komunikacji multimedialnej zoptymalizowany pod kątem jego wykorzystania w warunkach operacyjnych przez służby odpowiedzialne za ochronę obiektów i bezpieczeństwo. Szczególną uwagę poświęcono funkcjonalności bezprzewodowego dostępu do strumieni multimedialnych pochodzących z kamer systemu „inteligentnego monitoringu”. Przeanalizowano wymagania i omówiono założenia, na których opiera się projekt tego systemu. Zaproponowano wykorzystanie urządzeń klasy PDA w roli mobilnych terminali komunikacyjnych. Przedstawiono ideę wykorzystania protokołu XMPP jako medium komunikacji sygnalizacyjnej. Przedyskutowano zagadnienie transmisji multimediów w czasie rzeczywistym z wykorzystaniem rozszerzenia Jingle/XMPP. Zwrócono także uwagę na techniczne aspekty związane z nawiązaniem sesji komunikacji multimedialnej w obecności urządzeń pośredniczących typowych dla sieci rozproszonych. Omówiono zagadnienie transmisji metadanych charakterystycznych dla strumieni wizyjnych i fonicznych pochodzących ze stacji systemu monitoringowego.

Entry No. 450

Entry type conference paper

Authors Ł. Kosikowski, A. Czyżewski

English title Vision screening using mobile devices

Polish title Przesiewowe badania wzroku z wykorzystaniem urządzeń mobilnych

Conference XVIII Krajowa Konferencja Biocybernetyka i Inżynieria Biomedyczna

Preprint

Number

Volume

Pages 157 - 157

Conference site Gdańsk, Polska

Conference date 10.10.2013- 12.10.2013

Abstract This paper describes the application to vision screening. The application has been designed in such a way that each user can examine yourself. A negative test result indicates the need to perform a detailed eye examination by an ophthalmologist or optometrist. Choosing a platform related to the quality and repeatability of devices allowed for the preparation of an application that provides adequate reliability of the results, in particular, sufficient for screening - without the need for user calibration.

Streszczenie W referacie przedstawiono opis aplikacji do przesiewowego badania wzroku. Aplikacja została opracowana w taki sposób, aby każdy użytkownik mógł wykonać badanie samodzielnie. Negatywny wynik badania oznacza konieczność wykonania szczegółowego badania wzroku w gabinecie okulistycznym lub u optometrysty. Wybór platformy związany z jakością i powtarzalnością urządzeń pozwolił na przygotowanie aplikacji zapewniającej odpowiednią wiarygodność wyników, w szczególności wystarczającą do badania przesiewowego - bez konieczności wykonywania kalibracji przez użytkownika.

Entry No. 451

Entry type book

Authors H. Krawczyk, A. Czyżewski, A. Ciarkowski, P. Bratoszewski, J. Cichowski, D. Ellwart

English title

Polish title KASKBOOK - Platformy Przetwarzania i Aplikacje Multimedialne - Aplikacja Rozpoznawania Obiektów i Zdarzeń

Editor KASK/WETI PG

Pages 1 - 219

Notes K. Kopaczewski, A. Korzeniewski, J. Kotus, K. Lisowski, K. Łopatka, T. Sanner, M. Szczodrak, P. Szczuko, G. Szwoch

Entry No. 452

Entry type journal paper

Authors A. Czyżewski, P. Dalka, Ł. Kosikowski, B. Kunka, P. Odya

English title Multimodal human-computer interfaces based on advanced video and audio analysis

Polish title Multimodalne interfejsy człowiek-komputer bazujące na zaawansowanej analizie obrazu i dźwięku

Journal Advances in Soft Computing

Volume

Number

Pages

Notes w druku

Abstract Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for disabled people are presented. The LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is shown. The progression of the disease can be measured employing the UPDRS (Unified Parkinson Disease Rating Scale) scale which is used to evaluate motor and behavioral symptoms of the Parkinson’s disease, based on the multimodal interface called Virtual-Touchpad (VTP) used for supporting medical diagnosis. The scent emitting mul-timodal computer interface provides an important supplement of the polysensoric stimulation process, playing an essential role in education and therapy of children with certain developmental disorders. The Smart Pen providing a tool for support-ing therapy of developmental dyslexia is presented and results achieved with its application are discussed. The eye-gaze tracking system named “Cyber Eye” de-veloped at the Multimedia Systems Department employed to many kinds of experiments is presented including analysis of visual activity of patients remaining in vegetative state and their awareness evaluation. The paper is con-cluded with some general remarks concerning the role of multimodal computer in-terfaces applied to learning, therapy and everyday usage of computerized devic-es.

Streszczenie W artykule zawarto przykłady zastosowań interfejsów multimodalnych w edukacji a także w terapii osób z różnego typu niepełnosprawnościami.

Entry No. 453

Entry type report

Authors J. Kotus, M. Szczodrak, B. Kostek, A. Czyżewski

English title Acoustics - new services for urban planning, research and education

Polish title

Report Number

Notes http://www.plgrid.pl/projekty/plus/materialy_promocyjne/broszury/pliki/Broszura_Acoustics_PLGridPlus

Abstract The main purpose of the presented design is twofold, namely: providing detailed information about the noise threats that occur every day in city areas and preventing the noise induced hearing loss especially among young people. An experimental system designed for the continuous monitoring of the acoustic climate of urban areas was developed and implemented within the PLGrid Plus project. The assessment of environmental threats is performed based on online data, acquired through a grid of engineered monitoring stations, employing some selected psychoacoustic properties of the human hearing system. Another aim is to make available efficient computational tools for the community of acousticians engaged in the noise threat combating.

Entry No. 454

Entry type report

Authors P. Odya, P. Szczuko, A. Czyżewski, K. Lisowski, J. Kotus, A. Ciarkowski

English title The Innovative Faculty for Innovative Technologies

Polish title

Report Number

Notes publikacja online: http://www.eti.pg.gda.pl/wydzial/dz_naukowo_badawcza/InnovativeTechnologies.pdf

Abstract A leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar, "Sound recognition service" - a super-computer service able to detect, classify and localize threatening acoustic events, System for creating dynamic maps of noise threats employing grid computing, System supporting speech perception for special educational needs of schoolchildren, Video event recognition system with enhanced privacy protection, Virtual Whiteboard

Entry No. 455

Entry type conference paper

Authors J. Kotus, P. Szczuko, P. Dalka, G. Szwoch, M. Szczodrak, A. Czyżewski

English title Remote audio-visual observation system

Polish title System zdalnej obserwacji akustyczno-wizyjnej

Conference EUROPOLTECH 2013

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 17.4.2013- 19.4.2013

Notes Dodatkowe osoby biorące udział w targach: A. Korzeniewski

Abstract Remote audio-visual observation system allows for discrete sound field analysis for detection, classification, localization and concurrent tracking of multiple sound sources. It comprises a new type of multichannel miniature acoustic vector sensors, and digital signal processing algorithms. Combined with fixed and pan-tilt-zoom cameras it allows for: positioning PTZ camera at the detected sound source, indicating the source in the video stream, simultaneous monitoring of multiple directions and multiple sources. This system can act online (observation, archiving) and offline (audio forensics).

Streszczenie System zdalnej obserwacji akustyczno-wizyjnej umożliwia niejawną analizę pola akustycznego dla celów detekcji, klasyfikacji, lokalizacji i jednoczesnego śledzenia ruchu wielu źródeł dźwięku. Składa się z nowego rodzaju wielokanałowych, miniaturowych wektorowych czujników akustycznych oraz algorytmów cyfrowego przetwarzania sygnałów. W połączeniu z zestawem kamer stacjonarnych i obrotowych umożliwia: nakierowanie kamery obrotowej na wykryte źródło dźwięku, wskazanie źródła dźwięku w obrazie z kamery tradycyjnej lub termowizyjnej, odsłuch dźwięków z wybranych kierunków. Urządzenie może działać w trybie online (obserwacja i rejestracja) lub offline (rekonstrukcja zdarzenia).

Entry No. 456

Entry type conference paper

Authors M. Szczodrak, J. Kotus, B. Kostek, A. Czyżewski

English title Creating dynamic maps of noise threat using pl-grid infrastructure

Polish title

Conference Noise Control 2013

Preprint

Number

Volume

Pages

Conference site Ryn, Polska

Conference date 26.5.2013- 29.5.2013

Abstract This paper presents functionality and operation results of the system for creating dynamic maps of noise thread with the use of the PL-Grid infrastructure integrated with distributed sensors network for measuring, modeling and rendering noise level distribution. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project. Specific computational environments, so called domain grids, are developed in the mentioned project. For particular domain grids, specialized IT solutions are prepared, i.e. dedicated software implementation, and hardware (infrastructure adaptation), suited for particular researchers’ groups demands, including acoustics (domain grid “Acoustics”). The infrastructure and the software developed can be utilized mainly for research and education purposes. The engineered software is intended for creating maps of noise threat for road, railways and industrial sources. Integration of the software service with distributed sensor network enables to automatically update noise maps for a specific time period. The unique feature of the developed software is a possibility to estimate auditory effects which are caused by the exposure to noise. The estimation of auditory effects is based on calculated noise levels and on a given exposure period. The outcomes of this research study are presented in a form of the cumulative noise dose and characteristics of the temporary threshold shift.

Entry No. 457

Entry type conference paper

Authors J. Kitowski, P. Bała, M. Borcz, A. Czyżewski, Ł. Dutka, J. Kotus

English title Development of Domain-Specific Solutions within the Polish Infrastructure for Advanced Scientific Research

Polish title

Conference 10th International Conference on Parallel Processing & Applied Mathematics

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 8.9.2013- 11.9.2013

Notes Pozostali autorzy: R. Kluszczynski, P. Kustra, N. Meyer, A. Milenin, Z. Mosurska, R. Pajak, Ł. Rauch, M. Sterzel, D. Stokłosa, T. Szepieniec

Abstract The Polish Grid computing infrastructure was established during the PL-Grid project (2009-2012). The main purpose of this Project was to provide the Polish scientists with an IT basic platform, allowing them to conduct interdisciplinary research on a national scale, and giving them transparent access to international grid resources via international grid infrastructures. Currently, the infrastructure is maintained and extended within a follow-up PLGrid Plus project (2011-2014). Its main objective is to increase the potential of the Polish Science by providing necessary IT services for research teams in Poland, in line with European solutions. The paper presents several examples of the domain-specific computational environments, developed within the Project. For particular environments, specialized IT solutions are prepared, i.e. dedicated software implementation and infrastructure adaptation, suited for particular researchers groups’ demands.

Entry No. 458

Entry type journal paper

Authors M. Lech, B. Kostek, A. Czyżewski

English title Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

Polish title

Journal Multimedia and Internet Systems: Theory and Practice; Advances in Intelligent Systems and Computing

Volume 183

Number

Pages 77 - 86

Notes Springer Berlin Heidelberg

Abstract The main objective of the paper is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system based on a camera and a mul-timedia projector enabling a user to process sound in audio mixing domain by hand gestures. The image processing method and hand shape parameterization method are described in relation to the specificity of the input and data classifiers. The SVM classifier is considered the optimum choice for the engineered gesture-based sound mixing system.

Entry No. 459

Entry type journal paper

Authors K. Kopaczewski, M. Szczodrak, A. Czyżewski, H. Krawczyk

English title A method for counting people attending large public events

Polish title

Journal Multimedia Tools and Applications

Volume

Number

Pages 1 - 13

Notes Wydane jako "Online First, Open Access", DOI: 10.1007/s11042-013-1628-0

Abstract The algorithm for people counting in crowded scenes, based on the idea of virtual gate is presented. The concept and practical application of the developed algorithm in real conditions is depicted. The aim of the work is to estimate the number of people passing through entrances of a large sport hall. The most challenging problem was the unpredicted behavior of people while entering the building. Flow of people fluctuated between single persons and dense crowd. A series of experiments during sport and entertainment events was made. The results of the experiments show high accuracy of the algorithm.

Entry No. 460

Entry type conference paper

Authors T. Poremski, J. Kotus, P. Odya, P. Suchomski, B. Kostek, A. Czyżewski

English title THE APPLICATION OF SOUND SYNTHESIS IN DETERMINING THE CHARACTERISTICS OF SUBJECTIVE TINNITUS

Polish title ZASTOSOWANIE SYNTEZY DŹWIĘKU W OKREŚLANIU CECH CHARAKTERYSTYCZNYCH SUBIEKTYWNYCH SZUMÓW USZNYCH

Conference XV Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku, ISSET 2013

Preprint

Number

Volume

Pages 1 - 15

Conference site Kraków,

Conference date 27.6.2013- 29.6.2013

Notes ISBN 987-83-921663-4-4

Streszczenie W niniejszym referacie przedstawiono wykorzystanie opracowanego Syntezatora dźwięku w pomiarach parametrów psychoakustycznych szumów usznych. W pierwszej kolejności przywołano definicję szumów usznych, zestaw procedur i testów stosowanych w ich ocenie, jak również kryteria służące do oceny szumów usznych. Następnie opisano Syntezator dźwięku opracowany w Katedrze Systemów Multimedialnych oraz zilustrowano przygotowany interfejs użytkownika. W ramach prowadzonych badań z osobami cierpiącymi na szumy uszne dokonano oceny skuteczności syntezatora, polegającej na porównaniu wyników uzyskanych przy jego użyciu oraz z wykorzystaniem audiometru klinicznego. Jako miarę do porównania przyjęto czas trwania badania oraz subiektywną ocena podobieństwa wzorca szumu do odczuwanego własnego szumu usznego. Z przeprowadzonych badań wynika, że zastosowanie Syntezatora skraca czas przeprowadzenia badania. Uzyskane w ten sposób wzorce szumu usznego są ponadto oceniane przez pacjentów jako bardziej podobne do odczuwanych szumów usznych.

Entry No. 461

Entry type journal paper

Authors J. Cichowski, A. Czyżewski

English title PRIVACY ENHANCEMENT METHODS FOR SURVEILLANCE SYSTEMS, AN OVERVIEW OF THE DEVELOPED ARCHITECTURES AND ALGORITHMS

Polish title OCHRONA PRYWATNOŚCI W SYSTEMACH MONITORINGU WIZYJNEGO, PRZEGLĄD OPRACOWANYCH ARCHITEKTUR I ALGORYTMÓW

Journal Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej

Volume

Number 36

Pages 47 - 52

Bibliographic No. 9

Notes ISSN: 2353-1290

Abstract The main focus of the paper concerns methods and algorithms for enhancing privacy in visual surveillance systems. Analysis of possible approaches to smart surveillance systems architectures with regards to personal data protection was made. A balance between privacy and security is searched for employing three possible solutions presented, each of them using a different hardware and software setup. The proposed system architectures accompanied by descriptions of algorithmic background and applied specific mechanisms explaining were included. Summarizing remarks pertaining implementation and research results were added.

Streszczenie Nieustannie rozwijające się technologie informacyjne związane z inteligentnym monitoringiem wizyjnym stwarzają ryzyko niewłaściwego wykorzystywania danych osobowych. W celu zapewnienia prawidłowej ochrony materiału wizyjnego, w ramach projektów realizowanych w Katedrze Systemów Multimedialnych WETI PG, opracowany został szereg architektur i algorytmów, które ułatwiają ochronę danych wrażliwych, takich jak: wizerunki osób, numery tablic rejestracyjnych, okna budynków i samochodów oraz prywatne posesje. W referacie opisano podstawy algorytmiczne rozwiązań stosowanych do celów detekcji oraz klasyfikacji obszarów wrażliwych w obrazach wizyjnych. Przedstawiono opracowane algorytmy anonimizacji, których zastosowanie w związku z zaproponowanymi architekturami przepływu informacji umożliwia odwracalną ochronę danych wrażliwych. Odwracalność procesu anonimizacji umożliwia osiągnięcie rozsądnego kompromisu pomiędzy osiąganym poziomem bezpieczeństwa i poszanowaniem prywatności osób postronnych.

Entry No. 462

Entry type conference paper

Authors A. Czyżewski, P. Dalka, Ł. Kosikowski, B. Kunka, M. Lech, P. Odya

English title Multimodal human-computer interfaces based on advanced video and audio analysis

Polish title Multimodalne interfejsy człowiek-komputer bazujące na zaawansowanej analizie obrazu i dźwięku

Conference 2013 The 6th IEEE International Conference on Human System Interaction (HSI)

Preprint

Number

Volume

Pages 18 - 25

Conference site Gdańsk, Polska

Conference date 6.6.2013- 8.6.2013

Abstract Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart Pen providing a tool for supporting therapy of developmental dyslexia is presented and results achieved with its application are discussed. The eye-gaze tracking system named “Cyber-Eye” developed at the Multimedia Systems Department employed to many kinds of experiments is presented including analysis of visual activity of patients remaining in vegetative state and their awareness evaluation. The scent emitting multimodal computer interface provides an important supplement of the polysensoric stimulation process, playing an essential role in education and therapy of children with certain developmental disorders. A new approach to diagnosing Parkinson's disease is shown. The progression of the disease can be measured employing the UPDRS (Unified Parkinson Disease Rating Scale) scale which is used to evaluate motor and behavioral symptoms of Parkinson's disease, based on the multimodal interface called Virtual-Touchpad (VTP) used for supporting medical diagnosis. The paper is concluded with some general remarks concerning the role of multimodal computer interfaces applied to learning, therapy and everyday usage of computerized devices.

Entry No. 463

Entry type conference paper

Authors T. Poremski, J. Kotus, P. Odya, P. Suchomski, A. Czyżewski, B. Kostek

English title DETERMINATION OF SUBJECTIVE TINNITUS CHARACTERISTICS BY MEANS OF SOUND SYNTHESIS CONTROLLED BY THE TOUCH SCREEN INTERFACE

Polish title Badanie psychoakustycznych charakterystyk szumów usznych

Conference ICAD 2013 - International Conference on Auditory Display

Preprint

Number

Volume

Pages 261 - 265

Conference site Łodź, Polska

Conference date 6.7.2013- 10.7.2013

Notes http://www.icad2013.com/paper/33_S8-3_Poremski.pdf

Abstract Determination of Tinnitus (defined as a phantom auditory sensation) characteristics concerning sound type, level, bandwidth or frequency are one of the steps in the measurement protocol. A novel technique to measure Tinnitus parameters is proposed. It is based on a computer application designed as an auditory display for easier identification of the perceived Tinnitus. The proposed method utilizes sound synthesis employing a special graphical user interface to facilitate sound generation and identification. The method was verified during preliminary tests organized with participation of people suffering from Tinnitus and compared with the classical audiometry-based measurements. The obtained results are presented and discussed in the paper.

Streszczenie W pracy przedstawiono pokrótce problematykę szumów usznych. W szczególności skupiono się na pomiarach psychoakustycznych charakterystyk szumów usznych. W tym celu wykorzystano aplikację oparta na syntezie tonów i szumu wąskopasmowego.

Entry No. 464

Entry type journal paper

Authors P. Bratoszewski, A. Czyżewski

English title Head Tracking using the Time of Flight Camera

Polish title Śledzenie głowy użytkownika komputera z użyciem kamery Time of Flight

Journal Zeszyty naukowe WE PG

Volume

Number 36

Pages 35 - 38

Bibliographic No. 6

Abstract A depth image based real-time head tracking system is described. The proposed system utilizes the Time of Flight camera and digital image processing techniques in order to track user’s head position in the real world coordinates. The detection of head position is based on the shape features of the silhouette. Tracking of head location is enhanced and smoothed by the usage of the Kalman filtering. The developed application runs in real-time and is resistant to different lighting conditions.

Streszczenie Opisano opracowaną metodę śledzenia położenia głowy użytkownika komputera lub urządzenia mobilnego przy wykorzystaniu kamery mierzącej czas powrotu wiązki promieniowania elektromagnetycznego podczerwonego odbitej od oświetlanego obiektu (ang. Time Of Flight camera). Dzięki zastosowaniu odpowiednich metod cyfrowego przetwarzania obrazu pozyskanego z kamery tego typu możliwe jest zlokalizowanie użytkownika w przestrzeni 3D. Znajomość dokładnej lokalizacji głowy może posłużyć tworzeniu nowych interfejsów komunikacji między człowiekiem a maszyną lub tworzeniu aplikacji komputerowych nowego typu.

Entry No. 465

Entry type journal paper

Authors K. Lisowski, A. Czyżewski

English title Methods For Object Tracking In Distributed Video Surveillance Systems

Polish title Metody Śledzenia Obiektów W Rozproszonych Systemach Monitoringu Wideo

Journal Zeszyty naukowe WE PG

Volume

Number 36

Pages 216 - 222

Abstract Streszczenie: Video surveillance systems have become a common part of both the public spaces and places with limited access. Monitoring a large surface area requires arrangement of multiple cameras. Effective analysis of the large number of video images by human is virtually impossible. Therefore, methods for automatic video processing aimed at contextual analysis are developed. In the case of non-overlapping fields of view re-identification of objects in different cameras is important. This paper focuses on a review of methods for tracking objects between cameras. Eventually automatic analysis is applied in video surveillance systems in order to facilitate the observation of situation in a large area by indicating the video streams associated with certain significant events.

Streszczenie Systemy monitoringu wideo stały się powszechną częścią zarówno przestrzeni publicznej jak również miejsc o ograniczonym dostępie. Nadzór obszaru o dużej powierzchni wymaga rozmieszczenia wielu kamer. Skuteczna analiza przez człowieka dużej liczby obrazów wideo jest praktycznie niemożliwa. Dlatego rozwijane są metody służące do automatycznego przetwarzania wideo ukierunkowanego na analizę kontekstową. W przypadku niepokrywających się pól widzenia kamer znaczenia nabiera również reidentyfikacja obiektów w różnych kamerach. Ten referat koncentruje się na przeglądzie metod śledzenia obiektów pomiędzy kamerami. Docelowo automatyczna analiza ma ułatwić śledzenie sytuacji na dużym obszarze poprzez wskazanie strumieni wideo skojarzonych z pewnymi istotnymi zdarzeniami.

Entry No. 466

Entry type conference paper

Authors A. Czyżewski

English title Application of computer multimedia interfaces to diagnosis and therapy of vegetative state patients

Polish title Zastosowanie komputerowych interfejsów dla potrzeb diagnozy i terapii pacjentów w stanie wegetatywnym

Conference III Konferencja "Jest życie w śpiączce"

Preprint

Number

Volume

Pages 1 - 47

Conference site Toruń, Polska

Conference date 12.9.2013- 13.9.2013

Notes prezentacja Power Point i program konferencji

Streszczenie W dniach 12-13 września 2013 w Toruniu odbyła się III Międzynarodowa Konferencja “Jest życie w śpiączce” zorganizowana przez Fundację “Światło”, współpracującą z Katedrą Systemów Multimedialnych. Głównym przesłaniem konferencji było zwrócenie uwagi na świadomość pacjentów apalicznych (w tzw. stanie wegetatywnym) oraz możliwość i konieczność dotarcia do nich także za pomocą specjalistycznych urządzeń do komunikacji pozawerbalnej. Podczas uroczystego bankietu prof. Andrzej Czyżewski otrzymał statuetkę Przyjaciela Fundacji “Światło” oraz honorowe członkostwo Stowarzyszenia Apalicznych i Wybudzonych “Motyl”.

Entry No. 467

Entry type book

Authors A. Czyżewski, H. Krawczyk

English title KASKADA Platform and Multimedia Applications

Polish title Platforma KASKADA i aplikacje multimedialne

Editor KASKBOOK Politechnika Gdańska

Pages 1 - 219

Notes redakcja wydawnictwa zbiorowego

Abstract Zadania wykonane w ramach projektu MAYDAY EURO 2012 w temacie można podzielić na dwie główne kategorie: prace implementacyjne, obejmujące zarówno implementację rozwiązań ekstrakcji cech twarzy jak i systemów rozpoznawania i typowania osób na platformach WINDOWS i KASKADA, prace badawcze, obejmujące z kolei badania związane ze skutecznością typowania i rozpozna-wania osób oraz możliwości zrównoleglania opracowanych rozwiązań na platformie KASKADA. Wykonane w ramach każdej z kategorii zadania opisane są w kolejnych rozdziałach pracy zbiorowej, która ukazała się w wydawnictwie książkowym nakładem Katedry Architektury Systemów Komputerowych WETI PG.

Entry No. 468

Entry type conference paper

Authors A. Czyżewski

English title New applications of multimedia technology to the enhancement of learning, therapy and rehabilitation

Polish title Nowe zastosowania technologii multimedialnych do usprawniania edukacji, terapii i rehabilitacji

Conference Net Vision 13 Ogólnopolska konferencja biznesu i nowych technologii

Preprint

Number

Volume

Pages 1 - 47

Conference site Gdańsk, Polska

Conference date 18.4.2013- 21.4.2013

Notes prezentacja Power Point i program konferencji

Streszczenie W wystąpieniu na konferencji zorganizowanej w Politechnice Gdańskiej w ramach XIII edycji spotkań NetVision zaprezentowano opracowane w Katedrze Systemów Multimedialnych technologie w aspekcie zarządzania projektami IT i ich potencjalnego wpływu na technologie przyszłości.

Entry No. 469

Entry type conference paper

Authors A. Czyżewski

English title New Applications of Multimodal Human-Computer Interfaces

Polish title Nowe zastosowania komputerowych interfejsów człowiek-maszyna

Conference Expanding innovations by joining strengths Japanese-Polish Science and Technology Seminar

Preprint

Number

Volume

Pages 1 - 54

Conference site Tokyo, Japan

Conference date 16.10.2013

Notes prezentacja Power Point, wydruk programu i abstarkt

Abstract Developed multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named “Cyber Eye” is presented including the method of analysis of visual activity of patients remaining in vegetative state helping to assessment of their awareness. The scent emitting multimodal computer interface is also discussed. A new approach to diagnosing Parkinson’s disease is shown which is used to evaluate motor and behavioral symptoms of the neurodegenerative disease. The paper is concluded with some additional demonstrations of technologies developed for applications to intelligent surveillance systems and for enhancement of degraded audio recordings.

Entry No. 470

Entry type conference paper

Authors A. Czyżewski, B. Kostek, T. Ciszewski, D. Majewicz

English title Language material for English audiovisual speech recognition system developmen

Polish title Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

Conference The Journal of the Acoustical Society of America vol. 134/5, p. 4069 (abstr.) plus Proceedings of Meetings on Acoustics

Preprint

Number 1

Volume 20

Pages 1 - 7

Conference site San Francisco, USA

Conference date 2.12.2013- 6.12.2013

Notes online: http://scitation.aip.org/content/asa/journal/poma/20/1/10.1121/1.4864363

Abstract The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the lexical word frequencies and casual speech sound frequencies in the BNC corpus of approx. 100m words. The semantically and syntactically congruent sentences mirror the 100m-word corpus frequencies. The absolute deviation from source sound frequencies is .09% and individual vowel deviation is reduced to a level between .0006% (min.) and .009% (max.). The absolute consonant deviation is .006% and oscillates between .00002% (min.) and .012% (max.). Similar convergence is achieved in the language sample for testing algorithms (29 sentences; 599 sounds). The post-recording analysis involves the examination of particular articulatory settings which aid visual recognition as well as co-articulatory processes which may affect the acoustic characteristics of individual sounds. Results of bi-modal speech elements recognition employing the language material are included in the paper.

Streszczenie System rozpoznawania mowy bimodalny wymaga bimodalnych próbek do trenowania i do testowania algorytmów w celu rozpoznawania naturalnej mowy w języku angielskim. Do celów nagrań audiowizualnych , bazy danych, treningu utworzono słownik 264 zdań (1730 słów bez powtórzeń ; 5685 dźwięków). Słownik odzwierciedla frekwentację spółgłosek i samogłosek w mowie potocznej. Zarejestrowany materiał odzwierciedla zarówno leksykalne frekwentacje haseł, jak i frekwentacje dźwięków mowy w korpusie BNC obejmującym ok 100 mln słów. Absolutne odchylenie od częstotliwości dźwięku źródłowych jest na poziomie 0,09 % a indywidualne odchylenie frekwentacji samogłosek jest zmniejszone do poziomu pomiędzy 0,0006 % (minimum ) i 0,009 % (max.) . Absolutne odchylenie spółgłosek jest 0,006 % i waha się od 0,00002 % (minimum ) do 0,012 % (max.). Podobną zbieżność uzyskuje się dla próbki testów językowych dla algorytmów ( 29 zdań; 599 dźwięków). Wyniki bi-modalnego rozpoznawania elementów mowy wykorzystującego opracowany materiał językowy są zawarte w referacie.

Entry No. 471

Entry type conference paper

Authors A. Czyżewski, J. Cichowki, A. Kuryjanow, B. Kostek

English title Online sound restoration system for digital library applications

Polish title Internetowy system rekonstrukcji dźwięku do zastosowań w bibliotekach cyfrowych

Conference 116th Meeting of Acoustical Society of America

Preprint

Number

Volume

Pages 1 - 17

Conference site San Francisco, USA

Conference date 2.12.2013- 6.12.2013

Notes The Journal of the Acoustical Society of America vol. 134/5, p. 3999 (abstr.) plus Proceedings of Meetings of Acoustics vol. 20 http://scitation.aip.org/content/asa/journal/poma/20/1/10.1121/1.4863268

Abstract Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jann- sen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion prediction and compensations algorithms are computationally complex, an implementation which uses parallel computing has been proposed. Many archival recordings are at the same time clipped and affected by wideband noise. To restore those recordings, the algorithm based on the concatenation of signal clipping reduction and spectral expansion was proposed. The clip- ping reduction algorithm uses an intelligent interpolation to replace dis- torted samples with the predicted ones based on learning algorithms. Next, spectral expansion is performed in order to reduce the overall level of noise. The online service has been extended with some copyright protection mechanisms. Immunity of watermarks to the sound restoration is discussed with regards to low-level music feature vectors embedded as watermarks. Then, algorithmic issues pertaining watermarking techniques are briefly recalled. The architecture of the designed system is presented.

Entry No. 472

Entry type book

Authors A. Dziech, A. Czyżewski

English title Multimedia Communications, Services and Security

Polish title Multimedialna komunikacja, usługi i technologie bezpieczeństwa

Editor Springer

Pages 1 - 323

Notes red. pokonferencyjnego wydawnictwa książkowego

Abstract The edited book contains 27 papers devoted to applications of most modern multimedia solutions applicable to communication, various kind of services, and especially to security & safety.

Entry No. 473

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Implementacja metod analizy obrysu dłoni oraz analiza łączenia wyników wielu metod

Report Number SAMS17/11/13

Streszczenie W założeniach systemu SGRS, aplikacja używająca tego systemu powinna mieć dostęp do lokalizacji wszystkich palców ręki. Dotychczas zaimplementowane metody skupiały się głównie na bezpośrednim określaniu położenia opuszków dłoni. Przykładem takiej metody może być metoda dopasowania do wzorca kołowego opisana przez P. Trellę w raporcie [1]. Metody geometrycznej analizy obrysu dłoni uzyskanego w wyniku segmentacji tła są metodą bardziej ogólną, pozwalającą na uzyskanie większej ilości parametrów wykrytych obiektów, ale z kolei powodującą potencjalnie większy koszt obliczeniowy. Istniejące algorytmy zostały zaprojektowane przez P. Bratoszewskiego we wcześniejszych pracach związanych z kamerą ToF.

Entry No. 474

Entry type conference paper

Authors A. Czyżewski, A. Ciarkowski, B. Kostek, J. Cichowski

English title Online sound restoration system for digital library applications

Polish title

Conference Proceedings of Meetings on Acoustics (Acoustical Society of America) POMA

Preprint 055004

Number 20

Volume

Pages 1 - 17

Conference site San Francisco, USA

Conference date 2.12.2013- 6.12.2013

Notes http://scitation.aip.org/content/asa/journal/poma/20/1/10.1121/1.4863268

Abstract Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion prediction and compensations algorithms are computationally complex, an implementation which uses parallel computing has been proposed. Many archival recordings are at the same time clipped and affected by wideband noise. To restore those recordings, the algorithm based on the concatenation of signal clipping reduction and spectral expansion was proposed. The clipping reduction algorithm uses an intelligent interpolation to replace distorted samples with the predicted ones based on learning algorithms. Next, spectral expansion is performed in order to reduce the overall level of noise. The online service has been extended with some copyright protection mechanisms. Immunity of watermarks to the sound restoration is discussed with regards to low-level music feature vectors embedded as watermarks. Then, algorithmic issues pertaining watermarking techniques are briefly recalled. The architecture of the designed system is presented.

Entry No. 475

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Dostosowanie algorytmów detekcji i śledzenia opuszków palców do współpracy z kamerą ToF w systemie SGRSwF

Report Number SAMS18/11/13

Streszczenie W specyfikacji wymagań systemu SGRS wyszczególnionych jest kilka typów kamer, z których może korzystać system. Jednym z typów jest kamera ToF (ang. Time of Flight). Zadanie polegało na podmianie, z istniejącego już łańcucha przetwarzania, kamery źródłowej z RGB na kamerę ToF, oraz napisaniu prostych algorytmów pośredniczących – dostosowujących postać obrazu z kamery ToF do tego istniejącego łańcucha przetwarzania.

Entry No. 476

Entry type journal paper

Authors A. Czyżewski, J. Cichowski, A. Kupryjanow, B. Kostek

English title Online sound restoration system for digital library applications

Polish title

Journal J. Acoust. Soc. Amer.

Volume 134

Number 5

Pages

Notes http://scitation.aip.org/content/asa/journal/jasa/134/5/10.1121/1.4830591

Abstract Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion prediction and compensations algorithms are computationally complex, an implementation which uses parallel computing has been proposed. Many archival recordings are at the same time clipped and affected by wideband noise. To restore those recordings, the algorithm based on the concatenation of signal clipping reduction and spectral expansion was proposed. The clipping reduction algorithm uses an intelligent interpolation to replace distorted samples with the predicted ones based on learning algorithms. Next, spectral expansion is performed in order to reduce the overall level of noise. The online service has been extended with some copyright protection mechanisms. Immunity of watermarks to the sound restoration is discussed with regards to low-level music feature vectors embedded as watermarks. Then, algorithmic issues pertaining watermarking techniques are briefly recalled. The architecture of the designed system together with the employed workflow for embedding and extracting the watermark are described. The implementation phase is presented and the experimental results are reported.

Entry No. 477

Entry type

Authors A. Czyżewski, G. Szwoch

English title Method and Apparatus for Acoustic Echo Cancellation in VOIP Terminal

Polish title Metopd ai urządzneia do usuwania echa w terminalu telefonii internetowej

Notes Data udzielnia patentu: 19. 11. 2013

Abstract The subject of the invention is a method and a circuit for acoustic echo cancellation in VoIP terminal. The solution is intended for various types of client terminals of Internet voice communication systems, especially when the client uses a loudspeaker instead of a headset and it is based on the application of watermarking technology.

Entry No. 478

Entry type

Authors A. Czyżewski, G. Budzyński

English title A way and device for warning about approaching train

Polish title Sposób i układ ostrzegania dla przejazdów kolejowych

Notes data zgłoszenia wynalazku 25.11.2008 r.

Streszczenie Przedmiotem wynalazku jest sposób i układ ostrzegania dla skrzyżowań szynowych. Znajduje on zastosowanie jako samoczynnie działający system ostrzegający przed kolizjami na przejazdach przez tory pojazdów szynowych, w szczególności tory kolejowe i tramwajowe, tj. wszelkiego rodzaju skrzyżowaniach dróg szynowych z drogami ruchu kołowego i pieszego.

Entry No. 479

Entry type conference paper

Authors J. Kotus, P. Dalka, M. Szczodrak, G. Szwoch, P. Szczuko, A. Czyżewski

English title Multimodal Surveillance Based Personal Protection System

Polish title

Conference Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) 2013

Preprint

Number

Volume

Pages 100 - 105

Conference site Poznań, Polska

Conference date 26.9.2013- 28.9.2013

Notes Referat dostępny w IEEE Xplore: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6710605

Abstract A novel, multimodal approach for automatic detection of abduction of a protected individual, employing dedicated personal protection device and a city monitoring system is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Blue-tooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person presence, and in case of abduction attempt it reports an alert accompanied with GPS coordinates. The video monitoring algorithm analyses streams from cameras closest to the event coordinates, examines the direction of objects’ movement and detects situations such as invaliding cars and road blocking. Thermal camera images are used for the detection of explosions and tracing cars in difficult lighting conditions. The audio monitoring subsystem uses acoustic sensors for the detection and localization of important sounds, such as shouts and gunshots. As a result, the combined modalities allow for the detection of important security threats, i.e. a person abduction in some studied case scenarios.

Streszczenie Referat przedstawia nowatorskie, multimedialne podejście do automatycznego wykrywania zdarzenia porwania osoby, z wykorzystaniem dedykowanego urządzenia ochrony osobistej oraz systemu monitoringu miejskiego. Proponowane rozwiązanie łączy cztery modalności: sygnały z urządzenia Bluetooth, obrazy z kamer stacjonarnych, obrazy z kamer termowizyjnych, sygnały z sensorów akustycznych. Sygnał z urządzenia bluetooth jest nieustannie monitorowany w celu określenia położenia osoby, w przypadku zaniku sygnału wskutek porwania osoby wysyłany jest alarm zawierający położenie GPS urządzenia. Algorytmy analizy obrazu dokonują przetwarzania sygnałow z kamer położonych najbliżej miejsca zdarzenia, wykrywają kierunek ruchu obiektów oraz zdarzenia takie jak zablokowanie ruchu pojazdu. Obrazy z kamer termowizyjnych są wykorzystywane do wykrywania eksplozji i do śledzenia ruchu obiektów w warunkach zmniejszonej widoczności. Dane z sensorów akustycznych są używane do wykrywania i lokalizacji istotnych zdarzeń dźwiękowych, takich jak wystrzał z broni palnej czy krzyki. Połączenie opisanych modalności pozwala na wykrywanie istotnych zagrożeń bezpieczeństwa, takich jak porwanie osoby w opisywanym przykładowym scenariuszu.

Entry No. 480

Entry type journal paper

Authors A. Czyżewski, K. Lisowski, P. Bratoszewski, P. Hoffmann

English title

Polish title Europejski projekt ADDPRIV Automatyczna interpretacja danych pozyskiwanych z obrazu dla potrzeb systemów monitoringu wizyjnego funkcjonujących z poszanowaniem prywatności osób

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages

Bibliographic No. 9

Notes artykuł na płycie CD

Streszczenie Systemy monitorowania bezpieczeństwa publicznego generują i przechowują ogromne ilości danych implikując wzrost prawdopodobieństwa użycia tych danych w sposób nieodpowiedni z punktu widzenia ochrony danych osobowych. W niniejszym referacie zaprezentowany jest europejski projekt ADDPRIV, który bezpośrednio odnosi się do kwestii poszanowania prywatności poprzez automatyczne rozpoznawanie istotności danych pochodzących z rozproszonego systemu kamerowego nadzoru bezpieczeństwa. Przedstawiono założenia projektu i wyniki dotychczasowych prac badawczo eksperymentalnych zrealizowanych w Politechnice Gdańskiej, która jest uczestnikiem tego projektu europejskiego.

Entry No. 481

Entry type conference paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title

Polish title Usługi przygotowane w ramach gridu dziedzinowego AKUSTYKA D.1

Conference Spotkanie Techniczne Projektu PLGrid Plus

Preprint

Number

Volume

Pages

Conference site Wisła, Polska

Conference date 23.10.2013- 25.10.2013

Notes Prezentacja multimedialna

Entry No. 482

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Implementacja algorytmów śledzenia punktów lokalizujących opuszki palców w obrazie z kamery RGB, oraz przygotowanie aplikacji demonstracyjnej

Report Number SAMS15/11/13

Streszczenie Celem prac realizowanych w ostatnim czasie przez cały zespół projektowy było przygotowanie pierwszej kompletnej aplikacji demonstracyjnej systemu, pokazującej kierunek prowadzonych badań. Do jej stworzenia potrzebne były jeszcze dwa ostatnie ogniwa łańcucha przetwarzania obrazu. Jako pierwszy z nich - algorytm śledzenia punktów na podstawie sekwencji przetworzonych (progowanie oraz detekcja opuszków) obrazów z kamery wideo. Jako drugi – algorytm rozpoznawania gestów na podstawie relacji czasowych i przestrzennych śledzonych punktów.

Entry No. 483

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Integracja filtrów i aplikacji z nową wersją platformy SGRS oraz analiza sposobów wykorzystania kaskady Haara w systemie rozpoznawania gestów

Report Number SAMS13/10/13

Streszczenie Wraz ze zmianą architektury systemu Spatial Gesture Recognition System zmieniły się zarówno wzorce projektowania aplikacji jak i filtrów w systemie. Efekty prac dotychczas poczynionych w ramach projektu stały się wraz z aktualizacją silnika niezgodne z architekturą. W ramach pracy wszystkie dotychczas zaimplementowane metody przetwarzania zostały zintegrowane z nowym środowiskiem SGRS. Raport ten ma na celu opisać strukturę przykładowego filtra oraz aplikacji zgodnych z nową strukturą SGRS.

Entry No. 484

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Poszerzenie zakresu działania algorytmów do założeń projektowych

Report Number SAMS14/10/13

Streszczenie Głównym celem realizowanych prac było znajdowanie opuszków palców w obrazie z kamery RGB. W założeniu algorytmy systemu SGREwF mają poprawnie działać przy zmiennym oświetleniu, ruchomym tle i ruchomej kamerze. Taki scenariusz pracy nie jest możliwy, gdy wstępne przetwarzanie obrazu wizyjnego jest oparte o algorytmy odejmowania tła [2] (codebook, GMM). Jedną z metod uniezależnienia się od tych warunków jest oparcie przetwarzania o segmentację obrazu względem koloru skóry. Można to zrealizować wykorzystując progowanie w przestrzeni barw HSV (Hue – barwa, Saturation – nasycenie, Value – wartość/jasność). Kolejnym zadaniem było zaimplementowanie nowego algorytmu detekcji opuszków palców w sprogowanym obrazie wizyjnym.

Entry No. 485

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Dostosowanie metod przetwarzania obrazu do architektury filtrów w systemie rozpoznawania gestów na urządzenia mobilne

Report Number SAMS06/04/13

Streszczenie Architektura systemu Spatial Gesture Recognition System zakłada stworzenie systemu niskopoziomowego rozpoznawania gestów działającego na różnych urządzeniach i różnych systemach. Produkt powinien realizować uchwyty funkcji wykrywania gestów dla programistów w warstwie aplikacji. Projekt, którego niniejszy raport dotyczy miał na celu wypełnienie odpowiedniej warstwy systemu SGRS metodami wstępnego przetwarzania obrazu przygotowanie systemu do możliwości wywoływania algorytmów wchodzących w skład biblioteki OpenCV. Kolejnym celem była implementacja metody modelowania tła Codebook w architekturze SGRE PC. Dokonano również analizy jakości przetwarzania pod kątem pracy w warunkach założonych w systemie.

Entry No. 486

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Implementacja metody modelowania tła codebook w systemie SGRSwF

Report Number SAMS07/04/13

Streszczenie Przygotowanie systemu SGRSwF do możliwości wywoływania algorytmów wchodzących w skład biblioteki OpenCV. Implementacja metody modelowania tła Codebook a architekturze SGRE PC. Analiza jakościowa przetwarzania w warunkach pracy założonych w systemie.

Entry No. 487

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Implementacja metody znajdowania palców w obrazie z kamery RGB przy wykorzystaniu dopasowywania do wzorców w systemie SGRSwF

Report Number SAMS08/05/13

Streszczenie Celem zrealizowanej pracy była implementacja algorytmu odnajdującego opuszki palców w obrazie z kamery RGB, wykorzystującej dopasowywanie fragmentów obrazu do statycznego wzorca. Do przetwarzania wstępnego posłużono się algorytmem odejmowania tła „codebook”, będącego tematem poprzedniego raportu.

Entry No. 488

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Testy jakości rozpoznawania gestów klasyfikatora Viola Jones względem zbiorów trenujących oraz parametrów przetwarzania

Report Number SAMS09/06/13

Streszczenie W środowisku Spatial Gesture Recognition System został zaimplementowany kaskadowy klasyfikator Viola Jones. Została również zbadana jego wrażliwość na zmiany parametrów wywoływania. Dodatkowo, został zaimplementowany skrypt oraz program pomocniczy C++ w celu wysoko sparametryzowanego treningu nowych klasyfikatorów.

Entry No. 489

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Lokalizacja i naprawa błędów uniemożliwiających prace rozwojowe oraz dalsze rozwijanie architektury systemu SGRS

Report Number SAMS10/07/13

Streszczenie W bieżącym miesiącu miały być implementowane kolejne algorytmy detekcji obiektów (opuszków palców) w obrazie z kamery RGB, a następnie algorytmy śledzenia tych obiektów. Nie było to jednak możliwe ze względu na pewne ograniczenia struktury systemu SGRS, które najpierw musiały zostać wyeliminowane.

Entry No. 490

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Implementacja wykrywania opuszków palców przy pomocy klasyfikatora kaskadowego oraz badanie wydajności przetwarzania

Report Number SAMS11/10/13

Streszczenie Silnik Spatial Gesture Recognition System współtworzony w ramach projektu na potrzeby dokładnego śledzenia gestów dłoni potrzebuje wydajnego a zarazem dokładnego algorytmu lokalizującego opuszki palców w strumieniu wizyjnym. Prace poczynione w tym kierunku dotychczas były bazowane na metodzie dopasowania do wzorca. Niniejsza praca ma na celu optymalizację wykrywania opuszków palców w obrazie oraz uproszczenie schematu detekcji.

Entry No. 491

Entry type report

Authors A. Czyżewski, J. Cichowski, W. Moskwa, B. Hermanowicz

English title H/W Low Level Design - Generator of Ultrasonic Haptic Feedback

Polish title

Report Number SAMS12/08/13

Abstract This document summarizes the research carried out since 2013-04-01 till 2013-08-31. It is deliverable of the task 4. The main objective of the mentioned task was the design of the device to produce a concentrated beam of ultrasound. The following sections present the electrical diagrams for device’s subcomponents. The list and functionality of the specific parts is briefly presented in the Section entitled functional block diagram.

Entry No. 492

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski

English title Video Analytics-Based Algorithm for Monitoring Egress from Buildings

Polish title

Conference 6th International Conference: Multimedia Communications, Services and Security (MCSS)

Preprint

Number

Volume

Pages 224 - 232

Conference site Kraków, Polska

Conference date 6.6.2013- 7.6.2013

Notes Communications in Computer and Information Science 368, A. Dziech, A. Czyżewski (Eds), Springer 2013

Abstract A concept and practical implementation of the algorithm for detecting of potentially dangerous situations of crowding in passages is presented. An example of such situation is a crush which may be caused by obstructed pedestrian pathway. Surveillance video camera signal analysis performed on line is employed in order to detect hold-ups near bottlenecks like doorways or staircases. The details of implemented algorithm which uses optical flow method combined with fuzzy logic are explained. The implementation details are introduced with focus on the computing platform and parallel processing. The experiments were carried out on the set of gathered video recordings from the surveillance camera installed in the campus of Gdansk University of Technology. The results of experiments performed on gathered video recordings show that efficiency of the algorithm is high.

Entry No. 493

Entry type

Authors K. Łopatka, A. Czyżewski

English title Method and apparatus for speech intelligibility enhancement in multichannel multimedia signal

Polish title Sposób poprawy zrozumiałości mowy w wielokanałowym sygnale multimedialnym i układ do realizacji sposobu

Notes to jest na razie zgłoszenie patentowe

Entry No. 494

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Adaptacja metod przetwarzania do działania w środowisku Android oraz integracja metody rozpoznawania gestu dynamicznego z platformą SGRS

Report Number SAMS16/11/13

Streszczenie Ze względu na ustalony termin demonstracji działania algorytmów rozpoznawania opuszków palców na urządzeniu opartym o system Android, przedsięwzięte zostały prace związane z przygotowaniem aplikacji. Implementacja filtru Kalmana oraz metody wykrywania gestu „kliknięcia” przygotowana przez P.Bratoszewskiego została załączona do biblioteki metod przetwarzania Spatial Gesture Recognition Engine w celu stworzenia aplikacji demonstrującej jakość przetwarzania w systemie używającym jednej kamery RGB. Z zaimplementowanych wcześniej metod wybrany został odpowiedni do warunków łańcuch filtrów, które następnie zostały przeniesione i uruchomione na platformie Android.

Entry No. 495

Entry type conference paper

Authors A. Czyżewski, Ł. Kosikowski, A. Kupryjanow, P. Odya

English title Auditory and visual attention stimulator

Polish title Stymulator uwagi słuchowej i wzrokowej

Conference Innowacje Technologie Maszyny

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 4.6.2013- 7.6.2013

Notes Plakat

Abstract The presented method is the use of visual and auditory stimulation, in such a way that the text displayed on the computer screen is modified synchronously with voice. During training, the eye-tracking system analyzes the visual fixation point on the computer screen.

Streszczenie Przedstawiona metoda polega na zastosowaniu stymulacji wzrokowej i słuchowej, w taki sposób, że tekst wyświetlany na ekranie monitora komputerowego jest modyfikowany synchronicznie z odtwarzanym w słuchawkach głosem lektora. W trakcie treningu, system śledzenia punktu fiksacji wzroku analizuje miejsce skupienia wzroku na ekranie. Na tej podstawie możliwe jest dostosowanie tempa mowy lektora do tempa czytania tekstu przez użytkownika.

Entry No. 496

Entry type journal paper

Authors A. Czyżewski, K. Lisowski

English title Employing flowgraphs for forward route reconstruction in video surveillance system

Polish title

Journal Journ. of Intelligent Information Systems

Volume

Number

Pages 1 - 15

Abstract Pawlak’s flowgraphs were utilized as a base idea and knowledge container for prediction and decision making algorithms applied to experimental video surveillance system. The system is used for tracking people inside buildings in order to obtain information about their appearance and movement. The fields of view of the cameras did not overlap. Therefore, when an object was moving through unsupervised areas, prediction was needed to identify the same object in the adjacent camera. Moreover, while being unobserved, the object was represented by measurements of probability of appearing in the subsequent camera’s field of view. Those probability values were obtained on the basis of knowledge from the past events contained in the flowgraph. Backward Route Reconstruction methods generated paths of objects providing input for prediction of next steps in the path. This prediction was named as Forward Route Reconstruction. The output of the prediction algorithm is a tree of probabilities of future object movement. Methods for creating a flowgraph from the paths of objects and for obtaining probability values for movement prediction are presented in this paper together with some experimental results and discussion.

Entry No. 497

Entry type conference paper

Authors K. Łopatka, B. Kunka, A. Czyżewski

English title Novel 5.1 Downmix Algorithm with Improved Dialogue Intelligibility

Polish title Nowy algorytm downmiksu 5.1 z poprawioną zrozumiałością ścieżki dialogowej w filmie.

Conference 134th AES Convention

Preprint Paper Number: 8831

Number

Volume

Pages

Conference site Roma, Italy

Conference date 4.5.2013- 7.5.2013

Bibliographic No. 11

Notes Open Access: http://www.aes.org/e-lib/browse.cfm?elib=16732

Abstract A new algorithm for 5.1 to stereo downmix is introduced, which addresses the problem of dialogue intelligibility. The algorithm utilizes proposed signal processing algorithms to enhance the intelligibility of movie dialogues, especially in difficult listening conditions or in compromised speaker setup. To account for the latter, a playback configuration utilizing a portable device, i.e. an ultrabook, is examined. The experiments are presented which confirm the efficiency of the introduced method. Both objective measurements and subjective listening tests were conducted. The new downmix algorithm is compared to the output of a standard downmix matrix method. The results of subjective tests prove that an improved dialogue intelligibility is achieved.

Streszczenie Zaproponowano algorytm zgrania wielokanałowej ścieżki dźwiękowej filmu w formacie 5.1 do stereofonii dwukanałowej (tzw. downmix) z uwzględnieniem zrozumiałości dialogów. Algorytm wykorzystuje opracowane algorytmy przetwarzania sygnałów w celu zwiększenia zrozumiałości dialogów filmowych, w szczególności w trudnych warunkach odsłuchu, zwłaszcza przy użyciu głośników o niskiej jakości. W celu uwzględnienia tych czynników wykorzystano testową konfigurację z użyciem przenośnego komputera typu ultrabook. Przedstawiono eksperymenty potwierdzające skuteczność opracowanej metody. Zostały przeprowadzone zarówno badania obiektywne, jak i subiektywne testy z udziałem grupy słuchaczy. Opracowany algorytm porównano ze standardowym macierzowym algorytmem downmix. Wyniki testów wskazują na zwiększoną zrozumiałość dialogów filmowych.

Entry No. 498

Entry type conference paper

Authors A. Kupryjanow, L. Kosikowski, P. Odya, A. Czyzewski

English title Auditory-visual attention stimulator

Polish title Stymulator uwagi słuchowo-wzrokowej

Conference Audio Engineering Society

Preprint 8810

Number

Volume

Pages 1 - 7

Conference site Rzym, Włochy

Conference date 4.5.2013- 7.5.2013

Bibliographic No. 13

Abstract New approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of the paper, results obtained for the reading comprehension training were presented. It was shown that usage of the proposed method could improve these skills in the group of children in the age between 7 and 8 years.

Entry No. 499

Entry type journal paper

Authors A. Kupryjanow, A. Czyzewski

English title Real-time speech signal segmentation methods

Polish title

Journal J. Audio Eng. Soc.

Volume 61

Number 7/8

Pages 1 - 14

Bibliographic No. 54

Notes w druku

Abstract We present two algorithms developed for the purpose of real-time speech signal analysis. The first algorithm’s function is for the vowel region detection and the second is the rate of speech estimator. Both algorithms have many applications in the field of speech processing, e.g. automatic speech recognition, automatic language identification, automatic emotion recognition, etc. Evaluation of the proposed algorithms assessed their accuracy, reliability and real-time operating capabilities and our experimental results show that the proposed algorithms perform equally well or better than the commonly known offline approaches.

Entry No. 500

Entry type journal paper

Authors M. Szczodrak, J. Kotus, A. Czyżewski, B. Kostek

English title The application of a noise mapping tool deployed in grid infrastructure for creating noise maps of urban areas

Polish title

Journal Computer Science

Volume 14

Number 2

Pages 231 - 242

Abstract The concept and implementation of the system for creating dynamic noise maps in PL-Grid infrastructure are presented. The methodology of dynamic acoustical maps creating is introduced. The concept of noise mapping, based on noise source and propagation models, was developed and employed in the system. The details of incorporation of the system to the PL-Grid infrastructure are presented. The results of simulations performed by the system prototype are depicted. The results in the form of noise maps obtained by a system are compared with some other solutions in order to investigate accuracy.

Entry No. 501

Entry type conference paper

Authors J. Cichowski, A. Kupryjanow, A Czyzewski

English title Further Developments of the Online Sound Restoration System for Digital Library Applications

Polish title

Conference Warsztaty SYNAT 2013

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 1.7.2013- 2.7.2013

Bibliographic No. 21

Notes w druku

Abstract New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net.Missing or distorted audio samples are predicted using a specific implementation of the Jannsen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the prediction algorithm is computationally complex, an implementation which uses parallel computing has been proposed. Many archival and homemade recordings are at the same time clipped and contain wideband noise. To restore those recordings, the algorithm based on the concatenation of signal clipping reduction and spectral expansion was proposed. The clipping reduction algorithm uses interpolation to replace distorted samples with the predicted ones. Next, spectral expansion is performed in order to reduce the overall level of noise. The online service has been extended also with some copyright protection mechanisms. Certain issues related to the audio copyright problem are discussed with regards to low-level music feature vectors embedded as watermarks. Then, algorithmic issues pertaining watermarking techniques are briefly recalled. The architecture of the designed system along with the employed workflow for embedding and extracting the watermark are described. The implementation phase is presented and the experimental results are reported. The paper is concluded with a presentation of experimental results of application of described algorithmic extensions to the online sound restoration service.

Entry No. 502

Entry type journal paper

Authors G. Szwoch, P. Dalka, A. Czyżewski

English title Spatial Calibration of a Dual PTZ-Fixed Camera System for Tracking Moving Objects in Video

Polish title Kalibracja podwójnego systemu kamer dla potrzeb śledzenia obiektów ruchomych w obrazie

Journal Journal of Imaging Science and Technology (JIST)

Volume 57

Number 2

Pages 1 - 10

Abstract A dual camera setup is proposed, consisting of a fixed (stationary) camera and a pan-tilt-zoom (PTZ) camera, employed in an automatic video surveillance system. The PTZ camera is zoomed in on a selected point in the fixed camera view and it may automatically track a moving object. For this purpose, two camera spatial calibration procedures are proposed. The PTZ camera is calibrated in relation to the fixed camera image, using interpolated look-up tables for pan and tilt values. For the calibration of the fixed camera, an extension of the Tsai algorithm is proposed, based only on measurements of distances between calibration points. This procedure reduces the time needed to obtain the calibration set and improves calibration accuracy. An algorithm for calculating PTZ values required for tracking of a moving object with the PTZ camera is also presented. The performance of the proposed algorithms is evaluated using the measured data.

Streszczenie W publikacji opisano podwójny system kamer składający się z kamery stacjonarnej i kamery PTZ, zastosowany w systemie monitoringu wizyjnego. Kamera PTZ jest automatycznie nastawiana na wybrany punkt w polu widzenia kamery stacjonarnej i może śledzić automatycznie obiekty ruchome. W tym celu zastosowano kalibrację systemu kamer. Kamera PTZ jest kalibrowana w odniesieniu do kadru kamery stacjonarnej przy użyciu interpolowanych tablic opisujący wartości parametrów pan i tilt. Kalibracja kamery stacjonarnej odbywa się przy pomocy zmodyfikowanej metody Tsai, w której wykorzystywane są tylko pomiary odległości pomiędzy punktami kalibracyjnymi. Proponowana procedura pozwala na skrócenie czasu potrzebnego do uzyskania danych kalibracyjnych i poprawia dokładność kalibracji. Ponadto opisano algorytm obliczania parametrów PTZ potrzebnych do śledzenia obiektów ruchomych. Dokładność algorytmu oceniono przy użyciu danych pomiarowych.

Entry No. 503

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Sposób i układ do bezkontaktowej interakcji z urządzeniami mobilnymi, zwłaszcza telefonem komórkowym

Notes ZGŁOSZENIE

Abstract Sposób i układ do bezkontaktowej interakcji z urządzeniami mobilnymi, zwłaszcza telefonem komórkowym wyposażony w ultradźwiękowy interfejs haptyczny.

Entry No. 504

Entry type

Authors A. Czyżewski

English title

Polish title Znak towarowy cyberoko

Streszczenie W 2014 r. zgłoszono do Urzędu Patentowego RP znak literowo-graficzny cyberoko

Entry No. 505

Entry type report

Authors P. Trella, A. Czyżewski

English title Scaling and rotation of the grid template for hand geometrics parametrization using fingertips position

Polish title Skalowanie i rotacja wzorca siatki binarnej do parametryzacji geometrii dłoni za pomocą lokalizacji przestrzennych opuszek palców

Report Number SAMS40/08/14

Abstract As a result of the previous work, a framework was developed for various parametrization methods to be used with SVM classifier. A template for the grid parametrization method was designed in GridCV filter. The aim of this work was to implement and train a working GridCV-parametrized SVM classifier. Next step was to invent a way of improving the accuracy of the GridCV parametrization method by altering the grid geometrically.

Entry No. 506

Entry type report

Authors P. Trella, A. Czyżewski

English title SGRSwF adaptation to work with the ToF camera stream based on camera static gesture recognition method using binary grid hand mask and the SVM classifier

Polish title

Report Number SAMS38/08/14

Abstract The aim of work described in this report was to adapt SGRE system to work with ToF camera stream. It was done by adding appropriate filters (described in earlier work [2], [3], [4]) to new processing graph in sgre_pc_ksm_demo application. To improve results of static gesture recognition another parametrization method was added.

Entry No. 507

Entry type report

Authors P. Trella, A. Czyżewski

English title Implementation of click gesture recognition method

Polish title

Report Number SAMS42/09/14

Abstract The task was to implement heuristic algorithm capable to recognize in-air click gesture and connect it as a filter into existing gesture recognition pipeline. Algorithm uses information from fingertips detection methods, described in earlier work and the work of other contributors, as well as SVM static gesture classifier to get good quality results. The algorithm is independent from used camera type (ToF. RGB), but performance is noticeable better when using ToF camera.

Entry No. 508

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Sposób i układ do bezdotykowej interakcji użytkownika z urządzeniami elektronicznymi i maszynami, zwłaszcza z telefonami i terminalami mobilnymi

Notes ZGŁOSZENIE

Abstract Sposób i układ do bezdotykowej interakcji użytkownika z urządzeniami elektronicznymi i maszynami, zwłaszcza z telefonami i terminalami mobilnymi wyposażony w ultradźwiękowy interfejs haptyczny.

Entry No. 509

Entry type report

Authors P. Klinke, A. Czyżewski

English title Implementation of fingertips counting method

Polish title Realizacja metody liczenia opuszek palców dłoni

Report Number SAMS37/07/14

Abstract The aim of this work was to develop methods capable of fingertips counting. FingersLogic filter, which was already capable of finding fingertips, is opposed to the older implementation of FingerTipsCV. The new method chosen to implement is a method of estimating a half-bounding circle and analyzing function of distance to the palm center. It was implemented as a new FingersCounter filter and later merged in ksm_pipeline with FingersLogic filter.

Entry No. 510

Entry type report

Authors P. Klinke, A. Czyżewski

English title Optimization of previously implemented methods for efficiency and robustness in different environments and creating the final documentation

Polish title Zoptymalizowanie wcześniej zaimplementowanych algorytmów pod kątem wydajności i skuteczności działania w różnych warunkach środowiskowych oraz sporządzenie dokumentacji końcowej

Report Number SAMS43/10/14

Abstract The aim of this work was to cut down on processing load caused by the existing methods by altering and simplifying the main filterchains. Another method of cutting down on complexity of the processing is adding methods that will simplify the input data by filtering the most distinctive parameters vector.

Entry No. 511

Entry type report

Authors M. Lech, P. Trella, P. Klinke, A. Czyżewski

English title Optimizing SGRSwF filters

Polish title

Report Number SAMS45/10/14

Abstract The aim of the work was to optimize filters in processing chain in sgre_pc_ksm_demo in order to increase system efficiency without degrading gesture recognition effectiveness.

Entry No. 512

Entry type report

Authors P. Trella, A. Czyżewski

English title Implementation of double click gesture recognition method

Polish title

Report Number SAMS48/11/14

Abstract The aim of work described in this report was to expand the list of recognized gestures in SGRS system on gesture similar to well-known from PC domain, double click gesture. The task was realized as extending existing in-air single click gesture detection algorithm, described in [4] with new functionality.

Entry No. 513

Entry type report

Authors P. Bratoszewski, A. Czyżewski

English title Hand detection and finger tracking algorithms using Time of Flight camera

Polish title

Report Number SAMS46/10/14

Abstract The aim of the work was to prepare the set of algorithms which enables hand (palm) detection, finger tips tracking and gesture recognition in the stream of the Time of Flight (ToF) camera. Two main applications of presented algorithms is gesture recognition and finger counting (presented in this document).

Entry No. 514

Entry type report

Authors P. Klinke, A. Czyżewski

English title Optimization of the fingertips detection method applied to detecting the pinch gesture

Polish title Zoptymalizowanie algorytmu śledzenia opuszek dwóch palców w zastosowaniu do powiększania i pomniejszania treści

Report Number SAMS47/11/14

Abstract The aim of this work was to construct an algorithm capable of detecting the pinch gesture and optimize the previously implemented fingertip detection methods to work well with that method.

Entry No. 515

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Sposób i układ do rozpoznawania gestów, zwłaszcza ruchu dłonią symulującego bezdotykowe pisanie na klawiaturze

Abstract W patencie opisano nowatorski sposób i układ do rozpoznawania gestów, zwłaszcza ruchu dłonią symulującego bezdotykowe pisanie na klawiaturze.

Entry No. 516

Entry type report

Authors M. Lech, A. Czyżewski

English title Parameters of algorithms used in sgre_pc_ksm_demo_conv application which affect efficiency and efficacy of gesture detection

Polish title

Report Number SAMS34/06/14

Abstract The aim of the work was to determine parameters of algorithms used in sgre_pc_ksm_demo_conv application which may affect both system efficiency and gesture recognition efficacy. In this context code optimization process could be viewed as finding the optimal values for the parameters.

Entry No. 517

Entry type report

Authors P. Klinke, A. Czyżewski

English title New parameters set proposition for static gesture recognition in SVM classifier

Polish title Propozycje parametrów dla rozpoznawania gestów statycznych za pomocą klasyfikatora SVM

Report Number SAMS35/06/14

Abstract The aim of this work was to present a new set of parameters for SVM classifier to work with. These parameters are an offshoot of the ConvexityFunction from FingersLogic filter.

Entry No. 518

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Sposób i układ do bezdotykowej interakcji użytkownika z urządzeniami elektronicznymi i maszynami, zwłaszcza z telefonami i terminalami mobilnymi

Notes ZGŁOSZENIE

Abstract Sposób i układ do bezdotykowej interakcji użytkownika z urządzeniami elektronicznymi i maszynami, zwłaszcza z telefonami i terminalami mobilnymi wyposażony w ultradźwiękowy interfejs haptyczny.

Entry No. 519

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Układ do pomiaru i analizy percepcji czuciowej dłoni

Notes ZGŁOSZENIE

Abstract Układ do pomiaru i analizy percepcji czuciowej dłoni wykorzystujący matrycę w kształcie rękawicy pozwalający na obiektywizację pomiarów dzięki zastosowaniu obiektywnych metod analizy m. in. EMG i EEG.

Entry No. 520

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski

English title Video analytics-based algorithm for monitoring egress from buildings

Polish title

Journal Multimedia Tools and Applications

Volume 75

Number

Pages 10733 - 10743

Notes Wydane jako "Online First, Open Access", opublikowano online" 19 czerwca 2014

Abstract A concept and a practical implementation of the algorithm for detecting of potentially dangerous situations related to crowding in passages is presented. An example of such a situation is a crush which may be caused by an obstructed pedestrian pathway. The surveillance video camera signal analysis performed in the online mode is employed in order to detect hold-ups near bottlenecks like doorways or staircases. The details of the implemented algorithm which uses the optical flow method combined with fuzzy logic are explained. The experiments were carried out on a set of gathered video recordings from the surveillance camera installed in the campus of Gdansk University of Technology. The results of experiments performed on gathered video recordings shows high efficiency of the algorithm.

Entry No. 521

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Adaptacja czasowa modelu koloru skóry w algorytmie segmentacji koloru skóry

Report Number SAMS25/03/14

Streszczenie Zadanie polegało na poszerzeniu możliwości algorytmu segmentacji dłoni przy wykorzystaniu koloru skóry (ColorSegmentation), o możliwość adaptacji czasowej modelu koloru skóry. Adaptacja ta ma w założeniu zwiększać odporność segmentacji na błędy i być niewidoczna dla użytkownika (brak fazy treningu algorytmu).

Entry No. 522

Entry type journal paper

Authors P. Dalka, D. Ellwart, G. Szwoch, K. Lisowski, P. Szczuko, A. Czyżewski

English title Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification

Polish title

Journal Studies in Computational Intelligence

Volume 584

Number

Pages 263 - 303

Notes Feature Selection for Data and Pattern Recognition, (eds) U. Stańczyk, LC. Jain

Abstract A comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification of objects in a multi-camera environment. The analysis is performed using two datasets containing images of humans and vehicles recorded with different cameras. For the purpose of descriptor evaluation, scatter and clustering measures are supplemented with a new measure that is derived from calculating direct dissimilarities between pairs of images. In order to draw conclusions from multi-dataset analysis, four aggregation measures are introduced. They are meant to find descriptors that provide the best identification effectiveness, based on the relative ranking, and simultaneously are characterized with large stability (invariance to the selection of objects in the dataset). Proposed descriptors are evaluated practically with object re-identification experiments involving four classifiers to detect the same object after its transition between cameras’ fields of view. The achieved results are discussed in detail and illustrated with figures.

Entry No. 523

Entry type report

Authors P. Trella, A. Czyżewski

English title The use of hand depth ToF camera based measurement to control thresholding level, and to control the size of the template in the Min Max Circles method

Polish title

Report Number SAMS28/03/14

Abstract The task was to extend processing graph to dynamically adapt algorithms accordingly to current z-positon of hand. Approach described in this report, should allow the user to move hand backwards and towards camera without losing detection and tracking efficiency.

Entry No. 524

Entry type report

Authors P. Trella, A. Czyżewski

English title Implementation of algorithms to recognize dynamic gestures based on spatial location of fingertips

Polish title

Report Number SAMS36/06/14

Abstract The task was an implementation of algorithms to recognize dynamic gestures on spatial location of fingertips and more specifically a virtual mouse click gesture. Gesture recognition pipeline, described in this report, was attached to existing gesture recognition processing graph. The greater part of the time spent on this task has been devoted to the optimization and cleanup official demo application source code, what will be necessary in developing interactive demo application in near future.

Entry No. 525

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title Modelling Object Behaviour in a Video Surveillance System Using Pawlak’s Flowgraph

Polish title

Conference Multimedia Communications, Services and Security

Preprint

Number

Volume 429

Pages 122 - 136

Conference site Kraków, Polska

Conference date 11.6.2014- 12.6.2014

Abstract In this paper, methodology of acquisition and processing of video streams for the purpose of modelling object behaviour is presented. Multilevel contextual video processing was also mentioned. The Pawlak’s flowgraph is used as a container for the knowledge related to the behaviour of objects in the area supervised by a video surveillance system. Spatio-temporal dependencies in transitions between cameras can be easily changed in real-life situations. In order to cope with such fluctuating conditions, an adaptive algorithm is implemented. Consequently, as it was shown the flowgraph reacts faster to the occurring changes.

Entry No. 526

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Matryca przetworników ultradźwiękowych do bezkontaktowej interakcji użytkownika z urządzeniami mobilnymi

Notes ZGŁOSZENIE - WZÓR UŻYTKOWY

Abstract Matryca przetworników ultradźwiękowych do bezkontaktowej interakcji użytkownika z urządzeniami mobilnymi pozwalająca na bezdotykową interakcję użytkownika z maszynami.

Entry No. 527

Entry type report

Authors M. Lech, A. Czyżewski

English title

Polish title Klasyfikatory SVM i OPF w środowisku SGRSwF

Report Number SAMS19/01/14

Streszczenie Celem wykonanej pracy była implementacja dwóch klasyfikatorów zdolnych do rozpoznawania kształtów dłoni w systemie SGRSwF z dużą skutecznością. Na podstawie doniesień literaturowych do implementacji wybrano klasyfikatory SVM (ang. Support Vector Machine) i OPF (ang. Optimum-Path Forest).

Entry No. 528

Entry type conference paper

Authors M. Lech, P. Dalka, G. Szwoch, A. Czyżewski

English title Examining Quality of Hand Segmentation Based on Gaussian Mixture Models

Polish title Badanie jakości segmentacji dłoni wykonywanej w oparciu o modele sum gaussowskich

Conference Multimedia Communications, Services and Security

Preprint 978-3-319-07568-6

Number

Volume 429

Pages 111 - 121

Conference site Kraków, Polska

Conference date 11.6.2014- 12.6.2014

Bibliographic No. 14

Notes w serii: Communications in Computer and Information Science

Abstract Results of examination of various implementations of Gaussian mixture models are presented in the paper. Two of the implementations belonged to the Intel’s OpenCV 2.4.3 library and utilized Background Subtractor MOG and Background Subtractor MOG2 classes. The third implementation presented in the paper was created by the authors and extended Background Subtractor MOG2 with the possibility of operating on the scaled version of the original video frame and additional image post-processing phase. The algorithms have been evaluated for various conditions related to stability of background. The quality of hand segmentation when a whole user’s body is visible in the video frame and when only a hand is present has been assessed. Three measures, based on false negative and false positive errors, were calculated for the assessment of segmentation quality, i.e. precision, recall and accuracy factors.

Entry No. 529

Entry type journal paper

Authors K. Łopatka, A. Czyżewski

English title Adaptive acoustic crosstalk cancellation in mobile computer device

Polish title Adaptacyjne usuwanie przesłuchu w przenośnym urządzeniu komputerowym

Journal Elektronika: konstrukcje, technologie, zastosowania

Volume

Number 4

Pages 32 - 36

Abstract The cancellation of acoustic crosstalk is employed to enhance the stereo image in mobile listening conditions. A practical setup employing a mobile computer is employed. The adaptation of the crosstalk cancellation filter to the position of the listener's head is featured. The measurement evaluating the possibility of practical application of the method are described. The head and torso simulator was used for measurements. The spatial and spectral efficiency of the algorithm was evaluated. The results of the measurements show that the algorithm is effective in a limited frequency range and in a narrow sweet spot.

Streszczenie Zastosowano usuwanie przesłuchu akustycznego w celu wzbogacenia bazy stereofonicznej w przenośnych warunkach odsłuchu. Przedstawiono adaptację filtra usuwającego przesłuch do zmiany położenia głowy słuchacza. Opisano eksperymenty oceniające możliwość praktycznego zastosowania metody. W pomiarach wykorzystano symulator głowy i torsu. Zbadano przestrzenną i widmową rozdzielczość algorytmu. Wyniki eksperymentu dowodzą, że metoda jest skuteczna w ograniczonym zakresie częstotliwości i w wąskim punkcie w przestrzeni.

Entry No. 530

Entry type conference paper

Authors K. Łopatka, P. Suchomski, A. Ciarkowski, P. Odya, A. Czyżewski

English title Fitting the mobile device characteristics to the user's hearing preferences

Polish title Dopasowanie charakterystyki urządzenia przenośnego do preferencji słuchowych użytkownika

Conference 136th AES Convention

Preprint 147

Number

Volume

Pages

Conference site Berlin, Niemcy

Conference date 26.4.2014- 29.4.2014

Notes Convention E-brief

Abstract A method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both for the user's hearing preferences (or possible deficiencies) and for the playback characteristics of the device. The solution is implemented as a standalone PC calibration application and as an APO object installed in the system's audio driver.

Streszczenie Zaproponowano metodę dopasowania dźwięku w urządzeniu mobilnym do preferencji słuchowych użytkownika. Proces składa się z dwóch etapów: kalibracji i przetwarzania dynamiki. Podczas kalibracji użytkownik wykonuje test skalowania głośności, podając odpowiedzi dotyczące postrzeganej głośności. Przetwarzanie dynamiki dokonywane na tej podstawie dopasowuje głośność sygnału do komfortowego poziomu dla użytkownika. Przetwarzanie to uwzględnia zarówno preferencje słuchowe użytkownika (w tym możliwe ubytki słuchu) i charakterystykę odtwarzania dźwięku przez urządzenie. Rozwiązanie jest zaimplementowane jako aplikacja kalibracyjna na PC i obiekt APO zainstalowany w sterowniku audio systemu operacyjnego.

Entry No. 531

Entry type conference paper

Authors K. Łopatka, J. Kotus, A. Czyżewski

English title Evaluation of Sound Event Detection, Classification and Localization in the Presence of Background Noise for Acoustic Surveillance of Hazardous Situations

Polish title Badanie detekcji, klasyfikacji i lokalizacji zdarzeń dźwiękowych w obecności zakłóceń do zastosowań w nadzorze akustycznym nad niebezpiecznymi sytuacjami

Conference 7th International Conference on Multimedia Communications Services and Security, 2014

Preprint

Number

Volume 429

Pages 96 - 110

Conference site Kraków, Polska

Conference date 11.6.2014- 12.6.2014

Notes Wydano w serii Communications in Computer and Information Science

Abstract An evaluation of the sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for separating foreground events from the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the classifier are introduced. The sound source localization algorithm based on the analysis of multichannel signals from the Acoustic Vector Sensor is presented. The methods are evaluated in an experiment conducted in the anechoic chamber, in which the representative events are played together with noise of differing intensity. The results of detection, classification and localization accuracy with respect to the Signal to Noise Ratio are discussed. The algorithms presented are part of an audio-visual surveillance system.

Streszczenie Przedstawiono badanie skuteczności algorytmów detekcji, klasyfikacji i lokalizacji zdarzeń dźwiękowych w obecności zakłóceń różnego typu i o różnej intensywności. Wprowadzono metody detekcji zdarzeń na tle zakłóceń. Opisano klasyfikator oparty na algorytmie maszyny wektorów wspierających. Wprowadzono cechy sygnału wykorzystane do treningu klasyfikatora. Przedstawiono algorytm lokalizacji zdarzeń dźwiękowych oparty na analizie wielokanałowego sygnału pochodzącego z wektorowego czujnika akustycznego. Metody zostały zbadane podczas eksperymentu przeprowadzonego w komorze bezechowej, podczas którego przykładowe zdarzenia były odtwarzane z towarzyszeniem szumu o zmiennym natężeniu. Omówiono wyniki detekcji, klasyfikacji i lokalizacji zdarzeń dźwiękowych w zależności od stosunku sygnału do szumu. Przedstawione algorytmy są częścią systemu automatycznego nadzoru wizyjno-fonicznego.

Entry No. 532

Entry type journal paper

Authors M. Lech, A. Czyżewski, W. Kucharski, B. Kostek

English title Computer-Supported Polysensory Integration Technology for Educationally Handicapped Pupils

Polish title

Journal Lecture Notes in Artificial Intelligence

Volume 8502

Number

Pages 224 - 233

Bibliographic No. 17

Notes tytuł monografii: Foundations of Intelligent Systems, materiały konferencyjne z 21st International Symposium on Methodologies for Intelligent Systems, ISMIS 2014, Roskilde, Dania

Abstract In this paper, a multimedia system providing technology for hearing and visual attention stimulation is shortly presented. The system aims to support the development of educationally handicapped pupils. The system has been presented in the context of its configuration, architecture, and therapeutic exercise implementation issues. Results of pupils’ improvements after 8 weeks of training with the system are also provided. Training with the system led to the development spatial orientation and understanding cause-and-effect relationships.

Entry No. 533

Entry type report

Authors P. Klinke, A. Czyżewski

English title Skin color models used in color segmentation algorithms – research and implementation

Polish title Modele koloru skóry używane w algorytmach segmenacji tła - porównanie i implementacja

Report Number SAMS32/05/14

Abstract The aim of this work is to present another color segmentation method to the RGB camera stream filter pipeline. This segmentation, previously used in stereovision, was moved into the standard RGB filterchain.

Entry No. 534

Entry type report

Authors M. Lech, A. Czyżewski

English title Collecting data sets of hand shapes and training SVM

Polish title

Report Number SAMS33/06/14

Abstract The aim of the work was to extend the Support Vector Machines filter providing the possibility of creating large data sets of hand shapes in many runs.

Entry No. 535

Entry type conference paper

Authors B. Kunka, T. Sanner, A. Czyżewski, A. Kwiatkowska

English title Consciousness Study of Subjects with Unresponsive Wakefulness Syndrome Employing Multimodal Interfaces

Polish title Badanie świadomości pacjentów z zaburzoną świadomością z zastosowaniem interfejsów multimodalnych

Conference The 2014 International Conference on Brain Informatics and Health (BIH'2014)

Preprint LNAI

Number

Volume 8609

Pages 57 - 67

Conference site Warszawa, Polska

Conference date 11.8.2014- 14.8.2014

Bibliographic No. 19

Notes Springer International Publishing Switzerland 2014

Abstract The paper presents a novel multimodal-based methodology for consciousness study of individuals with unresponsive wakefulness syndrome. Two interfaces were employed in the experiments: eye gaze tracking system – CyberEye developed at the Multimedia Systems Department, and EEG device with electrode placement in the international 10-20 standard. It was a pilot study for checking if it is possible to determine objective methods based on multimodal techniques which could replace or support current expensive and difficult to access neuroimaging techniques, like fMRI, PET, utilizing in evaluation of consciousness state. The multimodal-based methodology consists of several phases of research involving subjects. Hearing examination based on objective methods (OAE, ABR), consciousness test based on analysis of visual activity, examination of visual neural pathway with Steady State Visually Evoked Potentials and EEG-based comprehension test were proposed. The results obtained within conducted experiments and presented in this paper suggest that proposed objective-subjective methodology could potentially be introduced into clinical facilities after further validation.

Entry No. 536

Entry type conference paper

Authors B. Kunka, A. Korzeniewski, B. Kostek, A. Czyżewski

English title Eye-Gaze Tracking-Based Telepresence System for Videoconferencing

Polish title System teleobecności oparty na technice śledzenia wzroku wykorzystywany w wideokonferencji

Conference The 2014 International Conference on Active Media Technology (AMT'2014)

Preprint LNCS

Number

Volume 8610

Pages 432 - 441

Conference site Warszawa, Polska

Conference date 11.8.2014- 14.8.2014

Bibliographic No. 23

Notes D. Ślȩzak et al. (Eds.): AMT 2014, LNCS 8610, pp. 432-441. Springer International Publishing Switzerland (2014)

Abstract An approach to the teleimmersive videoconferencing system enhanced by the pan-tilt-zoom (PTZ) camera, controlled by the eye-gaze tracking system, is presented in this paper. An overview of the existing telepresence systems, especially dedicated to videoconferencing is included. The presented approach is based on the CyberEye eye-gaze tracking system engineered at the Multimedia Systems Department (MSD) of Gdańsk University of Technology (GUT), as well as on a standard PTZ security camera communicating with the computer by the TCP/IP protocol. Technical aspects of the developed system prototype including two different use cases (one-way and two-way configuration of system) are described. Moreover, a discussion related to the gathered user’s experience as well as to difficulties and opportunities concerning the proposed approach are included.

Entry No. 537

Entry type journal paper

Authors K. Łopatka, A. Czyżewski

English title Acceleration of decision making in sound event recognition employing supercomputing cluster

Polish title Przyspieszenie podejmowania decyzji w rozpoznawaniu niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem klastra superkomputerowego

Journal Information Sciences

Volume 285

Number

Pages 223 - 236

Notes http://www.sciencedirect.com/science/article/pii/S0020025513008414

Abstract Parallel processing of audio data streams is introduced to shorten the decision making time in hazardous sound event recognition. A supercomputing cluster environment with a framework dedicated to processing multimedia data streams in real time is used. The sound event recognition algorithms employed are based on detecting foreground events, calculating their features in short time frames, and classifying the events with Support Vector Machine. Different strategies for improving the decision time are introduced. The experiments with the presented strategies are conducted and the results are presented.

Streszczenie W pracy wprowadzono równoległe przetwarzanie strumieni danych fonicznych w celu skrócenia czasu podejmowania decyzji w systemie rozpoznawania zdarzeń dźwiękowych związanych z zagrożeniem. Wykorzystano środowisko klastra superkomputerowego ze specjalnym środowiskiem przetwarzania strumieni danych multimedialnych. Opracowane algorytmy rozpoznawania zdarzeń dźwiękowych oparte są na wykrywaniu zdarzeń, obliczaniu parametów zdarzenia w ramkach i klasyfikacji z wykorzystaniem algorytmu maszyny wektorów wspierających. Wprowadzono różne strategie zrównoleglenia algorytmów w celu skrócenia czasu podejmowania decyzji. Opsiano eksperymenty porównujące wydajności uzyskane dla poszczególnych podejść i przedyskutowano wyniki.

Entry No. 538

Entry type conference paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title

Polish title Modelowanie propagacji hałasu i jego wpływu na słuch z wykorzystaniem platformy obliczeniowej PL Grid Plus

Conference XV Międzynarodowe Sympozjum Nowości w Technice Audio i Wideo

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 25.9.2014- 27.9.2014

Streszczenie W referacie przedstawiono usługi dostępne w gridzie dziedzinowym Akustyka, opracowane w ramach projektu PL Grid Plus. Przygotowane usługi umożliwiają modelowanie propagacji hałasu w środowisku aglomeracji miejskiej pochodzącego ze źródeł liniowych (drogi), punktowych lub powierzchniowych (hałas przemysłowy, imprezy plenerowe) z wykorzystaniem klastrów obliczeniowych. Na podstawie uzyskanych wyników rozkładu poziomu hałasu możliwe jest przeprowadzenie dalszych symulacji ukazujących skutki słuchowe oddziaływania hałasu na organ słuchu. Opracowane narzędzia dają duże możliwości w zakresie definiowania scenariuszy obliczeniowych i sytuacyjnych dzięki temu mają istotny walor dydaktyczny i poznawczy.

Entry No. 539

Entry type conference paper

Authors M. Szczodrak, J. Kotus, A. Czyżewski, B. Kostek

English title Application of PL-Grid platform for modeling of the selected acoustic phenomena

Polish title

Conference CGW (Cracow Grid Workshop) 2014

Preprint

Number

Volume

Pages

Conference site Kraków, Polska

Conference date 27.10.2014- 29.10.2014

Abstract Domain grids are specific computational environments, developed within the PLGrid Plus project. For the Acoustic domain grid two supercomputer grid based services were prepared. Dedicated software consists of the outdoor sound propagation module and psychoacoustical noise dosimeter. The results are presented in a form of maps of sound level and Temporary Threshold Shift (TTS) values, therefore the services may play an informative role in the field of noise harmfulness.

Entry No. 540

Entry type conference paper

Authors P. Dalka, P. Bratoszewski, A. Czyżewski

English title Visual Lip Contour Detection for the Purpose of Speech Recognition

Polish title

Conference International Conference on Signals and Electronic Systems

Preprint

Number

Volume

Pages

Conference site Poznań, 2014

Conference date 11.9.2014- 13.9.2014

Notes Papers presented at the conference will be included in the IEEE Xplore and Web of Science database

Abstract A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are monitored in order to prevent the matching process from converging to a false local minimum. AAM-based visual features are applied in an experiment devoted to the static recognition of English vowels with SVM. Studies are carried out based on a database of recordings of 5 speakers of different skin colors. Results are thoroughly discussed and illustrated with figures.

Entry No. 541

Entry type conference paper

Authors P. Bratoszewski, J. Cichowski, A. Czyżewski

English title Examining Acoustic Emission of Engineered Ultrasound Loudspeakers

Polish title

Conference The IEEE Conference "Signal Processing: Algorithms, Architectures, Arrangements, and Applications" (SPA)

Preprint

Number

Volume

Pages 60 - 65

Conference site Poznań, Polska

Conference date 22.9.2014- 24.9.2014

Notes ISBN 978-83-62065-18-9

Abstract Measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of the human auditory system. Based on the measurements of the sound emitted from the two parametric arrays of ultrasonic transducers the directivity of the proposed system and the interaural crosstalk characteristics are determined. The application of the system concerns creating a personal audio space for users of mobile platforms, such as notebooks, and applying 3D audio algorithms without the need of using headphones.

Entry No. 542

Entry type conference paper

Authors P. Bratoszewski, K. Łopatka, A. Czyżewski

English title Examining Influence Of Video Framerate And Audio/Video Synchronization On Audio-Visual Speech Recognition Accuracy

Polish title

Conference 15th International Symposium on New Trends in Audio and Video

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 25.9.2014- 27.9.2014

Notes ISBN: 978-92-921663-5-1

Abstract The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The initial video framerate equals 100 frames per second. The test signals were recorded with a specialized hardware for synchronous registration of audio and video data. In a practical implementation, however, it is difficult to achieve a high rate of images per second and maintain the precise audio/video synchronization. Therefore, in this work it is assessed, how the lowered framerate and lack of synchronization be-tween audio and video data impairs the performance of the recognition engine. The lowered video framerate is enforced by downsampling the visual data. The lack of synchronization is simulated programmatically in the feature fusion process. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit). Word Error Rate, correctness and accuracy measures are considered and a small dictionary of 11 words (numerals) is employed

Entry No. 543

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski, J. Kotus, B. Kostek

English title Frequently updated noise threat maps created with use of supercomputing grid

Polish title

Journal Noise Mapping

Volume 1

Number

Pages 32 - 39

Abstract An innovative supercomputing grid services devoted to noise threat evaluation were presented. The services described in this paper concern two different issues, first is the noise mapping, while the second concerns assessment of the noise dose and its influence on the human hearing system. The discussed services were developed within PL-Grid Infrastructure which accumulates Polish academic supercomputer centers. Selected experimental results achieved by usage of the services were presented. The assessment of environmental noise threats include creation of the noise maps using either offline or online data, acquired through a grid of monitoring stations. A concept of estimation of the source model parameters based on the measured sound level for the purpose of creating frequently updated noise maps was presented. Connection of the noise mapping grid service with a distributed sensor network enables to automatically update the noise maps for a specified time period. Moreover, an exceptional attribute of the developed software is the estimation of the auditory effects evoked by the exposure to noise. The estimation method uses a modified psychoacoustic model of hearing and is based on the calculated noise level values and on a given exposure period. Potential use scenarios of the grid services for research or educational purpose were introduced. Presentation of the results of predicted hearing threshold shift caused by exposure to excessive noise can disseminate awareness of the noise threat in public.

Entry No. 544

Entry type journal paper

Authors A. Czyżewski, P. Dalka, Ł. Kosikowski, B. Kunka, P. Odya

English title Multimodal human-computer interfaces based on advanced video and audio analysis

Polish title Multimodalne interfejsy komputerowe bazujące na zaawansowanych metodach analizy dźwięku i obrazu

Journal Advances in Intelligent Systems and Computing

Volume 300

Number

Pages 87 - 102

Notes Human-Computer Systems Interaction: Backgrounds and Applications 3

Abstract Multimodal interfaces development history is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing the UPDRS (Unified Parkinson Disease Rating Scale) scale which is used to evaluate motor and behavioral symptoms of the Parkinson’s disease, based on the multimodal interface called Virtual-Touchpad (VTP) used for supporting medical diagnosis. The scent emitting multimodal computer interface provides an important supplement of the polysensoric stimulation process, playing an essential role in education and therapy of children with certain developmental disorders. The Smart Pen providing a tool for supporting therapy of developmental dyslexia is presented and results achieved with its application are discussed. The eye-gaze tracking system named Cyber Eye, developed at the Multimedia Systems Department employed to many kinds of experiments is presented including analysis of visual activity of patients remaining in vegetative state and their awareness evaluation. The paper is concluded with some general remarks concerning the role of multimodal computer interfaces applied to learning, therapy and everyday usage of computerized devices.

Entry No. 545

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title Modelling Object Behaviour in a Video Surveillnace System Using Pawlak's Flowgraph

Polish title

Conference MCSS 2014 - Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages 122 - 136

Conference site Kraków, Polska

Conference date 11.6.2014- 12.6.2014

Abstract In this paper, methodology of acquisition and processing of video streams for the purpose of modelling object behaviour is presented. Multilevel contextual video processing was also mentioned. The Pawlak’s flowgraph is used as a container for the knowledge related to the behaviour of objects in the area supervised by a video surveillance system. Spatio-temporal dependencies in transitions between cameras can be easily changed in reallife situations. In order to cope with such fluctuating conditions, an adaptive algorithm is implemented. Consequently, as it was shown the flowgraph reacts faster to the occurring changes.

Entry No. 546

Entry type conference paper

Authors J. Kotus, A. Ciarkowski, A. Czyżewski

English title Auto adaptation of mobile device characteristics to various acoustic conditions

Polish title

Conference 136th AES Convention

Preprint eBrief 136

Number

Volume

Pages

Conference site Berlin, Germany

Conference date 26.4.2014- 29.4.2014

Abstract The proposed methodology of auto adaptation of the mobile device characteristics to various acoustic conditions is presented in the paper. The first goal of this study was to determine the parameters of the acoustic path of the mobile device, for both transmitting (speaker) and receiver (microphone). Results of the measurement of characteristics of mobile devices were presented. Information about characteristics of individual parts of the sound path were used to design and to develop a technique of linearization of the device frequency response characteristics. Preliminary results obtained with the proposed methodology are presented. The performed research evolved into the design of an adaptive self-linearization method which compensates for the changing of acoustic conditions through continuous monitoring and regulating the audio settings.

Entry No. 547

Entry type report

Authors P. Trella, A. Czyżewski

English title

Polish title Dostosowanie algorytmów detekcji i śledzenia opuszków palców do współpracy z kamerą ToF w systemie SGRSwF – część druga

Report Number SAMS22/02/14

Streszczenie Zadanie polegało na adaptacji algorytmów z łańcucha przetwarzania opartego o obraz z kamery RGB, składającego się z kaskadowo połączonych filtrów ColorSegmentation (segmentacja dłoni), MinMaxCircles (detekcja opuszków palców), NPointTracker (śledzenie punktów - 2 w tym przypadku), oraz NPointDrawer (rozpoznawanie gestów oraz prezentacja wyników przetwarzania w postaci punktu odpowiadającego pozycji wirtualnego kursora na ekranie komputera) do nowego łańcucha opartego o obraz z kamery ToF. Rolę segmentacji dłoni przyjął nowy filtr progowania mapy głębi za pomocą technik rozmytych o nazwie FuzzyThreshold.

Entry No. 548

Entry type report

Authors M. Lech, P. Trella, P. Klinke, A. Czyżewski

English title

Polish title Zestawienie tabelaryczne aplikacji demonstracyjnych w systemie SGRSwF

Report Number SAMS23/02/14

Streszczenie W raporcie zebrano podstawowe informacje dotyczące opracowanych przez Katedrę Systemów Multimedialnych aplikacji demonstracyjnych w systemie SGRSwF.

Entry No. 549

Entry type conference paper

Authors A. Czyżewski, B. Kostek, P. Odya, B. Kunka, M. Lech

English title Intelligent multimodal human-computer interfaces

Polish title

Conference The 2014 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology

Preprint

Number

Volume

Pages 8 - 11

Conference site Warszawa, Polska

Conference date 11.8.2014- 14.8.2014

Notes Tutorial

Abstract Multimodal interfaces development history will be reviewed briefly in the introduction to the tutorial. Methods for intelligent processing of audio and video will be discussed in the context of their applications to multimodal human-computer interfaces. Some examples of applications of multimodal interfaces to education software and for the disabled people will be shown, including the eye-gaze tracking system named “Cyber Eye” employed to many kinds of experiments including analysis of visual activity of patients remaining in vegetative state and their awareness evaluation. The scent emitting multimodal computer interface, playing an essential role in education and therapy of children with certain developmental disorders will serve as one more practical example of applications. The multimodal interface called Virtual-Touchpad (VTP) used for supporting medical diagnosis will be presented also. The role of multimodal computer interfaces applied to learning, therapy and everyday usage of computerized devices will be illustrated by above mentioned and by some more practical examples. Moreover, the subject of intelligent audio & video surveillance providing a special case of multimodal interfacing will be addressed and illustrated with practical application examples.

Entry No. 550

Entry type conference paper

Authors B. Kostek, A. Kurowski, P. Kryger, A. Czyżewski

English title Sound Field Intensity Measurements and Visualization around the Human Head Model

Polish title Rozkłąd natężenia pola akustycznego w komorze bezechowej obecności sztucznej głowy i w przypadku braku jej obecności

Conference 137th Audio Eng. Society Convention

Preprint 160

Number

Volume

Pages 1 - 4

Conference site Los Angeles, USA

Conference date 9.9.2014- 12.9.2014

Bibliographic No. NCN

Notes link: http://www.aes.org/e-lib/browse.cfm?elib=17395, do projektu NCN

Abstract The main goal of this research study was to measure and visualize sound field intensity distribution in and without presence of the human head model. Measurements were performed in the anechoic chamber with the 5 cm grid. Experimental setup consisted of a multitone generator, two loudspeakers, human head model, intensimetric probe, the Cartesian robot applied for precise positioning of the acoustic sensor, and an analyzer. Based on the collected data a sound field visualization was created in the form of colored maps and arrows illustrating pressure and intensity vectors at a given point in the presence of the artificial head, as well as without this obstacle plus the difference resulted from the both mentioned conditions occurrence. A thorough analysis of the results obtained and conclusions follows the experiments presented in the paper.

Streszczenie W referacie przedstawiono wyniki analizy pomiarów akustycznych przeprowadzonych w komorze bezechowej z wykorzystaniem sondy USP. Na podstawie wyników uzyskanych z pomiarów ciśnienia akustycznego i prędkości cząstek dokonano wizualizacji rozkładu pola akustycznego w obecności sztucznej głowy i w przypadku braku jej obecności.

Entry No. 551

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Implementacja sprzężenia zwrotnego w łańcuchu filtrów z wykorzystaniem funkcji IConfigurator

Report Number SAMS24/02/14

Streszczenie Podstawą systemu SGRS jest przetwarzanie potokowe, jednokierunkowe. Jednakże wykorzystanie sprzężenia zwrotnego w niektórych łańcuchach przetwarzania jest niezbędne do ich prawidłowego działania. Praca ta demonstruje jak projektować filtry ze sprzężeniem zwrotnym oraz prezentuje przykładową aplikację demonstracyjną z wykorzystaniem sprzężenia zwrotnego.

Entry No. 552

Entry type report

Authors A. Czyżewski, J. Cichowski, W. Moskwa, B. Hermanowicz

English title Documentation of the Prototypes - Generator of Ultrasonic Haptic Feedback

Polish title

Report Number SAMS20/01/14

Abstract This document summarizes the research carried out from 2013-09-01 to 2011-01-31, it is deliverable of the task 6. The main objective of the mentioned task was the realization of the prototype to produce a concentrated beam of ultrasound. The GoUSHF (Generator of Ultrasonic Haptic Feedback) is designed using modular approach, the two modules are required to setup working prototype. The following sections present the realized PCBs (printed circuits boards) for logic and ultrasound units. The simulations of the implemented beamforming routines are presented for both realized ultrasonic arrays. The electrical schemas and bill of materials for specific prototypes were mentioned in the document entitled “H/W Low Level Design” and are not repeated in this report. The list of future work is presented at the end of the document.

Entry No. 553

Entry type report

Authors P. Klinke, A. Czyżewski

English title

Polish title Implementacja bloku decyzyjnego dla ekstrakcji pięciu opuszków palców na podstawie obrysu dłoni

Report Number SAMS21/01/14

Streszczenie System SGRS zawiera wiele implementacji różnych algorytmów rozpoznawania położenia opuszków palców. Praca ma na celu połączenie zaimplementowanych metod w jednej aplikacji demonstracyjnej. Użycie nowych narzędzi systemu SGRS pozwoli na wyświetlenie wyników wielu metod klasyfikacji naraz w celu łatwego porównania działania poszczególnych metod.

Entry No. 554

Entry type report

Authors S. Marszałkowski, A. Czyżewski

English title Hand Gesture Recognition using Histogram of Oriented Gradients and Support Vector Machine

Polish title

Report Number SAMS29/04/14

Abstract The method is based on evaluating well-normalized local histograms of image gradient orientations in a dense grid. The basic idea is that local object appearance and shape can often be characterized rather well by the distribution of local intensity gradients or edge directions, even without precise knowledge of the corresponding gradient or edge positions. In practice this is implemented by dividing the image window into small spatial regions (“cells”), for each cell accumulating a local 1-D histogram of gradient directions or edge orientations over the pixels of the cell. The combined histogram entries form the representation. For better invariance to illumination, shadowing, etc., it is also useful to contrast-normalize the local responses before using them. This can be done by accumulating a measure of local histogram “energy” over somewhat larger spatial regions (“blocks”) and using the results to normalize all of the cells in the block. We will refer to the normalized descriptor blocks as Histogram of Oriented Gradient (HOG) descriptors. Tiling the detection window with a dense (in fact, overlapping) grid of HOG descriptors and using the combined feature vector in a conventional SVM based window classifier gives our human detection chain.

Entry No. 555

Entry type report

Authors P. Trella, A. Czyżewski

English title The use of confidence map from Time-of-Flight camera id order to improve accuracy of gesture recognition in SGRSwF

Polish title

Report Number SAMS30/04/14

Abstract The depth image given by Time-of-Flight camera is often very noisy in places where there is no object in camera range. This type of noise can have higher level than useful signal but it appears only in described places. The task was to use (next to “depth”) an additional parameter called “confidence” to filter out that noise.

Entry No. 556

Entry type report

Authors P. Klinke, A. Czyżewski

English title Dynamic processing methods - particle filter

Polish title Metody przetwarzania dynamicznego - filtr cząsteczkowy

Report Number SAMS31/04/14

Abstract The aim of this work is to research methods of dynamic processing of video stream. Various methods are discussed, including Meanshift and Camshift methods, sequential Monte Carlo algorithm and Optical Flow analysis. Usage examples of every algorithm are proposed.

Entry No. 557

Entry type journal paper

Authors J. Cichowski, A. Czyżewski

English title ACTIVE RFID SYSTEM FOR OBJECTS LOCALIZATION AND IDENTIFICATION IN MULTIMODAL SURVEILLANCE INFRASTRUCTURE

Polish title Aktywny system RFID do lokalizacji i identyfikacji obiektów w wielomodalnej infrastrukturze bezpieczeństwa

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages 755 - 759

Abstract Przedstawiono prace koncepcyjne, badawcze oraz implementacyjne skoncentrowane na praktycznej realizacji systemu detekcji obiektów z wykorzystaniem kamer wizyjnych i identyfikacji radiowej. Zaproponowano rozbudowę wielomodalnego teleinformatycznego systemu bezpieczeństwa o warstwę identyfikacji radiowej obiektów. Omówiono założenia zaprojektowanego systemu oraz opracowaną warstwę sprzętową. Zaproponowano i przedyskutowano praktyczne zastosowania opisanego systemu.

Streszczenie Research and implementation focused on object detection application bases on video cameras and radio frequency identification are presented. The enhancement of the existing multimodal surveillance systems using radio modality for robust object detection is proposed. Assump-tions of the developed system together with the background of the implementation of the hardware layer are briefly described. The practical application and the preliminary results of the experiments are discussed.

Entry No. 558

Entry type report

Authors P. Bratoszewski, A. Czyżewski

English title Palm Gesture Recognition using shape subsampling method and SVM classification

Polish title

Report Number SAMS49/11/14

Abstract The aim of the work was to propose the method for hand gesture recognition. The proposed method is based on the analysis of the shape of user’s hand extracted from depth image acquired by Time of Flight (ToF) camera. Shape analysis results with number of parameters which are passed to the classification block which employs the Support Vector Machine (SVM) method.

Entry No. 559

Entry type report

Authors S. Marszałkowski, A. Czyżewski

English title Hand Gesture Recognition using Histogram of Oriented Gradients and Support Vector Machine v2 & v3

Polish title

Report Number SAMS50/11/14

Abstract The first section of this report describes three versions of application; the first version was introduced in previous report, so the attention is focused on second and third version. Second enables classifying only one gesture, but it allows for multi-scale window detection. Third enables multi-class detection but without multi-scaling, so the source image/frame has to be correctly resized. The following sections present the library for Support Vector Machine and for managing files. There also can be found a description of the selected image dataset for training SVM. The last section shows possible improvements.

Entry No. 560

Entry type conference paper

Authors J. Kotus, M. Szczodrak, K. Marciniuk, A. Czyżewski, B. Kostek

English title Creating Dynamic Psychoacoustic Maps of Hearing Threats for Outdoor Concerts Employing Supercomputing Grid

Polish title

Conference 136th International AES Convention

Preprint eBrief 150

Number

Volume

Pages

Conference site Berlin, Germany

Conference date 26.4.2014- 29.4.2014

Abstract The auditory effects caused by the outdoor concert are discussed in this paper. The analysis is based on the computation results obtained by means of supercomputing PL-Grid infrastructure and specific computational algorithms developed by the authors. The software consists of the outdoor sound propagation module and psychoacoustical noise dosimeter. The simulation was performed by means of real music recordings and the following outdoor propagation conditions were taken into account: speaker directivity, ground effect, building reflection, distance attenuation, and sound absorption by the atmosphere. On the basis of the proposed methodology the dynamic (one minute time resolution) psychoacoustic maps of hearing threats for considered area were created expressed by TTS (Temporary Threshold Shift) values in critical bands. Moreover, the results include also maps of sound level and noise dose values.

Entry No. 561

Entry type conference paper

Authors J. Cichowski, A. Czyżewski

English title ACTIVE RFID SYSTEM FOR OBJECTS LOCALIZATION AND IDENTIFICATION IN MULTIMODAL SURVEILLANCE INFRASTRUCTURE

Polish title Aktywny system RFID do lokalizacji i identyfikacji obiektów w wielomodalnej infrastrukturze bezpieczeństwa

Conference Kreajowe Sympozjum Telekomunikacji i Teleinformatyki KSTiT 2014

Preprint

Number 8-9

Volume

Pages 755 - 759

Conference site Poznań, Polska

Conference date 3.9.2014- 5.9.2014

Abstract Research and implementation focused on object detection application bases on video cameras and radio frequency identification are presented. The enhancement of the existing multimodal surveillance systems using radio modality for robust object detection is proposed. Assumptions of the developed system together with the background of the implementation of the hardware layer are briefly described. The practical application and the preliminary results of the experiments are discussed.

Streszczenie Przedstawiono prace koncepcyjne, badawcze oraz implementacyjne skoncentrowane na praktycznej realizacji systemu detekcji obiektów z wykorzystaniem kamer wizyjnych i identyfikacji radiowej. Zaproponowano rozbudowę wielomodalnego teleinformatycznego systemu bezpieczeństwa o warstwę identyfikacji radiowej obiektów. Omówiono założenia zaprojektowanego systemu oraz opracowaną warstwę sprzętową. Zaproponowano i przedyskutowano praktyczne zastosowania opisanego systemu.

Entry No. 562

Entry type journal paper

Authors M. Szczodrak, J. Kotus, A. Czyżewski, B. Kostek

English title Supercomputing Grid-Based Services for Hearing Protection and Acoustical Urban Planning, Research & Education

Polish title

Journal Lecture Notes in Computer Science

Volume 8500

Number

Pages 263 - 277

Notes DOI: 10.1007/978-3-319-10894-0_19

Abstract Specific computational environments, so-called domain grids, are developed within the PLGrid Plus project in order to prepare specialized IT solutions, i.e., dedicated software implementations and hardware (infrastructure adaptation), suited for particular research group demands. One of the PLGrid Plus domain grids, presented in this paper, is Acoustics. The article describes in detail two kinds of the acoustic domain services. The first can be used to calculate noise maps of large city areas, and is called "Noise Map". The second, called the "Hearing" service, enables simulations of noise impact on the human hearing system. Several kinds of usage scenarios of the developed services are also presented and illustrated by exemplary results. The infrastructure and the software developed can be utilized mainly for research and education purposes. The engineered software is intended for creating maps of noise threat for roads, railways and industrial sources. Integration of the software services with a distributed sensor network enables to automatically update the noise maps for a specific time period. A unique feature of the developed software is the possibility to estimate the auditory effects, which are caused by the exposure to noise. This estimation is based on the calculated noise levels and on a given exposure period. The outcomes of this research study are presented in form of a cumulative noise dose and characteristics of the temporary threshold shift.

Entry No. 563

Entry type report

Authors P. Klinke, A. Czyżewski

English title Tuning of implemented methods for usage scenarios

Polish title Dostrajanie opracowanych metod do scenariuszy użytkowych

Report Number SAMS39/08/14

Abstract The aim of this work was to revise all the methods implemented in previous works and go through parameters of processing to make sure they are adequate to the working scenarios. As a part of this work, fingers counting methods were tuned for ToF camera and merged with KSM Demo pipeline. A lower resolution of input image was proposed for the RGB pipeline and implemented filters were adapted to work with the new resolution.

Entry No. 564

Entry type conference paper

Authors P. Bratoszewski, K. Łopatka, A. Czyżewski

English title Examining Influence Of Video Framerate And Audio/Video Synchronization On Audio-Visual Speech Recognition Accuracy

Polish title

Conference Polish Academy of Sciences

Preprint

Number

Volume

Pages 75 - 85

Conference site Wrocław, Polska

Conference date 25.9.2014- 27.9.2014

Bibliographic No. 13

Notes Monografia pt.: Signal evaluation and monitoring in sound engineering

Abstract The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The initial video framerate equals 100 frames per second. The test signals were recorded with a specialized hardware for synchronous registration of audio and video data. In a practical implementation, however, it is difficult to achieve a high rate of images per second and maintain the precise audio/video synchronization. Therefore, in this work it is assessed, how the lowered framerate and lack of synchronization between audio and video data impairs the performance of the recognition engine. The lowered video framerate is enforced by downsampling the visual data. The lack of synchronization is simulated programmatically in the feature fusion process. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit). Word Error Rate, correctness and accuracy measures are considered, while a small dictionary of 11 words (numerals) is employed.

Entry No. 565

Entry type journal paper

Authors J. Kotus, K. Łopatka, A. Czyżewski, G. Bogdanis

English title Processing of acoustical data in a multimodal bank operating room surveillance system

Polish title

Journal Multimedia Tools and Applications

Volume

Number

Pages

Notes http://dx.doi.org/10.1007/s11042-014-2264-z

Abstract An automatic surveillance system capable of detecting, classifying and localizing acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic events. The methods for calculating the direction of coming sound employing an acoustic vector sensor are presented. The localization is achieved by calculating the DOA (Direction of Arrival) histogram. The evaluation of the system based on experiments conducted in a real bank operating room is presented. The results of sound event detection, classification and localization are provided and discussed. The practical usability of the engineered methods is underlined by presenting the results of analyzing a staged robbery situation.

Entry No. 566

Entry type journal paper

Authors B. Kostek, A. Kupryjanow, A. Czyżewski

English title Knowledge representation of motor activity of patients with Parkinson’s disease

Polish title Paramteryzacja sygnałów biomedycznych pochodzących z aktywności ruchowej osób z chorobą Parkinsona

Journal Natural Computing An International Journal, DOI: 10.1007/s11047-014-9475-0

Volume

Number Dec.

Pages 1 - 13

Notes DOI: 10.1007/s11047-014-9475-0, link do artykułu: http://link.springer.com/article/10.1007/s11047-014-9475-0/fulltext.html

Abstract An approach to the knowledge representation extraction from biomedical signals analysis concerning motor activity of Parkinson disease patients is proposed in this paper. This is done utilizing accelerometers attached to their body as well as exploiting video image of their hand movements. Experiments are carried out employing artificial neural networks and support vector machine to the recognition of characteristic motor activity disorders in patients. Obtained results indicate that it is possible to interpret some selected patient’s body movements with a sufficiently high effectiveness.

Streszczenie W artykule przedstawiono analizę sygnałów biomedycznych zebranych za pomocą czujników w trakcie wybranych aktywności osoby z chorobą Parkinsona. Sparametryzowane sygnały zostały wykorzystane do automatycznego rozpoznawania aktywności za pomocą sztucznych sieci neuronowych i SVM. Zaproponowane metody akwizycji, parametryzacji oraz klasyfikacji okazały się skuteczne w automatycznym rozpoznawaniu aktywności ruchów rąk i chodu.

Entry No. 567

Entry type book

Authors A. Dziech, A. Czyżewski

English title Multimedia Communications, Services and Security, eds.

Polish title Multimedialna komunikacja, usługi i ich bezpieczeństwo, edycja wydania

Editor Springer

Pages 1 - 264

Notes Redaktor wydania materiałów konferencyjnych (Proceedings) 7th International Conference MCSS 2014, Kraków Polska, JUne 11-12, 2014

Entry No. 568

Entry type conference paper

Authors A. Czyżewski, B. Kunka

English title New methods for assessment and stimulation of non-communicative patients employing advanced multimodal HCI

Polish title Nowe metody oceny i stymulacji pacjentów niekomunikatywnych z wykorzystaniem zaawansowanych interfejsów multimodalnych człowiek-komputer

Conference VII Międzynarodowa Konferencja Opieki Długoterminowej

Preprint

Number

Volume

Pages 43 - 45

Conference site Toruń, Polska

Conference date 24.9.2014- 25.9.2014

Abstract In most cases of patients with locomotor system damage it is possible to find a solution to the medical problems originating from the injury. However, it is much more difficult to prevent cognitive and emotional impairments. Therefore, we believe that the technological support of therapists working with such patients on an everyday basis may be essential. We have acquired experience in designing and providing diagnostic and therapeutic tools based on gaze tracking, which are successfully used in numerous facilities of medical care. We particularly specialise in multimodal computer interfaces, which we have been developing for a long time within other projects. The cost of the multimodal system is relatively low. That is why, in the near future the interfaces which we will use and develop will become much more commonplace. This will allow us to reduce the final cost of the system that we design within this project. Therefore, we may introduce it to hospices, residential medical care facilities or even private houses where patients live who have often erroneously been diagnosed as vegetative state (VS) or minimally conscious (MCS).

Streszczenie W wielu przypadkach u chorych z uszkodzeniem narządów ruchu możliwe jest znalezienie rozwiązań dla problemów medycznych wynikłych z urazów. Jednakże znacznie trudniej zapobiegać zaburzeniom funkcji poznawczych i emocjonalnych. Dlatego uważamy, że wsparcie technologiczne terapeutów pracujących na co dzień z takimi pacjentami staje się niezbędne. Zdobyliśmy doświadczenie w projektowaniu i udostępnianiu narzędzi diagnostycznych i terapeutycznych w oparciu o technologie śledzenia wzroku, które są z powodzeniem stosowane w wielu obiektach opieki medycznej. Specjalizujemy się w multimodalnych interfejsach komputerowych, które były rozwijane przez długi czas w ramach innych projektów. Koszt opracowanego systemu multimodalnego jest stosunkowo niski, dlatego w najbliższej przyszłości interfejsy przez nas używane i doskonalone staną się bardziej powszechne. Pozwoli to zmniejszyć ostateczny koszt systemu, który opracowujemy. Będzie można zatem wprowadzić go do hospicjów, placówek stałej opieki medycznej, a nawet do prywatnych mieszkań, w których żyją pacjenci, często błędnie zdiagnozowani – będący rzekomo w stanie wegetatywnym (ang. VS) lub w stanie minimalnej świadomości (ang. MSC).

Entry No. 569

Entry type report

Authors P. Trella, A. Czyżewski

English title Hand segmentation using parallel processing by skin color segmentation and Mixture of Gaussian method

Polish title

Report Number SAMS27/02/14

Abstract The task was to design and implement processing graph capable to use results of two different segmentation methods. First method was skin color segmentation based on color keying in HSV space described wider in one of previous reports [2]. Second method is Mixture of Gaussian background subtraction algorithm (openCV, MOG2).

Entry No. 570

Entry type

Authors A. Czyżewski, B. Kostek, P. Odya

English title

Polish title Sposób wzrokowego wykonywania utworów dźwiękowych na instrumentach muzycznych z wykorzystaniem zapisu nutowego oraz układ do realizacji tego sposobu

Notes zgłoszenie patentowe

Entry No. 571

Entry type

Authors A. Czyżewski, P. Suchomski, P. Odya, K. Łopatka

English title

Polish title Sposób dopasowania parametrów toru fonicznego cyfrowego urządzenia elektronicznego

Streszczenie Przedmiotem wynalazku jest układ do poprawy jakości dźwięku cyfrowych urządzeń elektronicznych, w szczególności dopasowania parametrów toru fonicznego urządzeń takich jak laptop, komputer PC, smartfon, tablet i podobne, do preferencji słuchowych użytkownika, charakterystyk urządzenia i warunków odsłuchu.

Entry No. 572

Entry type book

Authors A. Czyżewski

English title

Polish title Prace badawcze i wdrożeniowe Zespołu Katedry Systemów Multimedialnych oraz Laboratorium Akustyki Fonicznej, Wydział Elektroniki, Telekomunikacji i Informatyki, Politechnika Gdańska

Editor Komitet Akustyki Polskiej Akademii Nauk, Warszawa 2014

Pages

Notes Rozdział w książce: 50 lat Komitetu Akustyki Polskiej Akademii Nauk, 1964-2014, Osiągnięcia i wydarzenia, pod red. A. Śliwińskiego i E. Kozaczki

Streszczenie W bogatym dorobku prac naukowych oraz badawczo-wdrożeniowym z zakresu akustyki - Katedry Systemów Multimedialnych kierowanej przez prof. dr hab. inż. Andrzeja Czyżewskiego oraz Laboratorium Akustyki Fonicznej związanej z osobą prof. dr hab. inż. Bożeny Kostek - obecny jest nurt związanych z pracami dedykowanymi monitoringowi akustycznemu środowiska. W latach 2009-2012 pracownicy tych Jednostek zrealizowali projekt badawczy (grant celowy MNiSzW), którego efektem był opracowany Multimedialny System Monitorowania Hałasu.

Entry No. 573

Entry type

Authors A. Czyżewski, J. Cichowski

English title

Polish title Układ do wizyjnego monitorowania obiektów, zwłaszcza towarów w punktach handlowych

Notes ZGŁOSZENIE

Abstract Układ do wizyjnego monitorowania obiektów, zwłaszcza towarów w punktach handlowych

Entry No. 574

Entry type report

Authors P. Klinke, A. Czyżewski

English title Development of demonstration application for fingertips detection

Polish title Zaprojektowanie aplikacji demonstracyjnej dla wykrywania opuszków palców

Report Number SAMS26/03/14

Abstract The task was to design a demonstration application that would implement fingertips detection. The idea of this application was to create two basic states of the app.

Entry No. 575

Entry type journal paper

Authors A. Kwiatkowska, P. Izdebski, B. Kunka, A. Czyżewski

English title Disordered reading and writing in post-comatose patients employing a video-based eye-gaze tracking system

Polish title Badanie zaburzeń czytania i pisania u pacjentów wybudzonych ze śpiączki z wykorzystaniem systemu śledzenia wzroku

Journal Current Psychosocial Problems in Traditional and Novel Approaches

Volume 3

Number

Pages 141 - 157

Bibliographic No. 34

Notes ISBN 978-83-8018-010-9

Entry No. 576

Entry type

Authors A. Czyzewski, B. Kunka, A. Kwiatkowska, B. Kostek

English title

Polish title System CyberOko do diagnozy i terapii osób w śpiączce

Abstract Przedmiotem zgłoszenia :know-how" jest system CyberOko do diagnozy i terapii osób w śpiączce, opracowany w ramach projektu "Typoszereg interfejsów multimodalnych..."

Entry No. 577

Entry type conference paper

Authors K. Łopatka, A. Czyżewski

English title Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster

Polish title Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym

Conference 18th AES Convention

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 7.5.2015- 10.5.2015

Notes Streszczenie w czasopiśmie: J. Audio Eng. Society, vol. 63, no. 7/8, p. 624-625, 2015.

Abstract A method for automatic recognition of hazardous acoustic events operating on a super computing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The specialized framework for parallel processing of multimedia data streams KASKADA, in which the methods are implemented, is briefly introduced. An experiment intended to assess outcomes of parallel processing of audio data on a supercomputing cluster is featured. It is shown that by employing supercomputing services the time needed to analyze the data is greatly reduced.

Streszczenie Przedstawiono metodę automatycznego rozpoznawania niebezpiecznych zdarzeń dźwiękowych działającą na klastrze superkomputerowym. Wprowadzono algorytmy detekcji i klasyfikacji zdarzeń akustycznych. Przeprowadzono ocenę skuteczności działania metody z wykorzystaniem sygnałów ze zbioru treningowego oraz rzeczywistych dźwięków. Osiagnięto wystarczającą skuteczność w warunkach praktycznych, aby zastosować algorytmy do celów automatycznego nadzoru bezpieczeństwa. Wykorzystano specjalistyczną platformę przetwarzania danych multimedialnych na klastrze - KASKADA. Przeprowadzono eksperyment w celu sprawdzenia zysku z równoległego przetwarzania danych fonicznych. Osiągnięto znaczące skrócenie czasu potrzebnego na przeanalizowanie danych.

Entry No. 578

Entry type conference paper

Authors Ł. Kosikowski, A. Czyżewski, A. Senderski

English title Visual and auditory attention stimulator for assisting pedagogical therapy

Polish title Stymulator uwagi wzrokowej i słuchowej do wspomagania terapii pedagogicznej

Conference 2015 8th International Conference on Human System Interaction (HSI)

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 25.6.2015- 27.6.2015

Abstract Visual and auditory attention stimulator provides a system developed in order to improve reading skills using simultaneous presentation of text in its visual form and in transformed auditory form accompanied by related movie material. The described research employed 40 children at the age of 8 13 years having difficulties in learning of reading, who were diagnosed as having developmental dyslexia. It was shown that application of the proposed method could improve reading skills in those children. The effectiveness of the method has been shown primarily using the D2 attention test designed by R. Brickenkamp in its Polish adaptation made by E. Dajek.

Entry No. 579

Entry type conference paper

Authors K. Łopatka, J. Kotus, P. Bratoszewski, P. Spaleniak, M. Szykulski, A. Czyżewski

English title Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor

Polish title Interfejs głosowy wykorzystujący filtrację przestrzenną sygnałów z wektorowego czujnika akustycznego

Conference Human System Interactions (HSI), 2015 8th International Conference on Human System Interaction

Preprint

Number

Volume

Pages 82 - 87

Conference site Warszawa, Polska

Conference date 25.6.2015- 27.6.2015

Abstract Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed by an automatic speech recognition (ASR) engine. It is shown that employing spatial filtration of signals from the AVS probe leads to a significant reduction in word error rate (WER) for a dictionary of 184 words.

Entry No. 580

Entry type journal paper

Authors P. Bratoszewski, J. Cichowski, A. Czyżewski

English title Measurements and Simulations of Engineered Ultrasound Loudspeakers

Polish title

Journal Computational Methods in Science and Technology

Volume 21

Number 3

Pages 151 - 160

Bibliographic No. 13

Abstract Simulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction of the human auditory system. Based on the measurements of the sound emitted from the two parametric arrays of ultrasonic transducers the directivity of the proposed system and the interaural crosstalk characteristics are determined. Application of the system concerns creating a personal audio space for users of mobile platforms, such as notebooks, and applying 3D audio algorithms without the need of using headphones.

Entry No. 581

Entry type conference paper

Authors P. Bratoszewski, M. Szykulski, A. Czyżewski

English title Examining Influence of Distance to Microphone on Accuracy of Speech Recognition

Polish title

Conference 138th AES Convention

Preprint

Number

Volume

Pages 1 - 4

Conference site Warszawa, Polska

Conference date 7.5.2015- 10.5.2015

Bibliographic No. 8

Abstract The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal the Mel- Frequency Cepstral Coefficients (MFCC) are used. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit) for the Automatic Speech Recognition (ASR) task. The dictionary of 184 words was employed and WER (Word Error Rate), correctness and accuracy measures were calculated in order to verify and to compare obtained results of speech recognition.

Entry No. 582

Entry type conference paper

Authors K. Łopatka, J. Kotus, P. Suchomski, A. Czyżewski, B. Kostek

English title Personal adaptive tuning of mobile computer audio

Polish title Adaptacyjne strojenie dźwięku do osobistych preferencji użytkownika komputera przenośnego

Conference 139th AES Convention

Preprint 9455

Number

Volume

Pages

Conference site New York, USA

Conference date 29.10.2015- 1.11.2015

Notes Streszczenie w czasopiśmie: J. Audio Eng. Society, vol. 63, no. 12, p. 1091, 2015.

Abstract An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences. The details of the algorithm implemented in the C++ programming language are provided. The processing is performed utilizing custom Audio Processing Objects (APO) installed in Windows sound system. The sound enhancement bundle is managed with a User Interface enabling control over the sound system. The results of subjective evaluation of the introduced methods devices are discussed.

Entry No. 583

Entry type

Authors A. Czyżewski, P. Odya

English title Screen-pen, especially for the tablet

Polish title Pisak ekranowy, zwłaszcza do tabletu

Notes Podano datę udzielenia praw ochronnych

Streszczenie Wzór użytkowy dotyczy pisaka ekranowego zwłaszcza do tabletu. Pisak składa się z obsadki i końcówki pisaka. Na obwodzie obsadki osadzona jest przelotowa nasadka (w formie foremnego graniastosłupa o przekroju trójkąta). Na każdej ze ścianek nasadki istnieje możliwość umieszczenia czujników ścisku bocznego. Na końcówce pisaka jest możliwość osadzenia czujnika nacisku pionowego.

Entry No. 584

Entry type

Authors B. Kostek, A. Czyżewski, P. Hoffmann

English title

Polish title Sposób modyfikacji częstotliwościowej sygnału dźwiękowego i układ do modyfikacji częstotliwościowej sygnału dźwiękowego

Streszczenie Przedmiotem wynalazku jest sposób modyfikacji częstotliwościowej sygnału dźwiękowego i układ do modyfikacji częstotliwościowej sygnału dźwiękowego, przeznaczony zwłaszcza do wykorzystania w pokojach odsłuchowych oraz studiach nagraniowych.

Entry No. 585

Entry type conference paper

Authors P. Bratoszewski, A. Czyżewski

English title Face Profile View Retrieval Using Time of Flight Camera Image Analysis

Polish title

Conference Pattern Recognition and Machine Intelligence 2015

Preprint

Number

Volume 9124

Pages 159 - 168

Conference site Warszawa, Polska

Conference date 30.6.2015- 3.7.2015

Bibliographic No. 18

Notes Projekt IDENT, wydane w Lecture Notes in Computer Science, Sprigner, isbn: 978-3-319-19940-5, link: http://dx.doi.org/10.1007/978-3-319-19941-2_16

Abstract Method for profile view retrieving of the human face is presented. The depth data from the 3D camera is taken as an input. The preprocessing is, besides of standard filtration, extended by the process of filling of the holes which are present in depth data. The keypoints, defined as the nose tip and the chin are detected in user’s face and tracked. The Kalman filtering is applied to smooth the coordinates of those points which can vary with each frame because of the subject’s movement in front of the camera. Knowing the locations of keypoints and having the depth data the contour of the user’s face a profile retrieval is attempted. Further filtering and modifications are introduced to theprofile view in order to enhance its representation. Data processing enhancements allow emphasizing minima and maxima in the contour signals leading to discrimination of the face profiles and enable robust facial landmarks tracking.

Entry No. 586

Entry type journal paper

Authors M. Lech, B. Kostek, A. Czyżewski

English title Multimedia polysensory integration training system dedicated to children with educational difficulties

Polish title doi:10.1007/s10844-015-0390-3

Journal Journ. of Intelligent Information Systems

Volume

Number

Pages 1 - 22

Notes Download Your e-Offprint (PDF file) Your 'Online First' electronic offprint is now available! Download your PDF file using the following link: http://www.springer.com/home?SGWID=0-0-1003-0-0&aqId=

Abstract This paper aims at presenting a multimedia system providing polysensory training for pupils with educational difficulties. The particularly interesting aspect of the system lies in the sonic interaction with image projection in which sounds generated lead to stimulation of a particular part of the human brain. The system architecture, video processing methods, therapeutic exercises and guidelines for children’s interaction with the system are presented. Results of pupils’ improvements after several weeks of exercising with the system are provided. The outcome of this study suggests that learning and developing through the interactive method helped to improve children’s spatial orientation skills.

Entry No. 587

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title Pawlak's flow graph extensions for video surveillance systems

Polish title

Conference 2015 Federated Conference on Computer Science and Information Systems

Preprint

Number

Volume 5

Pages 81 - 87

Conference site Łódź, Polska

Conference date 13.9.2015- 16.9.2015

Bibliographic No. 384

Abstract The idea of the Pawlak's flow graphs is applicable to many problems in various fields related to decision algorithms or data mining. The flow graphs can be used also in the video surveillance systems. Especially in distributed multi-camera systems which are problematic to be handled by human operators because of their limited perception. In such systems automated video analysis needs to be implemented. Important part of this analysis is tracking object within a single camera and between cameras' fields of vision. One of element needed to re-identify the single real object besides object's visual features and spatio-temporal dependencies between cameras is a behaviour model. The flow graph after some modifications, is a suitable data structure, which concept is based on the rough set theory, to contained as a behaviour model in it. Additionally, the flow graph can be used to predict the future movement of given object. In this paper a survey of authors research works related to employing flowgraphs in video surveillance systems is contained. The flow graph creation based on the paths of objects inside supervised area will presented. Moreover, a method of building a probability tree on the basis of the flow graph and a method for adaptating the flowgraph to the changing topology of the camera network are also discussed.

Entry No. 588

Entry type journal paper

Authors K. Lisowski, A. Czyżewski

English title Complexity analysis of the Pawlak’s flowgraph extension for re-identification in multi-camera surveillance system

Polish title

Journal Multimedia Tools and Applications

Volume

Number

Pages 1 - 17

Notes DOI: 10.1007/s11042-015-2652-z

Abstract The idea of Pawlak’s flowgraph turned out to be a useful and convenient container for a knowledge of objects’ behaviour and movements within the area observed with a multi-camera surveillance system. Utilization of the flowgraph for modelling behaviour admittedly requires certain extensions and enhancements, but it allows for combining many rules into a one data structure and for obtaining parameters describing how objects tend to move through the supervised area. The main aim of this article is presentation of the complexity analysis of proposed modification of flowgraphs. This analysis contains considerations of issues such as memory efficiency and computational complexity of operations on the flowgraph. The measures related to space and time efficiency were also included.

Entry No. 589

Entry type journal paper

Authors A. Czyżewski, P. Bratoszewski, A. Ciarkowski, J. Cichowski, K. Lisowski, M. Szczodrak

English title Massive surveillance data processing with supercomputing cluster

Polish title

Journal Elsevier Information Sciences

Volume

Number 296

Pages 322 - 344

Notes dod. autorzy: G. Szwoch, H. Krawczyk

Abstract In recent years, increasingly complex algorithms for automated analysis of surveillance data are being developed. The rapid growth in the number of monitoring installations and higher expectations of the quality parameters of the captured data result in an enormous computational cost of analyzing the massive volume of data. In this paper a new model of online processing of surveillance data streams is proposed, which assumes the use of services running within a supercomputer platform. The study presents some of the highly parallelized algorithms for detecting safety-threatening events in high-resolution-video streams, which were developed during the research, and discusses their performance on the supercomputer platform

Entry No. 590

Entry type journal paper

Authors K. Kopaczewski, M. Szczodrak, A. Czyżewski, H. Krawczyk

English title A method for counting people attending large public events

Polish title

Journal Multimedia Tools and Applications

Volume

Number 74

Pages 4289 - 4301

Abstract The algorithm for people counting in crowded scenes, based on the idea of virtual gate is presented. The concept and practical application of the developed algorithm in real conditions is depicted. The aim of the work is to estimate the number of people passing through entrances of a large sport hall. The most challenging problem was the unpredicted behavior of people while entering the building. Flow of people fluctuated between single persons and dense crowd. A series of experiments during sport and entertainment events was made. The results of the experiments show high accuracy of the algorithm.

Entry No. 591

Entry type conference paper

Authors J. Kotus, W. Moskwa, A. Czyżewski, B. Kostek

English title Development of the Sound Field 3D Intensity Probe Based on Miniature Microphones

Polish title Projekt sondy mikrofonowej do pomiarów zjawisk falowych w rzeczywistym polu akustycznym

Conference 139 Audio Eng. Soc. Convention

Preprint 221

Number

Volume

Pages 1 - 4

Conference site New York, USA

Conference date 29.10.2015- 1.11.2015

Notes projekt NCN_P

Abstract The engineered measuring probe uses three pairs of miniature microphones coupled. The signals from the microphones after an initial amplification are fed to differential circuits. Due to the required symmetry of the circuit it was necessary to select electronic components very carefully. Moreover, additional digital signal processing techniques were applied to avoid amplitude and phase mismatch. The view of the engineered probe is presented in photographs. Characteristics of the probe measured in an anechoic chamber are attached followed by a discussion of achieved results. The obtained results were compared with the reference USP probe, produced by the Microflown company.

Streszczenie W referacie opisano projekt i przygotowanie sondy pomiarowej do pomiaru natężenia w polu akustycznym. Przygotowana sonda składa się z trzech par mikrofonów. Sonda mikrofonowa przed zastosowaniem jej do pomiarów pola akustycznego wymaga przeprowadzenia kalibracji jej czujników. Metoda kalibracji polega na porównaniu charakterystyki odbiorczej sondy z mikrofonem wzorcowym, a następnie wprowadzeniu korekt w charakterystykach amplitudowych i fazowych czujników. Pomiary wykonane zostały w komorze bezechowej.

Entry No. 592

Entry type conference paper

Authors J. Kotus, M. Szczodrak, A. Czyżewski, B. Kostek

English title

Polish title Długoterminowa ocena poziomu hałasu w wybranych szkołach

Conference XII sesja metodyczna, Jak kreować bezpieczny świat ucznia?

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 4.3.2015- 4.3.2015

Notes Prezentacja wygłoszona podczas XII sesji metodycznej, zorganizowanej przez Pedagogiczną Bibliotekę Wojewódzką w Gdańsku pod hasłem: Jak kreować bezpieczny świat ucznia?

Streszczenie W trakcie wystąpienia zostaną przedstawione doświadczenia autorów związane z długoterminowymi pomiarami poziomu hałasu w wybranych szkołach. Pomiary wykonano za pomocą stacji pomiarowej zamontowanej na stałe w wybranych szkołach. Pomiary były prowadzone przez 24 godziny na dobę. Obejmowały wyznaczanie parametrów szerokopasmowych jak również rozkład energii akustycznej w pasmach o szerokości 1/3 oktawy. We wprowadzeniu przybliżono znaczenie poszczególnych parametrów akustycznych. W toku wystąpienia zaprezentowano wyniki pomiarów hałasu przed wykonaniem adaptacji akustycznej oraz po jej zastosowaniu. Dodatkowo omówiono problematykę wpływu hałasu na słuch. Zilustrowano to zagadnienie wynikami symulacji zmiany czasowego przesunięcia progu słyszenia w następstwie ekspozycji na hałas panujący podczas przerw międzylekcyjnych. W trakcie wystąpienia przedstawiono również możliwe do zastosowania metody ograniczenia szkodliwego oddziaływania hałasu.

Entry No. 593

Entry type

Authors A. Czyżewski, P. Bratoszewski, K. Łopatka, J. Kotus, G. Szwoch

English title

Polish title Sposób i układ do komunikacji człowieka z samoczynnymi maszynami, zwłaszcza w celu wydawania poleceń głosowych

Notes Zgłoszenie patentowe PL

Streszczenie Przedmiotem wynalazku jest sposób i układ do komunikacji człowieka z samoczynnymi maszynami, zwłaszcza w celu wydawania poleceń głosowych, a także zdalnego sterowania w takich urządzeniach jak telefon, tablet, komputer pokładowy w samochodzie.

Entry No. 594

Entry type

Authors A. Czyżewski, P. Bratoszewski, K. Łopatka, J. Kotus, G. Szwoch

English title

Polish title Sposób i układ do poprawy jakości sygnału mowy w systemach rozpoznawania mowy i komunikacyjnych

Notes Zgłoszenie patentowe PL

Streszczenie Przedmiotem wynalazku jest sposób i układ do poprawy jakości sygnału mowy, zwłaszcza w systemach automatycznego rozpoznawania mowy, przeznaczony zwłaszcza do konferencji multimedialnych.

Entry No. 595

Entry type

Authors A. Czyżewski, P. Bratoszewski, K. Łopatka, J. Kotus, G. Szwoch

English title

Polish title Sposób i układ do poprawy jakości sygnału mowy w systemach rozpoznawania mowy, zwłaszcza w komunikacji człowieka z samoczynnymi maszynami

Notes Zgłoszenie patentowe UE

Streszczenie Przedmiotem wynalazku jest sposób i układ do poprawy jakości sygnału mowy w systemach rozpoznawania mowy, zwłaszcza w komunikacji człowieka z samoczynnymi maszynami, który może być przeznaczony również do wykorzystania w zdalnych konferencjach multimedialnych.

Entry No. 596

Entry type conference paper

Authors P. Szczuko, J. Kotus, M. Szczodrak, B. Kostek, A. Czyżewski

English title

Polish title Analiza drgań struny gitarowej z użyciem szybkich kamer

Conference 16th International Symposium on Sound Engineering and Tonmeistering

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 8.10.2015- 10.10.2015

Streszczenie W referacie przedstawiono metodę analizy i wizualizacji ruchu struny gitarowej. Drgania struny zostały zarejestrowane za pomocą szybkich kamer. Układ optyczny zastosowany do rejestracji został dobrany w taki sposób, by móc obserwować drgania wzdłuż struny. Obrazy zarejestrowane za pomocą szybkich kamer zostały przeanalizowane za pomocą algorytmów cyfrowego przetwarzania sygnałów tak, aby z dużą dokładnością śledzić wychylenia i odkształcenia struny, poprawić rozdzielczość przestrzenną i przekształcić te dane na przebieg akustyczny. Sygnał akustyczny obliczony na podstawie analizy wizyjnej został porównany z sygnałem odniesienia, zarejestrowanym za pomocą mikrofonu pomiarowego. Przeprowadzone badania mają na celu poznanie zjawiska przekazywania energii drgającej struny do korpusu instrumentu.

Entry No. 597

Entry type journal paper

Authors K. Łopatka, J. Kotus, A. Czyżewski

English title Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations

Polish title

Journal MULTIMEDIA TOOLS AND APPLICATIONS

Volume

Number

Pages 1 - 33

Notes DOI: 10.1007/s11042-015-3105-4

Abstract Evaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the classifier are introduced. The sound source localization algorithm based on the analysis of multichannel signals from the Acoustic Vector Sensor is presented. The methods are evaluated in an experiment conducted in the anechoic chamber, in which the representative events are played together with noise of differing intensity. The results of detection, classification and localization accuracy with respect to the Signal to Noise Ratio are discussed. The results show that the recognition and localization accuracy are strongly dependent on the acoustic conditions.We also found that the engineered algorithms provide a sufficient robustness in moderately intense noise in order to be applied to practical audio-visual surveillance systems.

Entry No. 598

Entry type

Authors A. Czyżewski, M. Lech, P. Hoffmann

English title

Polish title Elektroniczny pisak, zwłaszcza do ekranów dotykowych pojemnościowych

Streszczenie Przedmiotem wzoru użytkowego jest elektroniczny pisak, zwłaszcza do ekranów dotykowych pojemnościowych służący do wodzenia po tych ekranach.

Entry No. 599

Entry type

Authors A. Czyżewski, M. Lech

English title

Polish title Sposób i układ do bezkontaktowego składania podpisu i weryfikacji jego autentyczności

Streszczenie Sposób i układ do bezkontaktowego składania podpisu i weryfikacji jego autentyczności

Entry No. 600

Entry type

Authors A. Czyżewski, M. Lech

English title

Polish title Sposób i układ do bezkontaktowego składania podpisu i weryfikacji jego autentyczności

Streszczenie Sposób i układ do bezkontaktowego składania podpisu i weryfikacji jego autentyczności

Entry No. 601

Entry type

Authors A. Czyżewski, M. Lech, P. Hoffmann, J. Cichowski

English title

Polish title Układ do składania podpisu odręcznego i weryfikacji jego autentyczność

Streszczenie Przedmiotem wynalazku jest sposób i układ do składania podpisu odręcznego i weryfikacji jego autentyczności.

Entry No. 602

Entry type

Authors B. Kostek, A. Czyżewski, P. Hoffmann

English title

Polish title Sposób modyfikacji częstotliwościowej sygnału dźwiękowego i układ do modyfikacji częstotliwościowej sygnału dźwiękowego

Streszczenie Przedmiotem wynalazku jest sposób modyfikacji częstotliwościowej sygnału dźwiękowego i układ do modyfikacji częstotliwościowej sygnału dźwiękowego, przeznaczony zwłaszcza do wykorzystania w pokojach odsłuchowych oraz studiach nagraniowych.

Entry No. 603

Entry type

Authors A. Czyżewski, J. Kotus, B. Kostek

English title

Polish title Natężeniowe sondy mikrofonowe oparte na miniaturowych mikrofonach analogowych lub cyfrowych klasy MEMS

Notes Rozwiązanie zgłoszone wewnątrz PG, numer zgłoszenia 52/15, W.125097 zgłoszony do UP RP w dn. 05.05.2016r.

Streszczenie Przedmiotem zgłoszenia jest opracowanie sondy umożliwiającej wyznaczenie przestrzennego rozkładu natężenia dźwięku. Opracowana sonda do pomiaru natężenia dźwięku składa się z części akwizycji sygnałów akustycznych, obejmującej przestrzenny układ mikrofonów (analogowych lub cyfrowych) oraz układu realizującego funkcje korekcji sygnałów akustycznych i formowania sygnału wyjściowego. Ortogonalnie umieszczone trzy pary mikrofonów tworzą układy za pomocą których otrzymywane są sygnału prędkości akustycznej odpowiednio dla kierunków dla osi OX, OY i OZ. Mikrofon umieszczony centralnie dostarcza sygnał ciśnienia akustycznego.

Entry No. 604

Entry type journal paper

Authors A. Czyżewski, P. Hoffmann, G. Bogdanis

English title

Polish title Automatyczna weryfikacja klienta bankowego w oparciu o multimodalne technologie biometryczne

Journal Elektronika: konstrukcje, technologie, zastosowania

Volume

Number

Pages 34 - 37

Streszczenie W referacie przedstawiono przegląd rozwiązań wykorzystywanych w bankach do weryfikacji tożsamości klientów. Ponadto zawarto opis metod biometrycznych aktualnie wykorzystywanych w placówkach bankowych wraz z odniesieniem do skuteczności i wygody korzystania z dostępnych rozwiązań. Zaproponowano rozszerzenie zakresu wykorzystania technologii biometrycznych, wskazując kierunek rozwoju systemów bezpieczeństwa dla poprawy dostępu do usług i zwiększenia bezpieczeństwa transakcji. Referat zawiera informacje będące podstawą zainicjowania projektu IDENT, realizaowanego w ramach Programu Badań Stosowanych NCBR, który ma na celu poprawę skuteczności weryfikacji klientów bankowych z użyciem technologii biometrycznych.

Entry No. 605

Entry type conference paper

Authors J. Kotus, P. Szczuko, M. Szczodrak, A. Czyżewski

English title Application of Fast Cameras to String Vibrations Recording

Polish title

Conference 19th IEEE Conference SPA 2015, Signal Processing: Algorithms, Architectures, Arrangements, and Applications

Preprint

Number

Volume

Pages 104 - 109

Conference site Poznań, Polska

Conference date 23.9.2015- 25.9.2015

Abstract A hardware and software solution for guitar string vibration measurement by fast cameras is described. Orthogonal setup for 3D image acquisition is proposed capable to capture several thousand image frames per second. Dedicated image processing algorithm was developed and described in the paper, aimed at tracking the movement of some selected points along the string. Fast and accurate tracking results provided a detailed information about vibrations, that was transformed into sound samples. Described sound processing methods were applied in order to enable a comparison of captured string vibrations with the sound recorded using a microphone. Analysis of obtained results, conclusions, and future work plans are included.

Entry No. 606

Entry type conference paper

Authors A. Czyżewski, A. Korzeniewski, P. Odya, P. Szczuko, B. Kostek

English title Survey on applications of multimedia technology to examine impact of roadside advertising on drivers

Polish title Badania na temat zastosowania technologii multimedialnych w celu zbadania wpływu reklamy przydrożnych na kierowców

Conference 8th International Conference: Multimedia Communications, Services and Security (MCSS)

Preprint

Number 566

Volume

Pages 141 - 155

Conference site Kraków, Polska

Conference date 24.11.2015- 24.11.2015

Notes Communications in Computer and Information Science 566,

Abstract The correct location of ads, both static and moving, in close proximity of the roadway is an issue of high significance in the context of road safety. This publication aims to provide support in solving these issues by presenting a range of options for the implementation of extensive, multi-faceted research, using modern technology to allow an objective assessment of the risks arising from the presence of advertising spots in the roadway. The chosen research tools include the drivers’ reaction tracking systems based on the use of advanced multimedia technology. These systems may be integrated in the actual vehicle, allowing for performing the tests in real-life conditions or as part of an extended driving simulator. In addition, a part of the proposed approaches to researching the problem is to check drivers’ opinion using questionnaires and to analyze the traffic accidents taking place in close proximity to road advertising.

Entry No. 607

Entry type conference paper

Authors P. Szczuko, B. Kostek, J. Kotus, A. Czyżewski

English title Rough Set Based Modeling and Visualization of the Acoustic Field Around the Human Head

Polish title

Conference PReMI 2015

Preprint

Number

Volume

Pages 418 - 427

Conference site Warszawa,

Conference date 30.6.2015- 3.7.2015

Notes M. Kryszkiewicz et al. (Eds.): PReMI 2015, LNCS 9124. DOI: 10.1007/978-3-319-19941-2_40

Abstract The presented research aims at modeling acoustical wave propagation phenomena by applying rough set theory in a novel manner. In a typical listening environment sound intensity is determined by numerous factors: a distance from a sound source, signal levels and frequencies, obstacles’ locations and sizes. Contrarily, a free-field is characterized by direct, unimpeded propagation of the acoustical waves. The proposed approach is focused on processing sound field measurements performed in an anechoic chamber, collected by a dedicated acoustic probe, comprising thousands of datapoints for six signal frequencies, with and without the presence of a dummy head in a free-field. The rough set theory is applied for modeling the influence of an obstacle that a dummy head creates in a free-field and the effects of the head acoustic interferences, shading and diffraction. A data pre-processing method is proposed, involving coordinate system transformation, data discretization, and classification. Four rule sets are acquired, and achieved accuracy and coverage are assessed. Final results allow simplification of the model and new method for visualization.

Entry No. 608

Entry type conference paper

Authors P. Bratoszewski, G. Szwoch, A. Czyżewski

English title Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition

Polish title

Conference Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Preprint

Number

Volume

Pages 287 - 291

Conference site Poznań, Polska

Conference date 21.9.2016- 23.9.2016

Abstract The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy speech is considered. The speech signal was recorded in a real-life scenario in an office-like environment with the babble noise generated by the loudspeakers at different levels. The proposed method of visual voice activity detection is aimed at enhancing the accuracy of ASR when the ratio of signal to noise is low. The numerals in English language are used as speech material and Word Error Rate (WER) is employed for the evaluation purposes.

Entry No. 609

Entry type journal paper

Authors M. Lech, P. Bratoszewski, A. Czyżewski

English title

Polish title System Weryfikacji Autentyczności Podpisu Odręcznego

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages 1145 - 1148

Notes doi:10.15199/59.2016.8-9.77

Abstract The static and dynamic signature verification method employing a biometric pen equipped with 2 accelerometers, 2 gyroscopes and 3 pressure sensors, used with resistive surface, connecting wireless with computerized devices, has been presented in the paper. In the introduction the network architecture of the multimodal biometric system has been described. The software of the signature verification system and the utilized methodology of verification have been presented along with the results of FRR and FAR measures assessment.

Streszczenie W referacie przedstawiono system statycznej i dynamicznej weryfikacji autentyczności podpisu odręcznego, składanego piórem biometrycznym, wyposażonym w 2 akcelerometry, 2 żyroskopy i 3 czujniki ścisku, na rezystancyjnej powierzchni dotykowej, łączącym się bezprzewodowo z urządzeniami komputerowymi. We wstępie przedstawiono architekturę sieciową wielomodalnego systemu biometrii. Przedstawiono warstwę sprzętową systemu weryfikacji autentyczności podpisu, przyjętą metodykę weryfikacji oraz wyniki oceny miar FRR i FAR.

Entry No. 610

Entry type

Authors M. Lech, B. Kostek, A. Czyżewski

English title Method and system for audio mixing

Polish title Układ do miksowania dźwięku

Notes nr zgł. 395458, data zgł. 2011-06-28, nr WUP 11/2016, data pub. WUP 2016-11-30, nr, BUP 01/2013, data pub. BUP 2013-01-07

Streszczenie Sposób miksowania dźwięku polegający na zmianie parametrów i sterowaniu parametrami sygnału zapisanego na poszczególnych ścieżkach dźwiękowych składających się na końcowy sygnał foniczny za pomocą aplikacji komputerowej udostępniającej operacje miksowania dźwięku charakteryzuje się tym, że określone operacje miksowania wybiera się i wykonuje bezkontaktowo za pomocą gestów obiektów sterujących (OS) odbieranych przez moduł akwizycji gestów (K), które po ich przetworzeniu metodami cyfrowymi w urządzeniu sterującym (U) współpracującym z komputerem (C) wykorzystuje się do generowania sygnałów elektronicznych sterujących wyborem operacji miksowania dla aplikacji komputerowej udostępniającej operacje miksowania dźwięku, przy czym użytkownik dowolnie określa i modyfikuje powiązania gestów z poszczególnymi operacjami miksowania. System miksowania dźwięku zawiera zespół głośników (G) współpracujących z komputerem (C) wyposażonym w aplikację komputerową (AM) udostępniającą operacje miksowania dźwięku i wyposażony jest w urządzenie sterujące (U) sprzężone z komputerem (C) i posiadające moduł akwizycji gestów (K) sprzężony bezkontaktowo z obiektami sterującymi (OS).

Entry No. 611

Entry type journal paper

Authors J. Kotus, A. Czyżewski, B. Kostek

English title 3D Acoustic Field Intensity Probe Design and Measurements

Polish title

Journal Archives of Acoustics

Volume 41

Number 4

Pages 701 - 711

Notes DOI: 10.1515/aoa-2016-0067

Abstract The aim of this paper is two-fold. First, some basic notions on acoustic field intensity and its measurement are shortly recalled. Then, the equipment and the measurement procedure used in the sound intensity in the performed research study are described. The second goal is to present details of the design of the engineered 3D intensity probe, as well as the algorithms developed and applied for that purpose. Results of the intensity probe measurements along with the calibration procedure are then contained and discussed. Comparison between the engineered and the reference commercial probe confirms that the designed construction is applicable to the sound field intensity measurements with a sufficient effectiveness.

Entry No. 612

Entry type journal paper

Authors G. Szwoch, D. Ellwart, A. Czyżewski

English title Parallel implementation of background subtraction algorithms for real-time video processing on a supercomputer platform

Polish title Zrównoleglona implementacja algorytmów odejmowania tła do celów przetwarzania obrazu w czasie rzeczywistym na superkomputerze

Journal Journal of Real-Time Image Processing

Volume 11

Number 1

Pages 111 - 125

Notes http://link.springer.com/article/10.1007%2Fs11554-012-0310-5

Abstract Results of evaluation of the background subtraction algorithms implemented on a supercomputer platform in a parallel manner are presented in the paper. The aim of the work is to chose an algorithm, a number of threads and a task scheduling method, that together provide satisfactory accuracy and efficiency of a real-time processing of high resolution camera images, maintaining the cost of resources usage at a reasonable level. Two selected algorithms: the Gaussian mixture models and the Codebook, are presented and their computational complexity is discussed. Various approaches to the parallel implementation, including assigning the image pixels to threads, the task scheduling methods and the thread management systems, are presented. The experiments were performed on a supercomputer cluster, using a single machine with twelve physical cores. The accuracy and performance of the implemented algorithms were evaluated for varying image resolutions and numbers of concurrent processing threads. On a basis of the evaluation results, an optimal configuration for the parallel implementation of the system for real-time video content analysis on a supercomputer platform was proposed.

Streszczenie W atykule przedstawiono wyniki badania algorytmów odejmowania tła zaimplementowanych w formie równoległej na platformie superkomputera. Celem pracy był wybór algorytmu, liczby wątków i metody zarządzania wątkami, które pozwolą uzsykać zadowalającą dokładność i wydajność przy przetwarzaniu obrazów z kamer wysokiej rozdzielczości, zachowując odpowiedni poziom zużycia zasobów. Wybrano dwa algorytmy: Gaussian Mixture Method oraz Codebook. Przedstawiono różne podejścia do równoległej implementacji algorytmu, dotyczące przypisywania pikseli do wątków i zarządzania wątkami. Eksperymenty przeprowadzono na klastrze komputerowym, z użyciem pojedynczego węzła o 12 rdzeniach obliczeniowych. Zbadano dokładność i wydajność algorytmów w zależności od rozdzielczości obrazu i liczby wątków obliczeniowych. Na podstawie wyników badań zaproponowano optymalną konfigurację do równoległego przetwarzania strumieni wizyjnych.

Entry No. 613

Entry type conference paper

Authors A. Czyżewski, G. Bogdanis, B. Kostek, M. Lech, P. Bratoszewski, P. Hoffmann

English title Automatic verification of banking clients based on multimodal biometrics

Polish title

Conference Biometrics 2016

Preprint

Number

Volume

Pages

Conference site Londyn , Wielka Brytania

Conference date 18.10.2016- 20.10.2016

Abstract Within the scope of the IDENT project – a Multimodal biometric system for bank client identity verification developed within the NCBiR Applied Research Program, a multimodal technology is currently in development, improving biometric systems used to date through integration and intelligent application of innovative – new and already known biometric methods in form of an intelligent, multimodal bank stand. In the multimodal stand, the authors decided to use the following modalities: - dynamic signature based on multidimensional analysis, applied with a wireless pen with sensors, developed and built in the Department of Multimedia Systems - face contour registered with the use of laser photogrammetry - audio verification of identity with the use of free speech - video verification of identity - verification with the use of blood vessel distribution, based on hand analysis in infrared light A central element of the stand is a Biometric Hub, which functions as a modality integrator. Signals registered with the use of biometric sensors are parameterized by the Biometric Hub and then transferred to a Biometric Server, where the assessment and comparison of samples are performed. Control of the stand takes place via the Biometric Server, which supervises the process of biometric sampling. All activities performed in the stand are simultaneously performed on the Biometric Hub and on the consultant’s computer, and the results of operation of the stand are visible on the screens of the consultant and of the customer. Activities performed in the stand are registered in a biometric database. The basic functionality consists of enabling the collection of biometric samples, which in the next step serve for identity verification. Biometric data collected while the stand is operating are saved on a Biometric Server. Through global storage of biometric patterns, it is possible to confirm the identity of a customer in any bank unit taking part in the development of an Experimental, Distributed Biometric Lab. After the process of collecting biometric patterns is completed, the customer and the consultant are asked to share their opinions in a survey integrated with the software of the stand. According to the project execution plan, it is planned to build and launch 100 stands described above in 60 units of the PKO BP Bank. As all those stands are going to communicate with a central server, this will lead to the creation of a certain Distributed Bank Biometrics Laboratory.

Entry No. 614

Entry type conference paper

Authors B. Kostek, P. Szczuko, J. Kotus, M. Szczodrak, A. Czyżewski

English title Vibration analysis of acoustic guitar string employing high-speed video cameras

Polish title

Conference Spring (171st) 2016 Meeting of the Acoustical Society of America

Preprint

Number

Volume

Pages 1 - 1

Conference site Salt Lake City, USA

Conference date 23.5.2016- 27.5.2016

Abstract A method of analysis and visualization of displacements of an acoustic guitar string is presented. Vibrations of the strings are recorded using high-speed cameras. The optical system used for the recording is applied in order to make it possible to observe the vibrations along the string. Images recorded with high-speed cameras are analyzed using digital signal processing algorithms in order to track the shape of deflections and displacement of strings, with a high spatial resolution, and to convert the acquired video data into an acoustic signal. The acoustic signal derived from the visual analysis is then compared with a reference signal which was recorded simultaneously using a measurement microphone. The research experiments are aimed principally at studying the phenomena related to energy transfer of vibrating strings to the body of the instrument. [This research study was supported by the grant, funded by the Polish National Science Centre, decision number DEC-2012/05/B/ST7/02151.]

Entry No. 615

Entry type journal paper

Authors P. Szczuko, G. Szwoch, M. Szczodrak, J. Kotus, A. Czyżewski

English title

Polish title ZASTOSOWANIA DRONÓW I SENSORÓW WIZYJNYCH I AKUSTYCZNYCH DO ZDALNEJ DETEKCJI I LOKALIZACJI OBIEKTÓW I ZDARZEŃ

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages 1133 - 1137

Notes Liczba punktów 9

Streszczenie W referacie przedstawiono wybrane sensory akustyczne i wizyjne i propozycje ich zastosowania do wykrywania i lokalizacji obiektów i zdarzeń z pokładu drona. Opisano pokrótce zastosowane algorytmy analizy strumieni, przedstawiono wyniki badań stworzonych prototypów i metod, zaimplementowanych na wydajnych układach GPU.

Entry No. 616

Entry type conference paper

Authors A. Czyżewski , K. Marciniuk, B. Kostek

English title Dynamic Road Traffic Density Estimation Employing Noise Mapping with the Use of Grid Supercomputing

Polish title Dynamiczne szacowanie natężenia ruchu drogowego z wykorzystaniem odwzorowania hałasu z pomocą superkomputera

Conference Acoustical Society of America 2016 Meeting

Preprint 2478337

Number

Volume

Pages

Conference site Salt Lake City, USA

Conference date 23.5.2016- 27.7.2016

Abstract A noise prediction model of a large city agglomeration was elaborated in order to allow for a dynamic road traffic density estimation in vehicular networks. The implemented application adopts the model fed with traffic noise data based on frequently refreshed LDEN levels. Calculations were made with the use of the numerical model developed for his purpose and then implemented on the PL-Grid supercomputing infrastructure. Data obtained through supercomputing and through the use of a standard noise map computing software were collated with measured levels acquired from the acoustic city monitoring system and then analyzed. The comparison performed afterwards shows a relatively good accuracy of the developed model. The numerical model of traffic noise and its main sources are briefly characterized. A full day dynamic noise map can be browsed as a set of 24 noise maps, one for each hour of the day which in turn allows for vehicular traffic density estimation based exclusively on acoustical data.

Streszczenie Model przewidywania hałasu dużego miasta aglomeracji został opracowany, aby umożliwić dynamiczne szacowanie natężenia ruchu drogowego. Wdrożona aplikacja przyjmuje model zasilany danymi dotyczącymi hałasu ruchu w oparciu o często odświeżane poziomy LDEN. Obliczenia wykonano z wykorzystaniem numerycznego modelu, a następnie wdrożony na infrastrukturze PL-Grid Model numeryczny hałasu komunikacyjnego i jego główne źródła zostały krótko scharakteryzowane. Mapę całodniową dynamiczną hałasu można przeglądać jako zestaw 24 map hałasu, dla każdej pory dnia, co z kolei pozwala na oszacowanie gęstości ruchu kołowego opiera się wyłącznie na danych akustycznych.

Entry No. 617

Entry type journal paper

Authors A. Czyżewski, A. Górski, A. Korzeniewski, P. Odya, P. Szczuko

English title

Polish title Zastosowania elektroencefalograficznych interfejsów mózg-komputer do diagnozy i stymulacji osób po urazach mózgu

Journal Elektronika : konstrukcje, technologie, zastosowania

Volume 9

Number

Pages 75 - 78

Notes http://dx.doi.org/10.15199/13.2016.9.18 Liczba punktów 8

Streszczenie Przeanalizowano i opisano nowe rozwiązania kasków EEG, dostępne w laboratorium Katedry Systemów Multimedialnych Politechniki Gdańskiej. Opisano koncepcje prowadzenia z ich użyciem testów diagnostycznych i sesji terapeutycznych, polegających na stymulacji polisensorycznej, z podkreśleniem roli tego typu metod w ocenie świadomości stanu pacjentów pourazowych i usprawniania komunikacji osobami po urazach mózgu. Przedstawiono także sposób połączenia modalności elektroencefalograficznej ze śledzeniem wzroku w oparciu o skonstruowane urządzenie do tego celu. Omówiono ponadto zagadnienie badania słuchu u osób niekomunikujących się z wykorzystaniem nowoczesnego urządzenia diagnostycznego.

Entry No. 618

Entry type journal paper

Authors M. Szczodrak, A. Kurowski, J. Kotus, A. Czyżewski, B. Kostek

English title A system for acoustic field measurement employing cartesian robot

Polish title

Journal METROLOGY AND MEASUREMENT SYSTEMS

Volume 23

Number 3

Pages 333 - 343

Abstract A system setup for measurements of acoustic field, together with the results of 3D visualisations of acoustic energy flow are presented in the paper. Spatial sampling of the field is performed by a Cartesian robot. Automatization of the measurement process is achieved with the use of a specialized control system. The method is based on measuring the sound pressure (scalar) and particle velocity (vector) quantities. The aim of the system is to collect data with a high precision and repeatability. The system is employed for measurements of acoustic energy flow in the proximity of an artificial head in an anechoic chamber. In the measurement setup an algorithm for generation of the probe movement path is included. The algorithm finds the optimum path of the robot movement, taking into account a given 3D object shape present in the measurement space. The results are presented for two cases, first without any obstacle and the other - with an artificial head in the sound field.

Entry No. 619

Entry type journal paper

Authors M. Szykulski, P. Bratoszewski, J. Kotus, A. Czyżewski, B. Kostek

English title

Polish title KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages

Notes doi:10.15199/59.2016.8-9.74

Abstract An audiovisual corpus containing 31 hours of English speech recordings is presented. The new corpus was created in order to assist the development of audiovisual speech recognition systems (AVSR). The corpus includes high-framerate stereoscopic video streams and audio recorded by both microphone array and a microphone built in a mobile computer. Owing to the inclusion of recordings made in noisy conditions, the corpus can be used to assess the robustness of speech recognition systems in the presence of acoustic noise.

Streszczenie W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus może być wykorzystany do badania wpływu zakłóceń na skuteczność rozpoznawania mowy.

Entry No. 620

Entry type

Authors A. Czyżewski, B. Kostek, G. Bogdanis, W. Sudomir

English title

Polish title Sposób i układ do weryfikacji tożsamości użytkownika w systemach informatycznych, zwłaszcza w systemach bankowych

Entry No. 621

Entry type

Authors A. Czyżewski, M. Lech, P. Hoffmann

English title

Polish title Sposób i układ identyfikacji i weryfikacji tożsamości, zwłaszcza klienta bankowego

Entry No. 622

Entry type conference paper

Authors P. Odya, B. Kostek, J. Kotus, M. Szczodrak, A. Czyżewski

English title Sound Field Analysis Around An Organ Pipe

Polish title

Conference DAGA 2016

Preprint

Number

Volume

Pages 275 - 278

Conference site Aachen, Niemcy

Conference date 14.3.2016- 17.3.2016

Notes poster

Abstract The aim of this paper is to examine sound field around an organ pipe measured under free-field conditions. Measurement methodology along with the equipment employed in this research study are described. Sound intensity is determined by utilizing an acoustic vector sensor. Issues related to the organ pipe activation providing constant air flow to secure long-term steady state responses of generated acoustic signals are presented. For this purpose an external compressor is applied. Sound energy flow is measured in a defined grid of points. The Cartesian robot is used for a precise positioning of the acoustic probe. Results of measurements of acoustic energy flow in an anechoic chamber are shown along with the analysis and visualization sound intensity distribution of radiated acoustic energy around the organ pipe.

Entry No. 623

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski

English title Face detection algorithms evaluation for the bank client verification

Polish title

Conference Signal Processing, Algorithms, Architectures, Arrangements, and Applications

Preprint

Number

Volume

Pages 186 - 190

Conference site Poznań, Polska

Conference date 21.9.2016- 23.9.2016

Notes ISBN 978-83-62065-24-0

Abstract Results of investigation of face detection algorithms in the video sequences are presented in the paper. The recordings were made with a miniature industrial USB camera in real conditions met in three bank operating rooms. The aim of the experiments was to check the practical usability of the face detection method in the biometric bank client verification system. The main assumption was to provide as much as possible user interaction with the application. Applied algorithms for face detection were described and results of the efficiency of face detection in the real bank environment conditions were presented and discussed.

Entry No. 624

Entry type conference paper

Authors J. Kotus, A. Czyżewski, B. Kostek

English title 3D acoustic field intensity probe design and measurements

Polish title

Conference XVII International Conference Noise Control 2016

Preprint

Number

Volume

Pages

Conference site Gniew, Polska

Conference date 22.5.2016- 25.5.2016

Abstract The aim of this paper is two-fold. First of all, some basic notions on acoustic field intensity and its measurement are shortly recalled. Then, the equipment and the measurement procedure used in the sound intensity in the performed research study are described. The second goal is to present details of the design of the engineered 3D intensity probe, as well as algorithms developed and applied. Results of the intensity probe measurements along with the calibration procedure are then contained and discussed. Comparison between the engineered and the reference commercial probe confirm that the designed construction is applicable to sound field intensity measurements with a sufficient effectiveness.

Entry No. 625

Entry type conference paper

Authors A. Czyżewski , B. Kostek

English title A Study in Experimental Methods of Human-Computer Communication for Patients After Severe Brain Injuries

Polish title Studium metod komunikacji człowiek-komputer dla pacjentów po ciężkich urazach mózgu

Conference 4th International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO) 2016

Preprint

Number

Volume

Pages

Conference site Granada, Spain

Conference date 20.4.2016- 22.4.2016

Notes Rozdział w książce Bioinformatics and Biomedical Engineering Volume 9656 of the series Lecture Notes in Computer Science pp. 689-703

Abstract Experimental research in the domain of multimedia technology applied to medical practice is discussed, employing a prototype of integrated multimodal system to assist diagnosis and polysensory stimulation of patients after severe brain injury. The system being developed includes among others: eye gaze tracker, and EEG monitoring of non-communicating patients after severe brain injuries. The proposed solutions are used for collecting and analyzing patients’ responses and interactions induced by the multimodal stimulation, resulting in assessing the influence of stimuli on increase of patient’s cognitive and communicative functions with the use of intelligent data analysis methods.

Entry No. 626

Entry type conference paper

Authors A. Czyżewski, B. Kostek, M. Szykulski, T.E Ciszewski

English title Building Knowledge for the Purpose of Lip Speech Identification

Polish title Przygotowanie bazy w celu indentyfikacji wizemów

Conference MISSI 2016, 10th International Conference on Multimedia & Network Information Systems

Preprint

Number 10

Volume 506

Pages 3 - 14

Conference site Wrocław, Polska

Conference date 14.9.2016- 16.9.2016

Notes link: http://link.springer.com/chapter/10.1007/978-3-319-43982-2_1 Advances in Intelligent Systems and Computing, Springer Verlag,

Abstract Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of three cameras. Video signals, synchronized with audio, were registered and then analyzed. Encountered problems related to video registration and results achieved are discussed. Słowa kluczowe: audio-visual speech recognition · AVSR · thermovision · stereovision · Time-of-Flight · phonetic transcription

Streszczenie W publikacji przedstawiono kolejne kroki związane z przygotowaniem bazy wideo-fonicznych nagrań mowy. W pierwszej kolejności zaproponowano materiał językowy do nagrań w kontekście analizy leksykalnej występowania fonemów w j. angielskim. Nagrano pięć osób z wykorzystaniem trzech kamer i jednoczesną rejestracją dźwięku. Przedstawiono i przedyskutowano przykłady analiz nagranego materiału wideo-fonicznego. indeksowanie: Web of Science, IEEE Xplore, Google Scholar, Springerlink, ISI Proceedings, SCOPUS

Entry No. 627

Entry type journal paper

Authors A. Czyżewski, B. Kostek, A. Górski

English title Method and Application of Auditory-Visual Attention Training

Polish title Metoda audiowizualnego treningu uwagi i jej zastosowanie

Journal J. Acoust. Soc. Amer.

Volume 139

Number 4

Pages 1993 - 1993

Notes http://dx.doi.org/10.1121/1.4949836

Abstract The main idea underlying the proposed attention training is to perform stimulation of the hearing and sight senses employing digital signal processing algorithms controlled by electroencephalography signals. The auditory and visual stimuli are designated to force the perception through hearing and sight senses by the appropriate hemisphere. The applied speech modification uses a non-uniform real-time speech stretching algorithm. The video content retrieval showing the speaker's face is slowed down, accordingly. Research experiments employed subjects with central auditory and visual processing disorders revealing severe communication difficulties. The effectiveness of the proposed method has been shown using formal attention focus tests. It was demonstrated that the proposed method of attention training helps improve speech understanding and reading skills in examined subjects. [Research sponsored by the Polish National Science Centre, Dec. No. DEC-2014/15/B/ST7/04724.]

Entry No. 628

Entry type journal paper

Authors A. Czyżewski , K. Marciniuk, B. Kostek

English title Dynamic Road Traffic Density Estimation Employing Noise Mapping with the Use of Grid Supercomputing

Polish title Szacowanie natężenia ruchu drogowego z zastosowaniem tworzenia map hałasu przy zastosowaniu gridu superkomputerowego

Journal J. Acoust. Soc. Amer.

Volume 139

Number 4

Pages 2006 - 2006

Notes http://dx.doi.org/10.1121/1.4949894

Abstract A noise prediction model of a large city agglomeration was elaborated in order to allow for a dynamic road traffic density estimation in vehicular networks. The implemented application adopts the model fed with traffic noise data based on frequently refreshed LDEN levels. Calculations were made with the use of the numerical model developed for his purpose and then implemented on the PL-Grid supercomputing infrastructure. Data obtained through supercomputing and through the use of a standard noise map computing software were collated with measured levels acquired from the acoustic city monitoring system and then analyzed. The comparison performed afterwards shows a relatively good accuracy of the developed model. The numerical model of traffic noise and its main sources are briefly characterized. A full day dynamic noise map can be browsed as a set of 24 noise maps, one for each hour of the day which in turn allows for vehicular traffic density estimation based exclusively on acoustical data.

Entry No. 629

Entry type conference paper

Authors M. Lech, A. Czyżewski

English title A handwritten signature verification method employing a tablet

Polish title

Conference Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Preprint

Number

Volume

Pages 45 - 50

Conference site Poznań, Polska

Conference date 21.9.2016- 23.9.2016

Abstract A signature verification system based on static features and time-domain functions of signals obtained using a tablet has been presented in the paper. The signature verification method, based mainly on dynamic time warping coupled with some signature image features, has been described. The FRR measures reflecting the method's efficiency have been evaluated for verification attempts performed directly after obtaining model signatures and for reiterated attempts made after two days. The FAR measures have been assessed both: for simple and for skilled forgeries. Obtained results are presented and discussed in the paper.

Entry No. 630

Entry type journal paper

Authors A. Kurowski, J. Kotus, B. Kostek, A. Czyżewski

English title NUMERICAL SIMULATION OF THE SOUND INTENSITY DISTRIBUTION IN THE PROXIMITY OF THE ACOUSTIC DIFFUSER

Polish title Pomiar rozkładu wektora natężenia dźwięku w pobliżu dyfuzora akustycznego weryfikowany symulacją komputerową

Journal Zeszyty naukowe WE PG

Volume 51

Number

Pages 97 - 101

Streszczenie Projektowanie adaptacji akustycznej pomieszczeń jest złożonym procesem, który wymaga możliwości przewidywania wpływu zastosowanych ustrojów akustycznych na sposób propagacji fal akustycznym w pomieszczeniu. Przykładem ustroju stosowanego do korekcji akustyki pomieszczeń jest dyfuzor akustyczny. Niniejsza praca opisuje proces pomiaru oraz numerycznej symulacji rozkładu wektora natężenia akustycznego w pobliżu dyfuzora. Analiza tego rozkładu pozwala zaobserwować zjawisko transportu energii akustycznej w pobliżu badanego obiektu. Wyniki badań przedstawiono w formie graficznej. Przygotowane zostały także mapy różnic pomiędzy rozkładem wektora natężenia dźwięku zmierzonego bez i z dyfuzorem. Jako obiekt referencyjny wykorzystana została płaska powierzchnia odbijająca. Dzięki takiemu podejściu możliwe było zaobserwowanie i opisanie wpływu zjawiska rozproszenia dźwięku przez dyfuzor na rozkład otaczającego pola akustycznego

Entry No. 631

Entry type conference paper

Authors A. Kurowski, J. Kotus, B. Kostek, A. Czyżewski

English title Numerical modeling of sound intensity distributions around acoustic transducer

Polish title

Conference 140th Audio Eng. Society Convention

Preprint 9525

Number

Volume

Pages 1 - 10

Conference site Paryż, Francja

Conference date 4.6.2016- 7.6.2016

Abstract The aim of this research study is to measure, simulate and compare sound intensity distribution generated by the acoustic transducers of the loudspeaker. The comparison of the gathered data allows for validating the numerical model of the acoustic radiation. An accurate model of a sound source is necessary in mathematical modeling of the sound field distribution near the scattering obstacles. An example of such obstacle is a human head. Preparation of a robust mathematical model of the sound field generated by a loudspeaker is one of the important factors in simulation of sound waves scattering by the human head. The numerical model is developed for the purpose of this kind of research.

Entry No. 632

Entry type journal paper

Authors T. Ciszewski, A. Czyżewski, B. Kostek

English title Methodology and technology for the polymodal allophonic speech transcription

Polish title Metodyka i technologia wielomodalnej transkrypcji mowy

Journal Proc. of Meetings on Acoustics, Acoustical Society of America

Volume 26

Number

Pages 1 - 15

Notes doi: 10.1121/2.0000300

Abstract A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for the same phoneme produced in different phonetic environments and the objective signal parameters (both audio and video) is carried out. The method is sensitive to minute allophonic detail as well as to accentual differences. It is shown that by using the analysis of video signals together with the acoustic signal, speech transcription can be performed more accurately and robustly than by using the acoustic modality alone. In particular, various features extracted from the visual signal are tested for their abilities to encode allophonic variations in pronunciation. New methods for modeling the accentual and allophonic variation of speech are developed.

Streszczenie Publikacja opisuje sposób automatycznej transkrypcji audiowizualnej mowy w oparciu o: akustyczną i wizualną reprezentację mowy. Założono łączenie modalności audio i wizualnej, które zapewniają efekt synergii w zakresie dokładności rozpoznawania mowy. W szczególności nacisk położono na wydobywania różnnych cech z zapisu wizyjnego, które zostały przetestowane pod kątem ich przydatności do reprezentowania różnic w wymowie na poziomie alofonicznym. N

Entry No. 633

Entry type conference paper

Authors B. Kostek, P. Szczuko, J. Kotus, M. Szczodrak, A. Czyżewski

English title Guitar String Sound Retrieved from Moving Pixels

Polish title

Conference Spring (171st) 2016 Meeting of the Acoustical Society of America

Preprint

Number

Volume

Pages 1 - 8

Conference site Salt Lake City, USA

Conference date 23.5.2016- 27.3.2016

Abstract The aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing the vibration image into to the sound recording. The sound reconstructed in this way is compared with the sound recorded synchronously with the reference measurement microphone.

Entry No. 634

Entry type journal paper

Authors K. Łopatka, A. Czyżewski, B. Kostek

English title Improving listeners' experience for movie playback through enhancing dialogue clarity in soundtracks

Polish title Poprawa jakości odbioru multimediów poprzez zwiększenie wyrazistości dialogów w ścieżce dźwiękowej

Journal Digital Signal Processing

Volume 48

Number

Pages 40 - 49

Notes doi:10.1016/j.dsp.2015.08.015, opublikowano online 8.09.2015 http://www.sciencedirect.com/science/article/pii/S105120041500264X

Abstract his paper presents a method for improving users' quality of experience through processing of movie soundtracks. The dialogue clarity enhancement algorithms were introduced for detecting dialogue in movie soundtrack mixes and then for amplifying the dialogue components. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity between left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve an increased dialogue intelligibility. Techniques for reduction of artifacts in the processed signal are also introduced. It is done through smoothing in the time domain and in the frequency domain, applied to reduce unpleasant artifacts. The results of objective and subjective tests are provided, which prove that an increased dialogue intelligibility is achieved with the aid of the proposed algorithm. The algorithm is particularly applicable in mobile devices while listening in changing conditions and in the presence of noise.

Streszczenie W artykule przedstawiono sposób poprawy percypowanej jakości multimediów (Quality of Experience) poprzez przetwarzanie ścieżek dźwiękowych filmów. Wprowadzono algorytm poprawy wyrazistości dialogów filmowych. Sygnały z kanałów przednich (lewy, prawy, środkowy) są analizowane w dziedzinie częstotliwości. Wybrane składowe częstotliwościowe, które wykazują dużą dysparycję pomiędzy kanałem środkowym a kanałami bocznymi, są zidentyfikowane jako związane z dialogiem i wzmocnione w celu zwiększenia wyrazistości mowy. Opisano techniki redukcji artefaktów w przetworzonym sygnale. Polegają one na wygładzaniu w dziedzinie czasu i częstotliwości. Przedstawiono wyniki testów obiektywnych oraz subiektywnych. które potwierdzają, że dzięki zastosowaniu zaproponowanego algorytmu osiąga się zwiększoną wyrazistość mowy. Algorytm znajduje zastosowanie zwłaszcza przy odsłuchu w zmiennych warunkach w obecności zakłóceń zewnętrznych.

Entry No. 635

Entry type conference paper

Authors A. Czyżewski, P. Hoffmann, P. Bratoszewski

English title

Polish title Prezentacja stanowiska biometrycznego

Conference Bio Day w Banku PKO S.A.

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 12.10.2016- 12.10.2016

Entry No. 636

Entry type conference paper

Authors A. Czyżewski, A. Korzeniewski, P. Odya, P. Szczuko, R. Kupniewski

English title

Polish title Multimodalne stanowisko do polisensorycznej diagnozy i stymulacji osób z zaburzeniami komunikacji

Conference XV Krajowa Konferencja Elektroniki

Preprint

Number

Volume

Pages 333 - 338

Conference site Darłówko Wschodnie, Polska

Conference date 6.6.2016- 10.6.2016

Streszczenie Celem komunikatu plakatowego jest prezentacja eksperymentalnego zintegrowanego systemu multimodalnego, przeznaczonego do wykorzystania w diagnozowaniu i stymulacji polisensorycznej osób niekomunikujących się, w szczególności osób z ciężkimi urazami mózgu. Interfejs użytkownika wykorzystuje śledzenie wzroku i monitorowanie elektroencefalograficzne. Ponadto elementami tego stanowiska są: emiter bodźców zapachowych oraz urządzenie do obiektywnych badań słuchu, a także system projekcji autostereoskopowej przeznaczony do stymulowania osób poprzez ich zanurzanie w otoczeniu wirtualnym. Zaprezentowane zostaną rozwiązania multimodalnej akwizycji i analizy pozyskiwanych danych do celu oceny wpływu stymulacji na poprawę funkcji kognitywnych i zdolności komunikacyjnych. Ocena zdolności komunikacyjnych z zastosowaniem opracowanego systemu jest prowadzona w wiodących w skali kraju placówkach opieki długoterminowej nad pacjentami po urazach mózgowych, w którymi nawiązano współpracę.

Entry No. 637

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Technologia dynamicznego podpisu biometrycznego

Conference VI Konferencja i Narodowy Test Interoperacyjności Podpisu Elektronicznego CommonSign 2016

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 26.10.2016- 27.10.2016

Notes referat udokumentowany wydrukiem wygłoszonej prezentacji

Abstract Przedstawiono opracowane wyposażenie Multimodalnego stanowiska bankowego, udostępniającego możliwość identyfikacji biometrycznej. Omówiono integrację wielu metod biometrycznej weryfikacji tożsamości w zakresie sprzętowym i programowym. Uzasadniono możliwość zmniejszenia ryzyka błędnej weryfikacji tożsamości przy użyciu technologii dynamicznego podpisu biometrycznego. Zilustrowano budowę eksperymentalnego stanowiska bankowego na Politechnice Gdańskiej

Entry No. 638

Entry type journal paper

Authors A. Czyżewski, A. Ciarkowski, B. Kostek, J. Kotus, K. Łopatka, P. Suchomski

English title Adaptive Personal Tuning of Sound in Mobile Computers

Polish title

Journal J. Audio Eng. Soc.

Volume 64

Number 6

Pages 405 - 428

Notes https://doi.org/10.17743/jaes.2016.0014

Abstract An integrated methodology for enhancing audio quality in mobile computers is presented, whose key features are adapting the acoustic track to changing acoustic conditions of the environment, and matching audio characteristics to the users’ individual preferences. Signal processing algorithms included linearizing the frequency response, enhancing dialogue intelligibility, and adjusted dynamics to the users’ hearing characteristics. Algorithms were tested on two different computers (an All-in-one and a laptop), both of which were located in quiet office-like conditions but in the presence of strong noise. In general, test results showed that audio processing methods were useful tools for the improvement of the sound quality in compact computers. For example, although most the listeners were untrained, the processing for speech clarity in noise (dialogue enhancement and dynamics processing) yielded the highest scores. The majority of the results indicated that listeners perceive the processing as being desirable and useful.

Entry No. 639

Entry type conference paper

Authors B. Kostek, M. Plewa, A. Czyżewski

English title Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Polish title Materiał foniczno-wizyjny nagrany w różnych warunkach na potrzeby automatycznej transkrypcji mowy

Conference 141 Audio Eng. Soc. Convention

Preprint 9648

Number

Volume

Pages 1 - 9

Conference site Los Angeles, USA

Conference date 29.9.2016- 2.10.2016

Notes wersja elektroniczna, http://www.aes.org/e-lib/browse.cfm?elib=18452

Abstract Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but the need is also encountered in applications such as automatic recognition of speech in movies for non-native speakers, for impaired users, and as a support for multimedia systems. This work contains an attempt to analyze speech recorded in various conditions. First, audio and video recordings of specially constructed list of words in English were prepared in order to perform dedicated audio and video analyses in the future stages of the research aiming at audio-visual speech recognition systems (AVSR) development. A dataset of audio-video recordings was prepared and examples of analyses are described in the paper.

Streszczenie W referacie przedstawiono materiał foniczno-wizyjny nagrany w różnych warunkach akustycznych. Zawarto w nim przykłady analiz oraz przedstawiono dyskusję wyników w kontekście możliwości automatycznej transkrypcji mowy.

Entry No. 640

Entry type journal paper

Authors A. Czyżewski, T. Ciszewski, B. Kostek

English title Methodology and technology for the polymodal allophonic speech transcription

Polish title Metodologia i technologia wielomodalnej transkrypcji mowy

Journal J. Acoust. Soc. Amer.

Volume 139

Number 4

Pages 2017 - 2017

Notes http://dx.doi.org/10.1121/1.4949947

Abstract A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory setting of speech organs for the same phoneme produced in different phonetic environments and the objective signal parameters (both audio and video) is carried out. The method is sensitive to minute allophonic detail as well as to accentual differences. It is shown that by using the analysis of video signals together with the acoustic signal, speech transcription can be performed more accurately and robustly than by using the acoustic modality alone. In particular, various features extracted from the visual signal are tested for their abilities to encode allophonic variations in pronunciation. New methods for modeling the accentual and allophonic variation of speech are developed.

Entry No. 641

Entry type conference paper

Authors A. Czyżewski

English title Verification of banking clients based on multimodal biometrics

Polish title Weryfikacja klientów bankowych oparta na wielomodalnej biometrii

Conference Impact'16 fintech/insurtech

Preprint

Number

Volume

Pages

Conference site Wrocław , Polska

Conference date 7.12.2016- 8.12.2016

Notes keynote speech (referat plenarny) - wystąpienie udokumentowane wydrukiem prezentacji

Abstract Methods for verifying of banking clients based on multimodal biometrics was presented in keynote speech. The prevalence of various biometric techniques has been discussed. A block diagram of the developed multimodal biometric system was shown and its individual components described, individually, such as: the biometric pen, Hand Vain biometrics, face biometrics, voice biometrics.

Streszczenie Metody weryfikowania klientów bankowych w oparciu o multimodalnych biometrycznych zostały przedstawione w wystąpieniu plenarnym. Stopień upowszechnienia różnych technik biometrycznych został omówiony. Schemat blokowy opracowanego multimodalnego systemu biometrycznej weryfikacji klientów bankowych został pokazany i jego poszczególne elementy indywidualnie opisane, takie jak: pióro biometryczne, skaner naczyń biometrycznych dłoni, biometria twarzy, biometria głosowa.

Entry No. 642

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Nowe możliwości technologicznego wsparcia osób z zaburzeniami przetwarzania słuchowego - doświadczeń Politechniki Gdańskiej

Conference SLI, APD Nowe spojrzenie na specyficzne trudności w funkcjonowaniu dzieci z zaburzeniami językowymi i słuchowymi

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 21.10.2016- 21.10.2016

Notes wystąpienie plenarne potwierdzone wydrukiem prezentacji i certyfikatem uczestnictwa

Streszczenie W referacie plenarnym przedstawiono nowe możliwości technologicznego wsparcia osób z zaburzeniami przetwarzania słuchowego, wynikające z prac badawczych i aplikacyjnych, prowadzonych w Politechnice Gdańskiej. Dotyczą one przede wszystkim metod synchronizacja półkul mózgowych, stymulacja uwagi słuchowej i wzrokowej, analizy sygnałów elektroencefalograficznych (EEG), nowych zastosowań systemów śledzenia wzroku i in.

Entry No. 643

Entry type conference paper

Authors A. Czyżewski

English title Methods of human-computer communication to diagnose and to stimulate patients with severe traumatic brain injuries

Polish title Metody komunikacji człowiek-komputer do diagnozy i stymulowania pacjentów z ciężkimi uszkodzeniami mózgu

Conference Investing in Medical Innovations Congress & Fair

Preprint

Number

Volume

Pages

Conference site Katowice, Polska

Conference date 18.2.2016- 19.2.2016

Notes referat plenarny udokumentowany w programie konferencji i w wydruku prezentacji

Abstract The increasing number of people who undergo brain damage is one of the most characteristic features of our contemporary society. Causes of brain damage may lead to a coma which may result in fast recovery or in a vegetative state (VS), a locked-in syndrome (LIS) or in brain death. There is a widespread erroneous belief that patients who recover from a coma without regaining consciousness, remain in a permanent vegetative state. In the vast majority of residential medical care facilities such patients are not diagnosed to a sufficiently detailed extent. European studies show that the ratio of incorrect diagnostic assessment of patients' consciousness amounts to as much as 40%. Therefore, within the project we develop an application of integrated technologies: eye-gaze tracking (EGT), Auditory Brainstem Response (ABR), electroencephalography (EEG), electromyography (EMG), virtual reality and scent emitting for the polysensory stimulation, which together with results of subjective assessment in GCS (Glasgow Coma Scale), can support diagnosis and can positively influence cognitive and communicative functions of patients with brain injuries.

Streszczenie Zwiększająca się liczba osób, które doznały uszkodzenia mózgu jest jedną z najbardziej charakterystycznych cech współczesnego społeczeństwa. Uszkodzenia mózgu mogą spowodować śpiączkę, która może ustąpić (wybudzenie)lub może przejść w utrwalony stan wegetatywny (VS), zespół zamknięcia (SIT) lub śmierć mózgowa. Panuje powszechne przekonanie, że pacjenci wybudzeni ze śpiączki nie odzyskawszy przytomności, pozostają w trwałym stanie wegetatywnym. W ogromnej większości placówek opieki medycznej tacy pacjenci nie są zdiagnozowani w wystarczająco szczegółowym stopniu. Badania europejskie wskazują, że wskaźnik błędnej oceny diagnostycznej świadomości pacjenta wynosi nawet 40%. Dlatego w ramach projektu opracowujemy aplikacje zintegrowanych technologii: śledzenia wzroku (EGT), słuchowe potencjały wywołane pnia mózgu(ABR), elektroencefalografia (EEG), elektromiografii (EMG), wirtualna rzeczywistość i zapach emitowany w ramach stymulacji polisensorycznej, które razem z wynikami subiektywnej ocenie GCS (Skala Glasgow), mogą zwiększać trafność diagnozy i przez to pozytywnie wpłynąć na funkcje poznawcze i komunikacyjne pacjentów z urazami mózgu.

Entry No. 644

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Interfejsy multimodalne - nowe technologie usprawniające komunikację człowiek-komputer

Conference "Mózgi Mózgom", Tydzień Mózgu

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 14.3.2016- 20.3.2016

Notes wystąpienie plenarne potwierdzone wydrukiem prezentacji

Streszczenie Korzystanie z komputerów stało się obecnie codzienną i powszechną praktyką. Istnieją jednak nowe sposoby komunikowania się z komputerami, inne niż za pomocą tradycyjnej klawiatury i myszki, które mają zastosowanie w diagnostyce i w usprawnianiu zmysłów słuchu, wzroku a nawet powonienia. Przy ich użyciu możliwe staje się zwiększanie efektywności wspomaganego komputerowo nauczania, także poprzez poprawę skuteczności diagnostyki deficytów utrudniających uczenie się oraz dzięki odpowiednim treningom prowadzonym z użyciem komputera. Obok często występujących problemów z niedowzrocznością i niedosłuchem oraz niepłynnością mowy, u stosunkowo licznej populacji występują specyficzne problemy spowodowane zaburzeniami przetwarzania bodźców na poziomie centralnym, tzn. w ośrodkach zlokalizowanych w korze mózgowej. Problemy te przejawiają się odpowiednio w postaci: syndromu leniwego oka (amblyopia), ograniczeniu stereopsji (widzenia przestrzennego), w słabym rozumieniu szybkiej mowy, w nasilonym jąkaniu się, w dysleksji, dysgrafii i in. Wyniki nowych badań eksperymentalnych dowodzą, że z pomocą odpowiednio dostosowanych urządzeń komputerowych można także diagnozować choroby neurodegeneracyjne, np. parkinsonizm, a nawet badać stopień świadomości osób znajdujących się w śpiączce lub w stanie wegetatywnym. W referacie zostały zaprezentowane opracowane w Politechnice Gdańskiej przykładowe rozwiązania, które umożliwiają badanie i usprawnianie zmysłów komunikacji oraz diagnozowanie chorób neurodegeneracyjnych, a także nawiązywanie kontaktu z otoczeniem przez osoby, które utraciły ten kontakt z powodu urazów mózgu.

Entry No. 645

Entry type conference paper

Authors A. Czyżewski

English title

Polish title CyberOko

Conference Ocena i wspomaganie rozwoju psychoruchowego dziecka – nowe trendy w diagnostyce i terapii

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 16.5.2016- 16.5.2016

Notes Referat z okazji otrzymania głównej nagrody w konkursie "Innowacje na rzecz poprawy jakości życia i zdrowia osób z dysfunkcjami", udokumentowany wydrukiem prezentacji i kopią otrzymanego dyplomu

Streszczenie Prowadzone badania dotyczą dziedziny technologii multimedialnych, w zastosowaniu do szczególnie trudnych przypadków neurologicznych, jakimi są ciężkie urazy mózgowe, z wykorzystaniem opracowanego systemu multimodalnego, przeznaczonego do wykorzystania w diagnozowaniu i stymulacji polisensorycznej, z interfejsem użytkownika opartym na śledzeniu wzroku i monitorowaniu elektroencefalograficznym pacjentów niekomunikujących się. Rozwiązania multimodalnej akwizycji i analizy pozyskiwanych danych wykorzystywane są do pozyskiwania informacji o reakcjach na bodźce, w celu oceny wpływu stymulacji na poprawę funkcji kognitywnych i zdolności komunikacyjnych pacjenta. Ocena jest prowadzona z wykorzystaniem metod inteligentnej analizy zawartości bazy danych pozyskanych w placówkach opieki długoterminowej nad pacjentami po urazach mózgowych.

Entry No. 646

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Metody komunikacji człowiek-komputer do diagnozowania i stymulacji pacjentów z ciężkimi urazami mózgu

Conference Konferencja Stowarzyszenia Absolwentów w Gdańskim Uniwersytecie Medycznym

Preprint

Number

Volume

Pages

Conference site Gdansk, Polska

Conference date 5.2.2016- 5.2.2016

Notes referat udokumentowany wydrukiem wygłoszonej prezentacji i ogłoszeniem o wystąpieniu

Abstract W referacie wygłoszonym w Gdańskim Uniwersytecie Medycznym przedstawiono tezę, że dotychczasowe metody diagnostyki i terapii są długotrwałe, w wielu przypadkach uciążliwe dla pacjenta, wymagają kosztownych urządzeń, ich skuteczność bywa często niezbyt wysoka. Starano się wykazać, że dzięki zastosowaniu nowoczesnych technologii multimedialnych można usprawnić i ułatwić proces edukacji, terapii lub rehabilitacji, zaś opracowane rozwiązania mogą także znaleźć zastosowanie w wielu dziedzinach medycznej diagnostyki i terapii.

Entry No. 647

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Wybrane zastosowania technologii multimedialnej do celów wykrywania obiektów i zdarzeń

Conference

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 12.5.2016- 12.5.2016

Notes referat plenarny udokumentowany w programie konferencji i w wydruku prezentacji

Abstract W referacie wygłoszonym na zaproszenie Polskiej Platformy Bezpieczeństwa Wewnętrznego na Stadionie Narodowym w dniu 12 maja 2016 r. przedstawiono następujące zagadnienia: detekcja multimodalna obiektów i zdarzeń, pasywny radar akustyczny, komunikacja pojazdów z drogowymi systemami monitoringu, inteligentne znaki drogowe, detekcja zdarzeń za pomocą dronów UAV.

Entry No. 648

Entry type conference paper

Authors M. Szczodrak, A. Czyżewski

English title

Polish title Zastosowanie zintegrowanego czujnika położenia i przyspieszeń do szacowania stanu nawierzchni dróg

Conference X Polski Kongres ITS

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 16.5.2017- 17.5.2017

Streszczenie Ocena stanu nawierzchni dróg stanowi istotne zadanie organów nadzorujących drogi, a informacja jest niezwykle ważna dla kierowców, zarówno z punktu widzenia komfortu jak i bezpieczeństwa. W referacie przedstawiono zastosowanie miniaturowego czujnika położenia i przyśpieszeń, zainstalowanego w miniaturowym przewoźnym urządzeniu pomiarowym, do oszacowania stanu nawierzchni drogi. Urządzenie jest przeznaczone do zamontowania w pojeździe, zawiera wbudowany akcelerometr, żyroskop i odbiornik GPS. Małe rozmiary sprawiają, że takie urządzenia mogą być użytkowane w sposób wygodny przez wielu uczestników ruchu drogowego, co umożliwia dużą skalę organizowanych pomiarów i częstą aktualizację danych, stanowiąc potencjalnie wartość dodaną w odniesieniu do stosowania kosztownego sprzętu profesjonalnego w niezbyt częstych odstępach czasowych przez zarządców dróg. Przeprowadzono eksperymenty, podczas których zebrano dane pomiarowe z kilkudziesięciu przejazdów ustaloną trasą o zróżnicowanym stanie nawierzchni. Podjęto próby klasyfikacji jakości nawierzchni na podstawie stworzonego algorytmu. Ponadto, integracja położenia i wielowymiarowo monitorowanych przyśpieszeń z odczytami danych z czujników pojazdu pozwala pozyskać wiele dodatkowych informacji o sposobie poruszania się pojazdów po drogach. W referacie opisano budowę i oprogramowanie zintegrowanego czujnika i przedstawiono stan zaawansowania prac związanych z jego badaniami w Politechnice Gdańskiej.

Entry No. 649

Entry type book

Authors P. Szczuko, M. Lech, A. Czyżewski

English title Comparison of Classification Methods for EEG Signals of Real and Imaginary Motion

Polish title

Editor U. Stańczyk et al. (eds.), Advances in Feature Selection for Data and Pattern Recognition, Intelligent Systems Reference Library 138,

Pages 227 - 239

Abstract The classification of EEG signals provides an important element of brain- computer interface (BCI) applications, underlying an efficient interaction between a human and a computer application. The BCI applications can be especially useful for people with disabilities. Numerous experiments aim at recognition of motion intent of left or right hand being useful for locked-in-state or paralyzed subjects in controlling computer applications. The chapter presents an experimental study of several methods for real motion and motion intent classification (rest/upper/lower limbs motion, and rest/left/right hand motion). First, our approach to EEG recordings segmentation and feature extraction is presented. Then, 5 classifiers (Naïve Bayes, Decision Trees, Random Forest, Nearest-Neighbors NNge, Rough Set classifier) are trained and tested using examples from an open database. Feature subsets are se- lected for consecutive classification experiments, reducing the number of required EEG electrodes. Methods comparison and obtained results are presented, and a study of features feeding the classifiers is provided. Differences among participating sub- jects and accuracies for real and imaginary motion are discussed. It is shown that though classification accuracy varies from person to person, it could exceed 80% for some classifiers.

Entry No. 650

Entry type conference paper

Authors B. Kostek, M. Piotrowska, A. Czyżewski

English title Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

Polish title

Conference 143rd Audio Engineering Society Convention

Preprint 9847

Number

Volume

Pages

Conference site New York, USA

Conference date 18.10.2017- 21.10.2017

Abstract The purpose of this study was to apply Self-Organizing Maps to differentiate between the correct and the incorrect allophone pronunciations and to compare the results with subjective evaluation. Recordings of a list of target words, containing selected allophones of English plosive consonants, the velar nasal and the lateral consonant, were made twice. First, the target words were read from the list by nine non-native speakers and then repeated after a phonology expert’s recorded sample. Afterwards, two recorded signal sets were segmented into allophones and parameterized. For that purpose, a set of descriptors, commonly employed in music information retrieval, was utilized to determine whether they are effective in allophone analysis. The phonology expert’s task was to evaluate the pronunciation accuracy of each uttered allophone. Extracted feature vectors along with the assigned ratings were applied to SOMs.

Entry No. 651

Entry type conference paper

Authors B. Kostek, M. Piotrowska, T. Ciszewski, A. Czyżewski

English title Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

Polish title

Conference 142nd Audio Engineering Society Convention

Preprint 9716

Number

Volume

Pages

Conference site

Conference date 20.5.2017- 23.5.2017

Abstract An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted in partitioning by editing two recorded sets of words into allophones, then signals were analyzed and subsequently audio excerpts were parametrized. The comparison of two sets of allophones was reinforced by the phonology expert’s assessment of produced speech sounds. Analyses presented in this paper allowed for determining a set of parameters describing an allophone pronunciation.

Entry No. 652

Entry type conference paper

Authors A. Czyżewski, B. Kostek, A. Kurowski, P. Szczuko, M. Lech, P. Odya, A. Kwiatkowska

English title Multimodal Approach For Polysensory Stimulation And Diagnosis Of Subjects With Severe Communication Disorders

Polish title

Conference HCist - International Conference on Health and Social Care Information Systems and Technologies

Preprint

Number

Volume

Pages

Conference site Barcelona, Hiszpania

Conference date 8.11.2017- 10.11.2017

Abstract An experimental multimodal system, designed for polysensory diagnosis and stimulation of non-communicative subjects, with severe brain injuries is presented. The user interface uses an eye-tracking device and EEG monitoring of the subject. The system is evaluated on 9 patients, data analysis methods are described, and experiments of correlating Glasgow Coma Scale with extracted features describing subjects performance in therapeutic exercises exploiting EEG and eyetracker are presented. Performance metrics are proposed, and k-means clusters used to define concepts for mental states related to EEG and eyetracking activity. Finally, it is shown that the strongest correlations are between the number of detected mental states and GCSe score, and between maximal length of mental state and GCSm. Weaker correlations are reported as well. Moreover an approach to classification of real and imaginary motion of limbs is presented and discussed. Classifiers based on SVM, Artificial Neural Networks, and Rough Sets were trained and accuracy reaching 91% for the real, and up to 100% for the imaginary type of motion was observed. Assessments of communication skills and therapy is possible with the system, already employed in long-term care facility.

Entry No. 653

Entry type journal paper

Authors A. Czyżewski, B. Kostek, A. Kurowski, P. Szczuko, M. Lech, P. Odya, A. Kwiatkowska

English title Multimodal approach for polysensory stimulation and diagnosis of subjects with severe communication disorders

Polish title

Journal Procedia Computer Science

Volume 121

Number

Pages 238 - 243

Abstract An experimental multimodal system, designed for polysensory diagnosis and stimulation of non-communicative subjects, with severe brain injuries is presented. The user interface uses an eye-tracking device and EEG monitoring of the subject. The system is evaluated on 9 patients, data analysis methods are described, and experiments of correlating Glasgow Coma Scale with extracted features describing subjects performance in therapeutic exercises exploiting EEG and eyetracker are presented. Performance metrics are proposed, and k-means clusters used to define concepts for mental states related to EEG and eyetracking activity. Finally, it is shown that the strongest correlations are between the number of detected mental states and GCSe score, and between maximal length of mental state and GCSm. Weaker correlations are reported as well. Moreover an approach to classification of real and imaginary motion of limbs is presented and discussed. Classifiers based on SVM, Artificial Neural Networks, and Rough Sets were trained and accuracy reaching 91% for the real, and up to 100% for the imaginary type of motion was observed. Assessments of communication skills and therapy is possible with the system, already employed in long-term care facility.

Entry No. 654

Entry type conference paper

Authors A. Czyżewski, P. Bratoszewski, P. Hoffmann, M. Lech, M. Szczodrak

English title The project IDENT: Multimodal biometric system for bank client identity verification

Polish title

Conference 9th International Conference, MCSS 2017

Preprint

Number

Volume 785

Pages 16 - 32

Conference site Kraków, Polska

Conference date 16.11.2017- 17.11.2017

Bibliographic No. 13

Notes Communications in Computer and Information Science

Streszczenie Biometric identity verification methods are implemented inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank cli-ent voice recognition and hand vein distribution verification. A secure communication system based on an intra-bank client-server architecture was designed for this purpose. Hitherto achieved progress within the project is reported in this paper with a focus on the design and implementation of the developed biometric authentication system. Implemented multimodal biometric client identity verification methods are briefly outlined and results of hitherto obtained biometric sam-ple acquisition and analysis are reported.

Entry No. 655

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title A Method of Object Re-identiciation Applicable to Multicamera Surveillance Systems

Polish title Metoda reindentyfikacji obiektów do zastosowań w wielokamerowym monitoringu

Conference International Conference on Multimedia Communications, Services and Security, IEEE MCSS

Preprint

Number

Volume

Pages 98 - 109

Conference site Kraków, Polska

Conference date 16.10.2017- 17.10.2017

Notes Dziech A., Czyżewski A. (eds) Multimedia Communications, Services and Security. MCSS 2017. Communications in Computer and Information Science, vol 785. Springer, Cham

Abstract The paper addresses some challenges pertaining to the methods for tracking of objects in multi-camera systems. The tracking methods related to a single Field of Vision (FOV) are quite different from inter-camera tracking, especially in case of non-overlapping FOVs. In this case, the processing is directed to determine the probability of a particular object’s identity seen in a pair of cameras in the presence of places non-observed by any camera, thus an object can disappear in one observed region and then re-appear in another one. A methodology for evaluation of the introduced re-identification method is presented in the paper. Problems related to the preparation of the ground-truth database and to the impact of a single-camera tracking on the efficiency of the re-identification algorithm are discussed.

Streszczenie Artykuł porusza niektóre problemy związane z metodami do śledzenia obiektów w systemach wielokamerowych. Metody śledzenia związane z jednym polem widzenia (FOV) są zupełnie inne niż śledzenie między kamerami, szczególnie w przypadku nienakładających się na siebie pól widzenia. W W tym przypadku przetwarzanie jest ukierunkowane na określenie prawdopodobieństwa określona tożsamość obiektu widoczna w parze kamer w obecności Miejsca nieobserwowane przez żadną kamerę, przez co obiekt może zniknąć w jednym obserwowanym regionie, a następnie pojawi się ponownie w innym. Metodologia oceny wprowadzonej metody ponownej identyfikacji jest przedstawiona w referacie. Omówiono problemy związane z przygotowaniem bazy danych typu ground-truth oraz wpływ śledzenia pojedynczej kamery na wydajność algorytmu ponownej identyfikacji.

Entry No. 656

Entry type

Authors J. Kotus, G. Szwoch, A. Czyżewski

English title Sound intensity probe with correction system and calibration system and the method of correction and calibration of this probe

Polish title Sonda natężeniowa wraz z układem korekcji i układem kalibracji oraz sposób korekcji i kalibracji tej sondy natężeniowej

Notes Zgłoszenie patentowe PL

Streszczenie Wynalazek dotyczy sondy natężeniowej wyposażonej w układ korekcji (amplitudy i fazy) oraz sposobu i układu do kalibracji sondy natężeniowej. Proces kalibracji przeprowadza się zgodnie ze sposobem i w układzie ujawnionym w niniejszym zgłoszeniu. Celem kalibracji jest określenie parametrów niezbędnych do prawidłowego działania układu korekcji, będącego integralną częścią sondy natężeniowej.

Entry No. 657

Entry type conference paper

Authors M. Lech, A. Czyżewski

English title Handwritten signature verification system employing wireless biometric pen

Polish title

Conference 23rd International Symposium on Methodologies for Intelligent Systems (ISMIS)

Preprint

Number

Volume

Pages 1 - 10

Conference site Warszawa, Polska

Conference date 26.6.2017- 29.6.2017

Abstract The handwritten signature verification system being a part of the developed multimodal biometric banking stand is presented. The hardware component of the solution is described with a focus on the signature acquisition and on verification procedures. The signature is acquired employing an accelerometer and a gyroscope built-in the biometric pen plus pressure sensors for the assessment of the proper pen grip and then the signature verification method based on adapted Dynamic Time Warping (DTW) method is applied. Hitherto achieved FRR and FAR measures for the verification based exlusively on the biometric pen sensors and for the comparison on the parameters retrieved from the signature scanning pad are compared.

Entry No. 658

Entry type journal paper

Authors M. Lech, A. Czyżewski

English title Modified dynamic time warping method applied to handwritten signature authenticity verification

Polish title Zmodyfikowana metoda dynamicznego marszczenia czasu w zastosowaniu do weryfikacji autentyczności podpisu odręcznego

Journal Elektronika : konstrukcje, technologie, zastosowania

Volume 58

Number 4

Pages 18 - 25

Bibliographic No. 8

Abstract A signature verification system based on static features and time-domain functions of signals obtained using a tablet has been presented in the paper. The signature verification method, based mainly on dynamic time warping coupled with some signature image features, has been described. The FRR measures reflecting the method’s efficiency have been evaluated for verification attempts performed directly after obtaining model signatures and for delayed attempts made after two days. The FAR measures have been assessed both: for simple and for skilled forgeries. The dynamic time warping-based verification has been also examined after applying it to the signals obtained using the developed biometric pen. Obtained results are presented and discussed in the paper.

Streszczenie W artykule przedstawiono system weryfikacji autentyczności podpisu oparty na cechach statycznych i funkcjach czasowych sygnałów pozyskanych na tablecie. Opisano metodę weryfikacji autentyczności podpisu, opartą głównie na metodzie dynamicznego marszczenia czasu i wykorzystującą cechy statyczne wizerunku podpisu. Wyznaczono miary FRR określające skuteczność metody, dla prób weryfikacji następujących bezpośrednio po pozyskaniu modeli podpisów i dla prób odłożonych w czasie o 2 dni. Miary FAR wyznaczono zarówno dla fałszerstw prostych, jak i szkolonych. Weryfikacja oparta na metodzie dynamicznego marszczenia czasu została również zbadana po zastosowaniu jej do sygnałów pozyskanych z wykorzystaniem opracowanego pióra biometrycznego. W artykule przedstawiono i poddano dyskusji otrzymane wyniki.

Entry No. 659

Entry type journal paper

Authors A. Czyżewski, M. Piotrowska, B. Kostek

English title Analysis of allophones based on audio signal recordings and parameterization

Polish title Ocena wymowy wybranych alofonów na podstawie sparametryzowanych reprezentacji sygnału mowy

Journal J. Acoust. Soc. Amer.

Volume 141

Number

Pages 3521

Abstract The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping in time demands a precise determination of allophonic pronunciation in the context of phonemic transcription. The presented study is focused on creation of speech recordings that may serve for the analysis of allophone variation. Two sets of recordings are prepared. The first one consists of words read by the non-native speakers. Tempo of reading is forced by a teleprompter. In the second case, every word is played back from the recordings of the phonology expert and then the speaker repeats a particular word. The last stage is the assessment of recordings by the same expert. Scores assigned by the expert are included as a reference for signal analysis and parametrization. [Research sponsored by the Polish National Science Centre, Dec. No.2015/17/B/ST6/01874.]

Streszczenie Celem pracy było przygotowanie bazy nagrań wybranych słów, które posłużyły następnie do edycji wybranych alofonów. Nagrania dotyczyły mówców anglojęzycznych, jak również mówców o różnym stopniu znajomości j. angielskiego (w tym drugim przypadku nagrania powtórzone dwukrotnie, za drugim razem z odsłuchem mówcy anglojęzycznego). Kolejnym punktem był opis parametryczny wybranych alofonoów w kontekście automatycznej oceny wymowy.

Entry No. 660

Entry type book

Authors J. Kotus, A. Czyżewski

English title

Polish title Modelowanie procesów słuchowych w celu oceny ryzyka uszkodzeń słuchu

Editor

Pages 269 - 301

Notes Rozdział 12. Rozdział w monografii: Modelowanie procesów fizjologicznych i patologicznych. Redakcja: K. Cieślicki, T. Lipniacki, J. Waniewski (w druku)

Streszczenie W rozdziale przedstawiono sposób modelowania fizjologicznych procesów zachodzących w systemie słuchowym. Opisano szczegółowo autorską metodę oceny szkodliwego oddziaływania hałasu na słuch. W rozdziale opisano również kocepcję psychoakustycznej dozymetrii hałasowej. Bazuje ona na wyznaczaniu wartości czasowego przesunięcia progu słyszenia w pasmach krytycznych słuchu. Uwzględnia również efekty wywołane przez refleks akustyczny. W pracy przedstawiono również wyniki badań, które umożliły dokonania praktycznej weryfikacji zaproponowanej metody.

Entry No. 661

Entry type journal paper

Authors K. Marciniuk, B. Kostek, A. Czyżewski

English title Classifying type of vehicles on the basis of data extracted from audio signal characteristics

Polish title

Journal J. Acoust. Soc. Amer.

Volume 141

Number 5

Pages 3883 - 3883

Abstract The aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox, designed for music parameter extraction, is used to obtain a vector of parameters. Correlation analyses are performed to check whether extracted parameters enable to separate selected types of vehicle-associated noise, e.g.: car, truck and motorcycle. Behrens-Fisher statistics is used to find the most suitable parameters that may be contained in the optimized feature vector. The last step is to build a decision system that allows for the automatic classification of a vehicle type. The results of automatic classification of prepared vehicle-noise related samples are shown and discussed. Research was supported by the Polish National Centre for Research and Development within the grant No. OT4- 4B/AGH-PG-WSTKT.

Entry No. 662

Entry type conference paper

Authors A. Czyżewski, B. Kostek

English title Assessment of hearing in coma patients employing auditory brainstem response, electroencephalography, and eye-gaze-tracking

Polish title Ocena słuchu u pacjentów w śpiączce z wykorzystaniem odpowiedzi słuchowej pnia mózgu, elektroencefalografii i śledzenia wzroku

Conference Acoustics'17 (Acoustical Society of America)

Preprint

Number

Volume

Pages

Conference site Boston, USA

Conference date 25.6.2017- 29.6.2017

Notes The Journal of the Acoustical Society of America 141, 3903 (2017)

Abstract The results of the study conducted by Tagliaferri et al. in 12 European countries indicate that the ratio of registered brain injury cases in Europe amounts to 150-300 per 100 000 people, with the European mean value of 235 cases per 100 000 people. The project presented in the paper assumes development of a combined metric of patients’ state remaining in coma by intelligent fusion of GCS (subjective Glasgow Coma Scale or its derivatives) with objective data acquired using ABR (Auditory Brainstem Response), EEG (electroencephalography), and EGT (Eye-Gaze-Tracking). Variety of coma patients from cooperating medical care centers were examined. Senses examination involved the assessment of their function by a medical specialist, with special attention paid to hearing tests results obtained with an ABR measuring device. The assessment included speech-based cognitive functions such as comprehension, phonematic hearing and auditory gnosia. Achieved results are discussed in the paper showing that most patients remaining in coma after a severe brain injury have preserved the ability to receive sound stimuli. [The project was partially funded by the Polish National Science Centre on the basis of the decision No. DEC-2014/15/B/ST7/04724.]

Streszczenie Wyniki badania przeprowadzonego przez Tagliaferri et al. w 12 krajach europejskich wskazuje, że stosunek zarejestrowanych przypadków obrażeń mózgu w Europie wynosi 150-300 na 100 000 osób, przy średniej europejskiej wynoszącej 235 przypadków na 100 000 osób. Przedstawiony projekt zakłada opracowanie połączonego wskaźnika stanu pacjentów pozostających w stanie śpiączki poprzez inteligentną syntezę GCS (subiektywna skala Glasgow Coma Scale lub jej pochodne) z obiektywnymi danymi uzyskanymi przy użyciu ABR (Auditory Brainstem Response), EEG (elektroencefalografia), i EGT (Eye-Gaze-Tracking). Badano różnorodność pacjentów w śpiączce ze współpracujących ośrodków opieki medycznej. Badanie zmysłów obejmowało ocenę ich funkcji przez lekarza specjalistę, ze szczególnym uwzględnieniem wyników badań słuchu uzyskanych za pomocą urządzenia pomiarowego ABR. Ocena obejmowała funkcje kognitywne oparte na mowie, takie jak rozumienie, słuch fonemiczny i gnoza słuchowa. Osiągnięte wyniki zostały omówione w artykule pokazującym, że większość pacjentów pozostających w śpiączce po ciężkim uszkodzeniu mózgu zachowało zdolność do odbierania bodźców dźwiękowych. [Projekt był częściowo finansowany przez Narodowe Centrum Nauki na podstawie decyzji nr DEC-2014/15 / B / ST7 / 04724.]

Entry No. 663

Entry type book

Authors Ł. Kosikowski, A. Czyżewski, A. Senderski

English title Visual and Auditory Attention Stimulator for Assisting Pedagogical Therapy

Polish title Stymulator uwagi wizualnej i słuchowej wspomagający terapię pedagogiczną

Editor Advances in Intelligent Systems and Computing. Springer. Human-Computer Systems Interaction Backgrounds and Applications 4, Z. S. Hippe, J. L. Kulikowski, T. Mroczek Editors

Pages 31 - 41

Notes vol. 551

Abstract Visual and auditory attention stimulator provides a system developed in order to improve reading skills using simultaneous presentation of text in its visual form and in transformed auditory form accompanied by related movie material. The described research employed 40 children at the age of 8 13 years having difficulties in learning of reading, who were diagnosed as having developmental dyslexia. It was shown that application of the proposed method could improve reading skills in those children. The effectiveness of the method has been shown primarily using the D2 attention test designed by R. Brickenkamp in its Polish adaptation made by E. Dajek.

Streszczenie Stymulator uwagi wizualnej i słuchowej zapewnia system opracowany w celu poprawy umiejętności czytania za pomocą jednoczesnej prezentacji tekstu w jego formie wizualnej i przekształconej formie słuchowej wraz z powiązanym materiałem filmowym. W opisywanych badaniach wzięło udział 40 dzieci w wieku 8 13 lat mających trudności z nauką czytania, u których zdiagnozowano dysleksję rozwojową. Wykazano, że zastosowanie proponowanej metody może poprawić umiejętności czytania u tych dzieci. Skuteczność metody wykazano przede wszystkim za pomocą testu uwagi D2 opracowanego przez R. Brickenkampa w polskiej adaptacji E. Dajeka.

Entry No. 664

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Technologiczne wsparcie komunikacji osób w stanie ograniczonej świadomości

Conference Śpiączka i stany ograniczonej świadomości. Komunikacja niewerbalna – możliwości i najnowsze rozwiązania

Preprint

Number

Volume

Pages 1 - 46

Conference site Bydgoszcz, Polska

Conference date 6.10.2017- 6.10.2017

Notes prezentacja Power Point

Abstract Przedstawiono opracowywane w Politechnice Gdańskiej rozwiązań z zakresu wsparcia technologicznego osób w stanach obniżonej świadomości, takie jak system śledzenia wzroku, oprogramowanie do analizy fal mózgowych otrzymywanych za pomocą kasków elekroencefalograficznych, urządzenia do prądowej stymulacji przezczaszkowej. Zaprezentowano wyniki badań z udziałem osób o obniżonej świadomości, prowadzonych w Niepublicznym Zakładzie Opieki Zdrowotnej EPIMIGREN w Osielsku.

Entry No. 665

Entry type conference paper

Authors A. Czyżewski

English title Study of hearing and emotional state of coma patients based on brainwave analysis

Polish title Badania słuchu i stanu emocjonalnego osób w śpiączce na podstawie analizy fal mózgowych

Conference Ku standardom opieki nad pacjentem w śpiączce", międzynarodowa konferencja zorganizowana przez Fundację „Światło”

Preprint

Number

Volume

Pages 1 - 48

Conference site

Conference date 7.9.2017- 8.9.2017

Notes prezentacja Power Point

Streszczenie Głównym zastosowaniem opracowywanego aktualnie, zintegrowanego systemu do komunikowania się z pacjentami apalicznymi jest obiektywizacja procesu diagnozy stanu świadomości oraz prowadzenie terapii pacjentów zdiagnozowanych jako osoby w stanie wegetatywnym. Stosowanie tego typu rozwiązań jest możliwe w zakładach opiekuńczo-leczniczych, w domach opieki, a także w domach prywatnych, gdzie przebywają osoby doświadczone urazem lub udarem mózgu. Terapia i neurorehabilitacja pacjenta polega w tym przypadku na pobudzaniu funkcji poznawczych oraz na stymulowaniu zmysłów, w szczególności: wzroku i słuchu. Dodatkową badaną funkcjonalnością jest możliwość komunikowania się pacjenta z otoczeniem poprzez wybór gotowych poleceń wyświetlanych na ekranie, bądź obsługę wirtualnej klawiatury. Innowacyjne rozwiązanie w toku opracowania dotyczy integracji kilku różnych technologii: śledzenia wzroku, badania słuchu metodami obiektywnymi oraz analizy bioelektrycznej aktywności mózgu pacjenta. Opracowanymi technologiami są zwłaszcza: metody analizy fal EEG oraz metoda i oprogramowanie do dokonywania diagnozy stanu świadomości pacjentów i prowadzenia ich stymulacji polisensorycznej, które są wykorzystywane zarówno w procesie diagnozy, jak i terapii pacjentów uważanych za osoby w stanie wegetatywnym (apalicznych). Znamienne są wyniki badań słuchu wśród tego rodzaju osób, które wskazują, że większość spośród nich ma zachowane podstawowe funkcje tego zmysłu. Również zastosowanie nowoczesnych kasków elektroencefalograficznych w wielu przypadkach wskazuje na możliwości potencjalnego zwiększenia interakcji z otoczeniem przy wykorzystaniu nowoczesnych technologii. Dalsze poszukiwania w tej dziedzinie wybiegają jednak jeszcze dalej, w kierunku stymulowania mózgu z pomocą implantowanych (wszczepionych) elektrod, porozumiewania się za pomocą dźwięków muzycznych, a nawet umożliwienia interakcji chorego cyberprzestrzenią i Internetem Rzeczy.

Entry No. 666

Entry type conference paper

Authors A. Rojczyk, T. Ciszewski, G. Szwoch, A. Czyżewski

English title Using Facial Motion Capture technology in second-language speech

Polish title Korzystanie z technologii Face-Motion Capture do akwizycji mowy artykułowanej w drugim języku

Conference 11th International Conference on Native and Non-native Accents of English Accents

Preprint

Number

Volume

Pages 1 - 16

Conference site Gdynia, Polska

Conference date 30.11.2017- 2.12.2017

Notes prezentacja Power Point

Abstract Vowel classification from visual cues are discussed. Prospects for using FMC in second-language speech are presented on the basis of experimental results employing 6 Vicon Vero cameras to register markers.

Streszczenie Omówiono klasyfikację wizualną alofonów mowy angielskiej na podstawie analizy trajektorii markerów w systemie przechwytywania ruchów ust. Perspektywy wykorzystania FMC w mowie w drugim języku przedstawiono na podstawie wyników eksperymentalnych wykorzystujących 6 kamer Vicon Vero do rejestracji markerów.

Entry No. 667

Entry type journal paper

Authors A. Czyżewski, B. Kostek, P. Bratoszewski, J. Kotus, M. Szykulski

English title An audio-visual corpus for multimodal automatic speech recognition

Polish title

Journal Journ. of Intelligent Information Systems

Volume

Number

Pages 1 - 27

Bibliographic No. 54

Notes http://dx.doi.org/10.1007/s10844-016-0438-z

Abstract A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight camera accompanied by audio recorded using both: a microphone array and a microphone built in a mobile computer. For the purpose of applications related to AVSR systems training, every utterance was manually labeled, resulting in label files added to the corpus repository. Owing to the inclusion of recordings made in noisy conditions the elaborated corpus can also be used for testing robustness of speech recognition systems in the presence of acoustic background noise. The process of building the corpus, including the recording, labeling and post-processing phases is described in the paper. Results achieved with the developed audio-visual automatic speech recognition (ASR) engine trained and tested with the material contained in the corpus are presented and discussed together with comparative test results employing a state-of-the-art/commercial ASR engine. In order to demonstrate the practical use of the corpus it is made available for the public use.

Entry No. 668

Entry type conference paper

Authors A. Kurowski, P. Odya, P. Szczuko, M. Lech, P. Spaleniak, B. Kostek, A. Czyżewski

English title Multimodal System for Diagnosis and Polysensory Stimulation of Subjects with Communication Disorders

Polish title

Conference 23rd International Symposium on Methodologies for Intelligent Systems ISMIS 2017

Preprint

Number

Volume

Pages 47 - 56

Conference site Warszawa, Polska

Conference date 26.6.2017- 29.6.2017

Abstract An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their immersion in a virtual environment. Data analysis methods are described, and experiments associated with classification of mental states during listening exercises as well as audio-visual stimuli are presented and discussed. Feature extraction was based on discrete wavelet transformation and clustering employing the k-means algorithm was designed. All algorithms were implemented in the Python programming language with the use of Open Source libraries. Tests of the proposed system were performed in a Special School and Educational Center in Koś-cierzyna, Poland. Results and comparison with data gathered from the control group of healthy people are presented and discussed.

Entry No. 669

Entry type book

Authors A. Kwiatkowska, A. Czyżewski

English title

Polish title Komputerowe oko świadomości

Editor Akademicka Oficyna Wydawnicza EXIT

Pages 1 - 239

Streszczenie Znane metody badania osób w śpiączce nie dają odpowiedzi na pytanie jak funkcjonuje poznawczo osoba wybudzona ze śpiączki z obniżoną świadomością. Książka podsumowuje wyniki badań przybliżających odpowiedź na powyższe pytanie. Część prezentowanych badań była prowadzona przez autorów i współpracowników już wcześniej, z wykorzystaniem skonstruowanego urządzenia do śledzenia wzroku, zaś nowsze prezentowane badania dostarczyły wyników w zakresie: zdolności czytania i pisania osób w śpiączce, obiektywnych badań słuchu, analizy sygnałów elektroencefalograficznych i in., uzyskanych w efekcie rozwijania technologii komunikowania się osób z komputerami w ramach realizacji projektu badawczego finansowanego przez Narodowe Centrum Nauki (DEC-2014/15/B/ST7/04724).

Entry No. 670

Entry type book

Authors A. Kurowski, P. Odya, P. Szczuko, M. Lech, P. Spaleniak, B. Kostek, A. Czyżewski

English title Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders

Polish title

Editor Springer Verlag

Pages 47 - 56

Abstract An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their immersion in a virtual environment. Data analysis methods are described, and experiments associated with classification of mental states during listening exercises as well as audio-visual stimuli are presented and discussed. Feature extraction was based on discrete wavelet transformation and clustering employing the k-means algorithm was designed. All algorithms were implemented in the Python programming language with the use of Open Source libraries. Tests of the proposed system were performed in a Special School and Educational Center in Koś-cierzyna, Poland. Results and comparison with data gathered from the control group of healthy people are presented and discussed.

Entry No. 671

Entry type conference paper

Authors K. Mrozik, A. Kurowski, B. Kostek, A. Czyżewski

English title Comparison of selected electroencephalographic signal classification methods

Polish title

Conference SPA2017 Signal Processing: Algorithms, Architectures, Arrangements, and Application

Preprint

Number

Volume

Pages 34 - 41

Conference site Poznań, Polska

Conference date 20.9.2017- 22.9.2017

Abstract A variety of methods exists for electroencephalographic (EEG) signals classification. In this paper, we briefly review selected methods developed for such a purpose. First, a short description of the EEG signal characteristics is shown. Then, a comparison between the selected EEG signal classification methods, based on the overview of research studies on this topic, is presented. Examples of methods included in the study are: Artificial Neural Networks, Support Vector Machines, Fuzzy or k-Means Clustering. Similarities and differences between all considered methods of an automatic EEG signal classification with a focus on consecutive stages of such a process are reviewed. Examples of EEG classification, considering various types of usage and target applications along with their effectiveness, are also shown.

Entry No. 672

Entry type conference paper

Authors P. Bratoszewski, A. Czyżewski, P. Hoffmann, M. Lech, M. Szczodrak

English title Pilot Testing of Developed Multimodal Biometric Identity Verification System

Polish title

Conference Signal Processing Algorithms, Architectures, Arrangements, and Applications 2017

Preprint

Number

Volume

Pages 184 - 189

Conference site Poznań, Polska

Conference date 20.9.2017- 22.9.2017

Bibliographic No. 12

Abstract The bank client identity verification system developed in the course of the IDENT project is presented. The total number of five biometric modalities including: dynamic signature proofing, voice recognition, face image verification, face contour extraction and hand blood vessels distribution comparison have been developed and studied. The experimental data were acquired employing multiple biometric sensors installed at engineered biometric terminals. The biometric portraits of 125 subjects were registered and stored in the database during the presented pilot study and then verified experimentally. The analysis of FAR and FRR measures obtained for developed biometric applications was made. Problem-specific survey was done on the basis of questionnaires completed by the subjects in order to assess the look and feel of the developed biometric system as well as to collect opinions concerning its implementation in banking outlets. A discussion concerning the quality of registered signals and results achieved in the pilot study is included.

Entry No. 673

Entry type journal paper

Authors P. Szczuko, A. Czyżewski, P. Hoffmann, P. Bratoszewski, M. Lech

English title Validating data acquired with experimental multimodal biometric system installed in bank branches

Polish title

Journal Journ. of Intelligent Information Systems

Volume

Number

Pages 1 - 31

Bibliographic No. 34

Abstract An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment process. The analytical studies and experimental work conducted in the course of this work will lead towards methodologies and solutions of the multimodal biometric technology, which is planned for further development. Before this stage is achieved a study on the data usefulness acquired from a variety of biometric sensors and from survey questionnaires filled in by banking tellers and clients was done. The decision-related sets were approximated by the Rough Set method offering efficient algorithms and tools for finding hidden patterns in data. Prediction of evaluated biometric data quality, based on enrollment samples and on user subjective opinions was made employing the developed method. After an introduction to the principles of applied biometric identity verification methods, the knowledge modelling approach is presented together with achieved results and conclusions.

Entry No. 674

Entry type conference paper

Authors M. Szczodrak, K. Marciniuk, A. Czyżewski

English title Road surface roughness estimation employing integrated position and acceleration sensor

Polish title

Conference SPA 2017

Preprint

Number

Volume

Pages 228 - 232

Conference site Poznań, Polska

Conference date 20.9.2017- 22.9.2017

Notes ISBN-13 978-83-62065-28-8

Abstract Assessment of a surface quality being an essential task for the authorities supervising the roads provides the subject of the paper. Information about riding quality of a pavement, important for drivers, both in terms of their comfort and safety is collected during experiments employing mobile sensors. The paper describes the use of a miniature position and acceleration sensor for evaluation of the roughness of the road surface. The device designed for the installation in vehicles includes a built-in multi-axis accelerometer and a GPS receiver. Measurement data were collected on the basis of road trip records with regards to diversified roughness of the surface and to varied vehicle speed on each investigated road section. Data were gathered for various vehicle body types and then attempts were made to the classification of the road surface based on the created algorithm.

Entry No. 675

Entry type book

Authors K. Marciniuk, B. Kostek, A. Czyżewski

English title Traffic Noise Analysis Applied to Automatic Vehicle Counting and Classification

Polish title

Editor Springer, Multimedia Communications, Services and Security, MCSS 2017

Pages 110 - 123

Notes Best paper

Abstract Problems related to determining traffic noise characteristics are discussed in the context of automatic dynamic noise analysis based on noise level measurements and traffic prediction models. The obtained analytical results provide the second goal of the study, namely automatic vehicle counting and classification. Several traffic prediction models are presented and compared to the results of in-situ noise level measurements. Synchronized audio recordings were made to determine Sound Quality parameters describing the nature of acquired sound signals. Video recordings and information about the traffic structure using commercially available automatic vehicle detection methods were also collected in order to create ground truth data used for the experiments.

Entry No. 676

Entry type conference paper

Authors P. Szczuko, M. Lech, A. Czyżewski

English title Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals

Polish title

Conference 23rd International Symposium on Methodologies for Intelligent Systems (ISMIS)

Preprint

Number

Volume

Pages 1 - 10

Conference site Warszawa, Polska

Conference date 26.6.2017- 29.6.2017

Abstract A method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classifications approaches. A comparison of achieved classifiers accuracy is presented in the paper, and then conclusions and discus-sion are provided. Among applied algorithms the highest accuracy was achieved with: Rough Set, SVM and ANN methods.

Entry No. 677

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski

English title Evaluation of Face Detection Algorithms for the Bank Client Identity Verification

Polish title

Journal Foundations of Computing and Decision Sciences

Volume 42

Number 2

Pages 137 - 148

Abstract Results of investigation of face detection algorithms efficiency in the banking client visual verification system are presented. The video recordings were made in real conditions met in three bank operating outlets employing a miniature industrial USB camera. The aim of the experiments was to check the practical usability of the face detection method in the biometric bank client verification system. The main assumption was to provide a simplified as much as possible user interaction with the application. Applied algorithms for face detection are described and achieved results of face detection in the real bank environment conditions are presented. Practical limitations of the application based on encountered problems are discussed.

Entry No. 678

Entry type journal paper

Authors D. Jachimski, A. Czyżewski, T. Ciszewski

English title A comparative study of English viseme recognition methods and algorithms

Polish title Studium porównawcze metod rozpoznawania wizemów angielskich - metody i algorytmy

Journal Multmedia Tools and Applications

Volume

Number

Pages 1 - 38

Abstract An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction and an appropriate selection of the classifier were sought. The experimental research was conducted on the basis of a spoken corpus in which speech was represented both acoustically and visually. The extracted features represented three types: geometrical, textural and mixed ones. The features were processed employing the classification algorithms based on Hidden Markov Models and Sequential Minimal Optimization. Tests were carried out employing the processed video material recorded with English native speakers who read specially prepared list of commands. The obtained results are discussed in the paper.

Streszczenie Elementarna jednostka wizualna wizem - była przedmiotem przygotowania wektora cech komponentu wizyjnego systemu rozpoznawania.mowy audiowizualnej. Celem prezentowanych badań jest przegląd różnych podejść do problemu, implementacja algorytmów zaproponowanych w literaturze i porównawcze badanie ich skuteczności. Eksperymentalne badania przeprowadzono na podstawie korpusu mowy, w którym mowa była reprezentowana zarówno akustycznie, jak i wizualnie. Wyodrębnione cechy reprezentowały trzy rodzaje parametrów: geometryczne, teksturalne i mieszane. Wyekstrahowane cechy zostały przetworzone przy użyciu algorytmów klasyfikacji opartych na ukrytych modelach Markowa i sekwencyjnej optymalizacji. Testy zostały przeprowadzone przy użyciu przetworzonego materiału wideo zarejestrowanego przez natywnych mówców języka angielskiego którzy czytali specjalnie przygotowaną listę poleceń. Uzyskane wyniki omówiono w artkule.

Entry No. 679

Entry type conference paper

Authors A. Kurowski, K. Mrozik, B. Kostek, A. Czyżewski

English title Automatic Clustering of EEG-Based Data Associated with Brain Activity

Polish title

Conference The 11th edition of International Conference on Multimedia & Network Information Systems MISSI 2018

Preprint

Number

Volume

Pages 470 - 479

Conference site Wrocław, Polska

Conference date 12.9.2018- 14.9.2018

Abstract The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain. Next, the parameterization process is performed in two ways, i.e. by applying Discrete Wavelet Transform and utilizing an autoencoder network. The resulting sets of parameters are then used for the data clustering and the effectiveness of correct assignment of data into adequate clusters is checked. It occurs that the performance of wavelets- and autoencoders-based parametrization is similar, however in several cases, autoencoders allowed for obtaining a higher mean distance and lower standard deviation than distances provided by the wavelet-based method. Moreover, a supervised classification of signals is performed as a form of benchmarking.

Entry No. 680

Entry type conference paper

Authors P. Odya, A. Czyżewski, A. Sroczyński, B. Kostek

English title A Device for Measuring Auditory Brainstem Responses to Audio

Polish title Urządzenie do pomiarów słuchowych potencjałów wywołanych pnia mózgu za pomocą sygnałów fonicznych

Conference 145th AES Convention

Preprint

Number 485

Volume

Pages

Conference site Nowy Jork, USA

Conference date 17.10.2018- 20.10.2018

Abstract Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds are processed in the hearing system including brain. The paper contains technical details related to the design of the device, including its hardware and software parts. The test results that have been carried out to verify the operation of the device are also described.

Entry No. 681

Entry type conference paper

Authors A. Czyżewski, J. Kotus, P. Odya, P. Szczuko, K. Jamróz, M. Szczodrak

English title Project INZNAK: intelligent road signs for adaptive traffic control, communicating in V2X technology

Polish title

Conference 12th International Road Safety Conference GAMBIT 2018 Road Innovations for Safety National and regional perspective

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 12.4.2018- 13.4.2018

Notes autorzy c.d.: J. Oskarbski, A. Sroczyński, J. Mioduski, T. Śmiałkowski

Abstract The intelligent road sign being developed within the project will communicate the speed calculated on the basis of the information received from the similar signs placed along the section of highway, connected with each other through wireless network, optionally using remote control. Its development requires focus on a number of research and technological issues.

Entry No. 682

Entry type conference paper

Authors D. Grabowski, M. Szczodrak, A. Czyżewski

English title

Polish title System informowania o stanie nawierzchni dróg z wykorzystaniem metod cyfrowego przetwarzania obrazów

Conference XI POLSKI KONGRES ITS

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 22.5.2018- 23.5.2018

Streszczenie Informacja o stanie nawierzchni drogi jest niezwykle istotna dla kierowców, zarówno z punktu widzenia komfortu jak i bezpieczeństwa jazdy, a jej ocena jest zadaniem organów nadzorujących drogi. W referacie przedstawiono koncepcję oraz wynik działania opracowanego systemu, który służy do wykrywania typowych uszkodzeń nawierzchni drogi, wykorzystując sygnały wizyjne. Urządzeniem rejestrującym stan nawierzchni są niewielkich rozmiarów kamera RGB oraz kamera 3D zamontowane na pojeździe. Lokalną jednostkę wykonującą wstępne przetwarzanie obrazu stanowi miniaturowy komputer. Klasyfikacja obrazów odbywa się na jednostce zdalnej, o większej mocy obliczeniowej, z zastosowaniem wytrenowanej wcześniej sieci neuronowej. System prezentacji danych o stanie nawierzchni udostępniono poprzez stronę internetową. Przeprowadzono eksperymenty, podczas których zebrano dane pomiarowe z kilkudziesięciu przejazdów ustaloną trasą o urozmaiconym stanie nawierzchni. Wyniki klasyfikacji odzwierciedlają zadowalającą skuteczność działania opracowanego systemu.

Entry No. 683

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Effective vehicle detection from various camera locations

Polish title

Conference PL in ML

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 14.12.2018- 17.12.2018

Notes Sesja plakatowa

Entry No. 684

Entry type conference paper

Authors K. Marciniuk, M. Szczodrak, A. Czyżewski

English title An application of acoustic sensors for the monitoring of road traffic

Polish title

Conference SPA 2018, Signal Processing Algorithms, Architectures, Arrangements and Applications

Preprint

Number

Volume

Pages 208 - 212

Conference site Poznan, Polska

Conference date 19.9.2018- 21.9.2018

Notes ISBN: 978-83-62065-31-8, https://ieeexplore.ieee.org/document/8563406

Abstract Assessment of road traffic parameters for the developed intelligent speed limit setting decision system constitutes the subject addressed in the paper. Current traffic conditions providing vital data source for the calculation of the locally fitted speed limits are assessed employing an economical embedded platform placed at the roadside. The use of the developed platform employing a low-powered processing unit with a set of microphones, an accelerometer and some other sensors, for the estimation of the essential road traffic parameters is presented in the paper. Acoustical signal processing-based vehicle counting attempts were made, and an acceleration sensor was used in order to detect the heavy vehicles pass-bys. Obtained results based on the measurements were discussed in the paper. Evaluation of the proposed methods is provided.

Entry No. 685

Entry type journal paper

Authors A. Czyżewski, B. Kostek

English title Marianna Sankiewicz and Gustaw Budzynski (1921-2018) Obituary

Polish title Wspomnienie o działalności naukowej i organizacyjnej prof. M. Sankiewicz i G. Budzyńskim

Journal J. Audio Eng. Soc.

Volume 66

Number 7-8

Pages 644 - 644

Abstract Organizing the Polish AES Section became real during the session of the National Sound Engineering Symposium held at the Gdansk Technical University in 1991. AES President Tim Shelton came to the Gdansk Symposium and inaugurated the activities of the newly founded PS-AES.

Streszczenie Przedmiotem artykułu są wspomnienia o fundatorach Polskiej Sekcji towarzystwa naukowego Audio Engineering Society.

Entry No. 686

Entry type report

Authors M. Szczodrak, J. Kotus, G. Szwoch, S. Cygert, P. Szczuko, A. Czyżewski

English title

Polish title Projekt demonstratora technologicznego - warstwa konstrukcyjna

Report Number POIR-INUSER/03/08/18

Notes c.d. autorzy: P. Sokołowski, D. Weber, Z. Rutka

Streszczenie Raport zawiera projekt dotyczący warstwy konstrukcyjnej wielomodalnego demonstratora technologicznego do celu wizyjnego i wibroakustycznego monitorowania turbin wiatrowych. Przedstawiono szczegóły techniczne elementów tworzących demonstrator. Wskazano rolę i cel stosowania poszczególnych elementów. Projekt demonstratora sporządzono w oparciu o koncepcje analizatora przedstawione w raporcie POIR-INUSER/02/08/2018. Na etapie koncepcji wyróżniono dwa różne typy projektowanego analizatora: zewnętrzny i wewnętrzny.

Entry No. 687

Entry type journal paper

Authors M. Szczodrak, D. Grabowski, A. Czyżewski

English title Employing economical methods for pavement defects estimation

Polish title

Journal MATEC Web Conf.

Volume 231

Number

Pages 1 - 8

Abstract It is a common practise that measurements of road surface conditions are made using professional and expensive apparatus. Typically a van or a truck equipped with a set of professional sensors i.e. laser scanners of surface is used, therefore the measurement update period is often quite long. Two alternative low-cost methods for estimating road pavement defects and failures were proposed and investigated by the authors. The first one is based on accelerometers application and the other one employs image analysis acquired by cameras installed on a vehicle.

Entry No. 688

Entry type book

Authors G. Korvel, A. Kurowski, B. Kostek, A. Czyżewski

English title Speech analytics based on machine learning

Polish title Analiza sygnału mowy za pomocą uczenia głębokiego

Editor Springer International Publishing AG, part of Springer Nature, tytuł książki: Intelligent Systems Reference Library, vol. 149

Pages 129 - 157

Notes rozdział w książce

Abstract In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information retrieval (MIR) domain. Then, phoneme classification beyond the typically used techniques is extended towards exploring Deep Neural Networks (DNNs). This is done by combining Convolutional Neural Networks (CNNs) with audio data converted to the time-frequency space domain (i.e. spectrograms) and then exported as images. In this way a two-dimensional representation of speech feature space is employed. When preparing the phoneme dataset for CNNs, zero padding and interpolation techniques are used. The obtained results show an improvement in classification accuracy in the case of allophones of the phoneme /l/, when CNNs coupled with spectrogram representation are employed. Contrarily, in the case of vowel classification, the results are better for the approach based on pre-selected features and a conventional machine learning algorithm.

Streszczenie Celem badań było wykorzystanie uczenia głębokiego do analizy alofonów i fonemów. W ekstrakcji cech użyto wybrane parametry stosowane w automatycznym wyszukiwaniu muzyki (Music Information Retrieval - MIR). W klasyfikacji wykorzystano zarówno typowe algorytmy uczące, jak również sieci splotowe. Większa efektywność została uzyskana w przypadku zastosowania spektrogramów (jako cech sygnału mowy) oraz uczenia głębokiego.

Entry No. 689

Entry type conference paper

Authors K. Lisowski, A. Czyżewski

English title Modelling of Objects Behaviour for Their Re-identification in Multi-camera Surveillance System Employing Particle Filters and Flow Graphs

Polish title

Conference 10th International Conference, IP&C’2018 Bydgoszcz, Poland, November 2018

Preprint

Number

Volume 10

Pages 79 - 86

Conference site Bydgoszcz, Springer, Polska

Conference date 14.11.2018- 16.11.2018

Notes https://link.springer.com/chapter/10.1007%2F978-3-030-03658-4_10

Entry No. 690

Entry type journal paper

Authors A. Czyżewski, B. Kostek

English title In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering

Polish title Wspomnienie o prof. M. Sankiewicz i G. Budzyńskim

Journal Archives of Acoustics

Volume 43

Number 3

Pages 353 - 355

Abstract Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.

Streszczenie Przedmiotem artykułu jest wspomnienie dorobku prof. M. Sankiewicz i G. Budzyńskiego, którzy byli fundatorami kierunku inżyniera dźwięku w Polsce.

Entry No. 691

Entry type conference paper

Authors A. Czyżewski

English title New applications of sound and vision engineering

Polish title

Conference Proceedings of the 11th International Conference MISSI 2018

Preprint

Number

Volume 833

Pages 7 - 9

Conference site Wrocław, Springer Cham, Polska

Conference date 12.9.2018- 14.9.2018

Entry No. 692

Entry type journal paper

Authors A. Rojczyk, T. Ciszewski, G. Szwoch, A. Czyżewski

English title Visual perception of vowels from static and dynamic cues

Polish title

Journal J. Acoust. Soc. Amer.

Volume 143

Number EL328

Pages 328 - 332

Entry No. 693

Entry type journal paper

Authors A. Czyżewski, S. Zaporowski, B. Kostek

English title Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Polish title

Journal J. Acoust. Soc. Amer.

Volume 144

Number 3

Pages 1801 - 1802

Notes ASA Meeting, Vancouver 5-9. 11. 2018

Entry No. 694

Entry type journal paper

Authors D. Jachimski, A. Czyżewski

English title A comparative study of English viseme recognition methods and algorithms

Polish title

Journal Multimedia Tools and Applications

Volume 77

Number 13

Pages 16495 - 16532

Notes first on-line 07 October 2017

Entry No. 695

Entry type journal paper

Authors A. Czyżewski, M. Szczodrak

English title Road pavement defect assessment through vibration analysis inside vehicles

Polish title

Journal J. Acoust. Soc. Amer.

Volume 144

Number 3

Pages 1796 - 1796

Abstract Miniature accelerometer sensors were used for the evaluation of road surface roughness. The device designed for installation in the vehicles is composed of a GPS receiver and of multi-axis accelerometers. Smartphones with built-in accelerometers were also used. Measurement data were collected through the recording of road trips employing 3 car types on diversified surface roughness roads and with varied vehicle speed on each investigated road section. The first step of data processing was the sensor alignment made to achieve proper values of acceleration vector. Subsequently, the influence of the car suspension system to the measurement results was diminished employing a designed filter. The magnitude of coefficients of Gabor transform analysis of accelerometer signals was calculated to discover differences between good and damaged surface. Research results show that road sections quality can be assessed by the applied vibration analysis. Although the precision of low-cost devices may be lower than the application of expensive professional laser profilograph scanning on road pavements, they can help to increase the effectiveness and the coverage of road surface damage detection, through the monitoring of road surfaces with typical cars instead of special test vehicles only.

Entry No. 696

Entry type conference paper

Authors M. Piotrowska, G. Korvel, A. Kurowski, B. Kostek, A. Czyżewski

English title Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

Polish title

Conference 145 Audio Engineering Society Convention

Preprint 10070

Number

Volume

Pages

Conference site New York, USA

Conference date 17.10.2018- 20.10.2018

Abstract The purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and used as input to train the Convolutional Neural Networks. Various settings of the spectral representation are analyzed to determine adequate option for the allophone classification. Then, testing is performed on the basis of non-native speakers’ utterances. The same approach is repeated employing learning algorithm but based on feature vectors. The archived classification results are promising as high accuracy is observed.

Entry No. 697

Entry type conference paper

Authors M. Piotrowska, G. Korvel, B. Kostek, A. Rojczyk, A. Czyżewski

English title Objectivization of phonological evaluation of speech elements by means of audio parametrization

Polish title

Conference 2018 11th International Conference on Human System Interaction (HSI)

Preprint

Number

Volume

Pages 325 - 331

Conference site Gdańsk, Polska

Conference date 4.7.2018- 6.7.2018

Notes Proc. w WoS

Abstract This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal and prefortis clipping. A set of audio features based on mechanism of each phonological process was created. Recordings of phonetic material prepared by phonology expert were executed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted in partitioning by editing two recorded sets of words into allophones, then signals were analyzed and subsequently audio excerpts were parametrized. The comparison of two sets of allophones was reinforced by the phonology expert’s assessment of produced speech sounds. Analyses presented in this paper allowed for discovering a set of parameters, which enable to determine whether the target processes were pronounced correctly.

Entry No. 698

Entry type book

Authors P. Szczuko, M. Lech, A. Czyżewski

English title Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals

Polish title

Editor Springer

Pages 247 - 257

Notes Rozdział w książce "Intelligent Methods and Big Data in Industrial Applications. Studies in Big Data, vol 40."

Abstract A method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classifications approaches. A comparison of achieved classifiers accuracy is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the highest accuracy was achieved with: Rough Set, SVM and ANN methods.

Entry No. 699

Entry type journal paper

Authors P. Hoffmann, A. Czyżewski, P. Szczuko, A. Kurowski, M. Lech, M. Szczodrak

English title Analysis of results of large-scale multimodal biometric identity verification experiment

Polish title

Journal IET Biometrics

Volume

Number

Pages 1 - 12

Streszczenie An analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification methods are described in the paper. Then, relationships between ratios of successful and failed verifications between pairs, triplets, and quartets of biometric modalities are studied. An analysis of the sentiment of clients and of banking tellers related to each identity verification attempt was performed based on linguistic methods. The data mining process is described, based on the rough sets methodology, aimed at deriving rules pertaining to consecutive identity verification attempts.

Entry No. 700

Entry type conference paper

Authors M. Szczodrak, D. Grabowski, A. Czyżewski

English title Employing economical methods for pavement defects estimation

Polish title

Conference 12th International Road Safety Conference GAMBIT 2018

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 12.4.2018- 13.4.2018

Abstract Public roads management authorities are required by law regulations to detect and to assess road pavement defects and failures. High costs of carrying out road surface condition measurements reduce their frequency, thus extending the time needed to reveal damages requiring repair. Very high prices of the equipment used by road managers for this purpose have a significant impact on the costs and on the scale of this kind of measurements.

Entry No. 701

Entry type journal paper

Authors S. Cygert, A Czyżewski

English title Eulerian motion magnification applied to structural health monitoring of wind turbines

Polish title

Journal J. Acoust. Soc. Amer.

Volume 144

Number 1796

Pages 1796 - 1796

Abstract Several types of defects may occur in wind turbines, as physical damage of blades or gearbox malfunction. A wind farm monitoring and damage prediction system is built to observe abnormal vibrations of elements of wind turbine: blades, nacelle, and tower. Contactless methods are developed which do not require turbine stopping. In this work, structural health monitoring of a wind turbine is evaluated using a conversion from the captured and processed video to the acoustic signal, employing the method of Eulerian motion magnification in video. It was assumed that this task can be achieved using a stabilized high-speed video camera only, directed at the wind turbine without any additional sensors mounted on windmill blades or on its body. Moreover, the developed vector sound intensity probe was used for spatial measurements in order to recover the vibration modes of wind turbines. Finally, statistical methods were applied to the processing of computed features reflecting vibrations in order to determine wind turbine technical condition. The developed method was evaluated empirically in real wind farm

Entry No. 702

Entry type conference paper

Authors A. Czyżewski, M. Szczodrak

English title Road pavement defect assessment through vibration analysis inside vehicles

Polish title

Conference 176th Meeting of the Acoustical Society of America and 2018 Acoustics Week in Canada

Preprint

Number

Volume

Pages

Conference site Victoria, Kanada

Conference date 5.11.2018- 9.11.2018

Abstract Miniature accelerometer sensors were used for the evaluation of road surface roughness. The device designed for installation in the vehicles is composed of a GPS receiver and of multi-axis accelerometers. Smartphones with built-in accelerometers were also used. Measurement data were collected through the recording of road trips employing 3 car types on diversified surface roughness roads and with varied vehicle speed on each investigated road section. The first step of data processing was the sensor alignment made to achieve proper values of acceleration vector. Subsequently, the influence of the car suspension system to the measurement results was diminished employing a designed filter. The magnitude of coefficients of Gabor transform analysis of accelerometer signals was calculated to discover differences between good and damaged surface. Research results show that road sections quality can be assessed by the applied vibration analysis. Although the precision of low-cost devices may be lower than the application of expensive professional laser profilograph scanning on road pavements, they can help to increase the effectiveness and the coverage of road surface damage detection, through the monitoring of road surfaces with typical cars instead of special test vehicles only.

Entry No. 703

Entry type book

Authors P. Szczuko, M. Lech, A. Czyżewski

English title Comparison of Classification Methods for EEG Signals of Real and Imaginary Motion

Polish title

Editor Springer

Pages 227 - 239

Notes Rozdział w książce "Advances in Feature Selection for Data and Pattern Recognition"

Abstract The classification of EEG signals provides an important element of brain-computer interface (BCI) applications, underlying an efficient interaction between a human and a computer application. The BCI applications can be especially useful for people with disabilities. Numerous experiments aim at recognition of motion intent of left or right hand being useful for locked-in-state or paralyzed subjects in controlling computer applications. The chapter presents an experimental study of several methods for real motion and motion intent classification (rest/upper/lower limbs motion, and rest/left/right hand motion). First, our approach to EEG recordings segmentation and feature extraction is presented. Then, 5 classifiers (Naïve Bayes, Decision Trees, Random Forest, Nearest-Neighbors NNge, Rough Set classifier) are trained and tested using examples from an open database. Feature subsets are selected for consecutive classification experiments, reducing the number of required EEG electrodes. Methods comparison and obtained results are presented, and a study of features feeding the classifiers is provided. Differences among participating subjects and accuracies for real and imaginary motion are discussed. It is shown that though classification accuracy varies from person to person, it could exceed 80% for some classifiers.

Entry No. 704

Entry type conference paper

Authors S. Cygert, G. Szwoch, S. Zaporowski, A. Czyżewski

English title Vocalic Segments Classification Assisted by Mouth Motion Capture

Polish title

Conference 2018 11th International Conference on Human System Interaction (HSI)

Preprint

Number

Volume

Pages 318 - 324

Conference site Gdańsk, Polska

Conference date 4.7.2018- 4.7.2018

Notes https://www.scopus.com/record/display.uri?eid=2-s2.0-85052748795&origin=inward&txGid=8a4749eac63728adc5c1ce3f8c3042c6

Abstract Visual features convey important information for automatic speech recognition (ASR), especially in noisy environment. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lips tracking data during speaking. Different parameterizations strategies were tested and the accuracy of phonemes recognition in different experiments was analyzed. The obtained results and further challenges related to the bi-modal feature extraction process and decision systems employment are discussed.

Entry No. 705

Entry type journal paper

Authors Sz Zaporowski, S. Cygert, G. Szwoch, G. Korvel, A. Czyżewski

English title

Polish title REJESTRACJA, PARAMETRYZACJA I KLASYFIKACJA ALOFONÓW Z WYKORZYSTANIEM BIMODALNOŚCI

Journal Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej

Volume

Number 58

Pages

Streszczenie Praca dotyczy rejestracji i parametryzacji alofonów w języku angielskim z wykorzystaniem dwóch modalności. W badaniach dokonano rejestracji wypowiedzi w języku angielskim mówców, których znajomość tego języka odpowiada poziomowi rodowitego mówcy. W kolejnym etapie wyodrębnione zostały alofony z nagrań fonicznych i odpowiadające im sygnały wizyjne. W procesie tworzenia wektorów cech wykorzystano odrębne systemy parametryzacji, osobne dla każdej modalności. Do parametryzacji sygnału fonicznego użyto typowych deskryptorów stosowanych w obszarze rozpoznawania mowy i muzyki. W nagraniach z systemu przechwytywania ruchu zaproponowano własne rozwiązania. Do klasyfikacji alofonów wykorzystano sieci neuronowe oraz maszynę wektorów nośnych w podejściu jedno- i dwumodalnym. Stwierdzono, że skuteczność rozpoznawania wzrasta wraz z wykorzystaniem więcej niż jednej modalności.

Entry No. 706

Entry type conference paper

Authors Sz Zaporowski, A. Czyżewski

English title Selection of Features for Multimodal Vocalic Segments Classification

Polish title

Conference 11th International Conference on Multimedia & Network Information Systems

Preprint

Number

Volume

Pages

Conference site Wrocław, Polska

Conference date 12.9.2018- 14.9.2018

Abstract English speech recognition experiments are presented employing both: audio signal and Facial Motion Capture (FMC) recordings. The principal aim of the study was to evaluate the inﬂuence of feature vector dimension reduction for the accuracy of vocalic segments classiﬁcation employing neural networks. Several parameter reduction strategies were adopted, namely: Extremely Randomized Trees, Principal Component Analysis and Recursive Parameter Elimination. The feature extraction process is explained, applied feature selection methods are presented and obtained results are discussed

Entry No. 707

Entry type journal paper

Authors A. Czyżewski, P. Bratoszewski, P. Hoffmann, A. Kurowski, M. Lech, M. Szczodrak

English title Performance Analysis of Developed Multimodal Biometric Identity Verification System

Polish title

Journal Elektronika : konstrukcje, technologie, zastosowania

Volume 4

Number

Pages 37 - 44

Abstract The bank client identity verification system developed in the course of the IDENT project is presented. The total number of five biometric modalities including: dynamic handwritten signature proofing, voice recognition, face image verification, face contour extraction and hand blood vessels distribution comparison have been developed and studied. The experimental data were acquired employing multiple biometric sensors installed at engineered biometric terminals. The biometric portraits of more than 10 000 bank clients were registered and stored in the database during the presented study and then verified experimentally. Problem- specific survey was done on the basis of questionnaires completed by the subjects in order to assess the look and feel of the developed biometric system as well as to collect opinions concerning its implementation in banking outlets. A discussion concerning the quality of registered data and results achieved in the study is included.

Streszczenie W artykule przedstawiono system weryfikacji tożsamości klienta bankowego opracowany w ramach projektu IDENT. Opracowano i przebadano pięć metod biometrycznych, w tym: rozpoznawanie dynamicznej reprezentacji podpisu odręcznego, weryfikację głosową, weryfikację obrazu twarzy, rozpoznawanie ekstrahonego konturu twarzy i porównywanie rozkładu naczyń krwionośnych dłoni. Przedstawione w artykule dane badawcze pozyskano za pomocą wielu czujników biometrycznych zainstalowanych w skonstruowanych stanowiskach biometrycznych. Łącznie z wykorzystaniem skonstruowanych stanowisk zarejestrowano próbki biometryczne pochodzące od ponad 10 000 klientów banku. W trakcie badania uczestnicy, tzn. klienci i doradcy bankowi byli proszeni o wypełnienie ankiet w celu ułatwienia oceny wyglądu i sposobu działania opracowanego systemu biometrycznego oraz zebrania opinii na temat jego przyszłego wdrożenia w placówkach bankowych. W artykule przedstawiono wyniki analiz zgromadzonych danych, z uwzględnieniem wzajemnej korelacji poszczególnych modalności oraz semantycznej analizy ankiet wypełnionych przez uczestników badania.

Entry No. 708

Entry type journal paper

Authors P. Szczuko, A. Czyżewski, M. Szczodrak

English title Variable length sliding models for banking clients face biometry

Polish title

Journal MULTIMEDIA TOOLS AND APPLICATIONS

Volume

Number

Pages 1 - 18

Abstract An experiment was organized in 100 bank branches to acquire biometric samples from nearly 5000 clients including face images. A procedure for creating face verification models based on continuously expanding database of biometric samples is proposed, implemented, and tested. The presented model applies to circumstances where it is possible to collect and to take into account new biometric samples after each positive verification of the user. Thus the model can evolve in time, and it can follow changes of user face characteristics, e.g. changes in complexion, variable amount of facial hair, arriving wrinkles, cheeks chubbiness appearance, etc., introduced as effects of changing lifestyle, sunbathing, gaining weight, aging or other processes. The variable length sliding models derived from the gathered experimental data are described in the paper.

Entry No. 709

Entry type conference paper

Authors A. Czyżewski, A. Sroczyński, T. Śmiałkowski, P. Hoffmann

English title Development of Intelligent Road Signs with V2X Interface for Adaptive Traffic Controlling

Polish title

Conference 6th International Conference on Models and Technologies for Intelligent Tranportation Systems MT-ITS 2019

Preprint

Number

Volume

Pages

Conference site Kraków, Polska

Conference date 5.6.2019- 7.6.2019

Notes nr rekordu w MojaPG 150848

Abstract The objective of this paper is to present a practical project of intelligent road signs, under which a series of new products for the regulation of traffic is being created. The engineering part of the project, described in this paper, was preceded by a series of experimental studies, the results of which were described in another paper accepted for publication at the MTS-ITS conference 2019, entitled "Comparative study on the effectiveness of various types road traffic detectors". A new kind of intelligent road signs which will enable the prevention of the most common collisions on highways, resulting from the rapid stacking of vehicles resulting most often from accidental heavy braking. A range of products is being developed, including intelligent road signs: standing, hanging and mobile ones, displaying dynamically updated driving the speed limit, determined automatically, through an embedded electronic module, enabling multimodal measurement of traffic conditions. Solving a number of research and construction problems, such as: effective and independent of weather conditions traffic monitoring based on simultaneous analysis of several types of data representation, development of a method of calculating gradients and histograms of vehicle speed for various types of road situations or traffic topologies. Moreover, creating a platform for self-organizing reliable wireless connections among road signs equipped with innovative displays and power supplies and carrying out prototype tests are carried out. As a result, advanced conceptually products for increasing road safety for which there is a market demand are being prepared for future implementation.

Entry No. 710

Entry type book

Authors Sz Zaporowski, A. Czyżewski

English title Multimedia and Network Information Systems : Proceedings of the 11th International Conference MISSI 2018

Polish title

Editor Springer

Pages 490 - 500

Streszczenie English speech recognition experiments are presented employing both: audio signal and Facial Motion Capture (FMC) recordings. The principal aim of the study was to evaluate the inﬂuence of feature vector dimension reduction for the accuracy of vocalic segments classiﬁcation employing neural networks. Several parameter reduction strategies were adopted, namely: Extremely Randomized Trees, Principal Component Analysis and Recursive Parameter Elimination. The feature extraction process is explained, applied feature selection methods are presented and obtained results are discussed

Entry No. 711

Entry type conference paper

Authors S. Cygert, A. Czyżewski, M. Stefaniak, B. Kostek

English title Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification

Polish title

Conference 146th Audio Engineering Society Convention

Preprint

Number

Volume

Pages

Conference site Dublin, Irlandia

Conference date 20.3.2019- 23.3.2019

Abstract The recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals can be analyzed in a way similar to accelerometer signals, employing spectral analysis. They can be also played back through headphones and compared with sounds recorded by microphones.

Entry No. 712

Entry type book

Authors A. Kurowski, K. Mrozik, B. Kostek, A. Czyżewski

English title Automatic Clustering of EEG-Based Data Associated with Brain Activity

Polish title

Editor Springer

Pages 470 - 479

Abstract The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain. Next, the parameterization process is performed in two ways, i.e. by applying Discrete Wavelet Transform and utilizing an autoencoder network. The resulting sets of parameters are then used for the data clustering and the effectiveness of correct assignment of data into adequate clusters is checked. It occurs that the performance of wavelets- and autoencoders-based parametrization is similar, however in several cases, autoencoders allowed for obtaining a higher mean distance and lower standard deviation than distances provided by the wavelet-based method. Moreover, a supervised classification of signals is performed as a form of benchmarking.

Entry No. 713

Entry type journal paper

Authors M. Piotrowska, G. Korvel, B. Kostek, T. Ciszewski, A. Czyżewski

English title MACHINE LEARNING-BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Polish title

Journal Int. J. Appl. Math. Comput. Sci.

Volume

Number

Pages

Notes publikacja w 2019

Abstract Automatic classification methods, such as Artificial Neural Networks (ANNs), k-Nearest Neighbor (KNN) and Self-Organizing Maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally- and/or contextually-conditioned allophones. For each word a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers and pgonologo experts were selected. For the purpose of the present study a sub-list of 103 words containing the English alveolar lateral phoneme /l/ was compiled. The list includes ’dark’ (velarized) allophonic realizations (which occur before a consonant or at the end of the word before silence) and 52 ’clear’ allophonic realizations (which occur before a vowel), as well as voicing variants. The recorded signals were segmented into allophones and parametrized using a set of descriptors, originating from the MPEG 7 standard, plus dedicated time-based parameters as well as modified MFCC features proposed by the authors. Classification methods such as ANNs, kNN and SOM were employed to automatically detect the two types of allophones. Various sets of features were tested to achieve the best performance of the automatic methods. In the final experiment, a selected set of features was used for the automatic evaluation of the pronunciation of dark /l/ by non-native speakers.

Entry No. 714

Entry type book

Authors A. Kurowski, A. Czyżewski

English title Assessment of Therapeutic Progress After Acquired Brain Injury Employing Electroencephalography and Autoencoder Neural Networks

Polish title

Editor Springer

Pages 961 - 967

Abstract A method developed for parametrization of EEG signals gathered from participants with acquired brain injuries is shown. Signals were recorded during therapeutic session consisting of a series of computer assisted exercises. Data acquisition was performed in a neurorehabilitation center located in Poland. The presented method may be used for comparing the performance of subjects with acquired brain injuries (ABI) who are involved in concentration training program. It may also allow for an assessment of relative difference in performance of two participants involved to exercises by comparing parameters derived from EEG signals acquired in the course of therapeutic sessions. The parametrization method is based on autoencoder neural networks. The efficiency of parameters extracted employing the algorithm was compared to parameters derived from the spectrum of EEG signal. As it was confirmed by achieved results, the presented autoencoder-based method may be applied to predict ABI subjects’ performance in attention training sessions.

Entry No. 715

Entry type book

Authors G. Korvel, A. Kurowski, B. Kostek, A. Czyżewski

English title Speech Analytics Based on Machine Learning

Polish title

Editor Springer

Pages 129 - 157

Abstract In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information retrieval (MIR) domain. Then, phoneme classification beyond the typically used techniques is extended towards exploring Deep Neural Networks (DNNs). This is done by combining Convolutional Neural Networks (CNNs) with audio data converted to the time-frequency space domain (i.e. spectrograms) and then exported as images. In this way a two-dimensional representation of speech feature space is employed. When preparing the phoneme dataset for CNNs, zero padding and interpolation techniques are used. The obtained results show an improvement in classification accuracy in the case of allophones of the phoneme /l/, when CNNs coupled with spectrogram representation are employed. Contrarily, in the case of vowel classification, the results are better for the approach based on pre-selected features and a conventional machine learning algorithm.

Entry No. 716

Entry type conference paper

Authors A. Kurowski, Sz Zaporowski, A. Czyżewski

English title Automatic labeling of traffic sound recordings using autoencoder-derived features

Polish title

Conference

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date 18.9.2019- 20.9.2019

Notes Signal Processing - Algorithms, Architectures, Arrangements and Applications (SPA2019)

Abstract An approach to detection of events occurring in road traffic using autoencoders is presented. Extensions of existing algorithms of acoustic road events detection employing Mel Frequency Cepstral Coefficients combined with classifiers based on k nearest neighbors, Support Vector Machines, and random forests are used. In our research, the acoustic signal gathered from the microphone placed near the road is split into frames and converted into a 2-dimensional form of Mel-cepstrogram. Next, the sequence of mel-cepstrograms is processed by the autoencoder neural network, which assigns a unique embedding to each of processed mel-cepstrograms. The embeddings may be treated as features which can be fed on the input of other machine learning-based classifiers. In our research, we prepared such an autoencoder and compared it with a standard solution of parameterization consisting of averaging MFCC throughout all the analyzed frames. Both types of features were then treated as an input for selected types classifiers. It was found, that parameters derived by the autoencoder neural network may be useful for improving the performance of classifiers in case of problematic classes such as detection of single and multiple vehicles passes.

Entry No. 717

Entry type journal paper

Authors A. Kurowski, K. Mrozik, B. Kostek, A. Czyżewski

English title Comparison of the effectiveness of automatic EEG signal class separation algorithms

Polish title

Journal Journal of Intelligent & Fuzzy Systems

Volume

Number

Pages 1 - 7

Abstract In this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and activity classification. The EEG signal is acquired from a headset containing 14 electrodes. For the parametrization two methods are used, namely, DiscreteWavelet Transform (DWT) employed as a reference parametrization technique and autoencoder neural network. Parameters obtained with those methods are fed to the input of classifiers which assigned them to one of three activity classes. Then, the effectiveness of the assignment of the frames of EEG data into appropriate classes is observed and compared. Results obtained using both methods show differences in accuracy with regard to the task detected depending on factors such as type of parametrization or complexity of the classifier employed for EEG activity classification.

Entry No. 718

Entry type journal paper

Authors A. Kurowski, K. Mrozik, B. Kostek, A. Czyżewski

English title Method for Clustering of Brain Activity Data Derived from EEG Signals

Polish title

Journal Fundamenta Informaticae

Volume 168

Number 2-4

Pages 249 - 268

Abstract A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets, Daubechies and Symlets. Euclidean distance between clusters normalized with respect to the standard deviation of the whole set of data are used to separate each task performed by participants. The results of this stage allow for an assessment of separability between subsets of data associated with each activity performed by experiment participants. The speed of convergence of the training process employing deep learning-based clustering is also measured.

Entry No. 719

Entry type

Authors A. Czyżewski, P. Hoffmann, M. Lech

English title

Polish title Układ do bezkontaktowego składania podpisu

Entry No. 720

Entry type journal paper

Authors M. Lech, M.T Kucewicz, A. Czyżewski

English title Human Computer Interface for Tracking Eye Movements Improves Assessment and Diagnosis of Patients With Acquired Brain Injuries

Polish title

Journal Front. Neurol.

Volume 10

Number 6

Pages 1 - 9

Abstract One of the first clinical signs differentiating the minimally conscious state from the vegetative state is the presence of smooth pursuit eye movements occurring in direct response to moving salient stimuli. Glasgow Coma Scale (GCS) is one of the most commonly used diagnostic tools for acute phase assessment of the level of consciousness, together with a neurological examination. These classic measures are limited to qualitative neurological examination without more quantitative measures provided from e.g., tasks with tracking position of the gaze. Among this and other limitations, it is prone to a relatively high rate of misdiagnosis. Here, we developed an interface for gaze tracking to enhance the assessment of consciousness in 10 patients with acquired brain injuries. According to the acute phase GCS assessment, nine of them were considered unaware and below the minimally conscious state. Chronic neurological examination confirmed six of them below the minimally conscious state. Our new Human Computer Interface (HCI) revealed that six patients were conscious enough to complete at least one of the gaze tracking tasks. Among these six patients, one was originally diagnosed as remaining in a vegetative state and one in coma. The patient diagnosed as remaining in a chronic vegetative state scored six GCS points acutely. Following assessment with our HCI the patient was re-diagnosed with a possible locked-in syndrome. Our HCI method provides a new complementary tool for clinical assessment of patients suffering from disorders of consciousness.

Entry No. 721

Entry type

Authors A. Czyżewski, P. Hoffmann, M. Lech

English title

Polish title Układ do bezkontaktowego składania podpisu

Entry No. 722

Entry type conference paper

Authors Sz Zaporowski, B. Kostek, A. Czyżewski

English title Automatic Transcription of Speech to International Phonetic Alphabet Employing Acoustical and Facial Motion Capture Data

Polish title

Conference International Conference on Digital Image & Signal Processing

Preprint

Number

Volume

Pages

Conference site Oxford, Wielka Brytania

Conference date 29.4.2019- 30.4.2019

Notes Plakat

Abstract An approach to ASR systems combined with the IPA transcription is presented. The system can provide STT accuracy in the range of 70-80%, which could be not enough for discerning classes in practice. Experimental allophone detection was implemented with the use of allophone boundaires. However, the complex nature of the issue and the need to manually mark allophones boundaries by phonology specialists should be taken into account in this particular experiment, since it influences results. That is visible especially, when comparing present results with results of previous author’s research in this subject.

Entry No. 723

Entry type journal paper

Authors P. Szczuko, A. Czyżewski, P. Hoffmann, P. Bratoszewski, M. Lech

English title Validating data acquired with experimental multimodal biometric system installed in bank branches

Polish title

Journal Journ. of Intelligent Information Systems

Volume 52

Number

Pages 1 - 31

Notes https://link.springer.com/article/10.1007%2Fs10844-017-0491-2

Abstract An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment process. The analytical studies and experimental work conducted in the course of this work will lead towards methodologies and solutions of the multimodal biometric technology, which is planned for further development. Before this stage is achieved a study on the data usefulness acquired from a variety of biometric sensors and from survey questionnaires filled in by banking tellers and clients was done. The decision-related sets were approximated by the Rough Set method offering efficient algorithms and tools for finding hidden patterns in data. Prediction of evaluated biometric data quality, based on enrollment samples and on user subjective opinions was made employing the developed method. After an introduction to the principles of applied biometric identity verification methods, the knowledge modelling approach is presented together with achieved results and conclusions.

Entry No. 724

Entry type journal paper

Authors P. Szczuko, A. Czyżewski, M. Szczodrak

English title Variable length sliding models for banking clients face biometry

Polish title

Journal Multimedia Tools and Applications

Volume 78

Number 6

Pages 7749 - 7766

Notes https://link.springer.com/article/10.1007%2Fs11042-018-6432-4

Abstract An experiment was organized in 100 bank branches to acquire biometric samples from nearly 5000 clients including face images. A procedure for creating face verification models based on continuously expanding database of biometric samples is proposed, implemented, and tested. The presented model applies to circumstances where it is possible to collect and to take into account new biometric samples after each positive verification of the user. Thus the model can evolve in time, and it can follow changes of user face characteristics, e.g. changes in complexion, variable amount of facial hair, arriving wrinkles, cheeks chubbiness appearance, etc., introduced as effects of changing lifestyle, sunbathing, gaining weight, aging or other processes. The variable length sliding models derived from the gathered experimental data are described in the paper.

Entry No. 725

Entry type journal paper

Authors P. Szczuko, A. Czyżewski, P. Spaleniak

English title Clarity of Facial Expressions for Various Types of Face Motion Capture Visualizations

Polish title

Journal Transactions on Visualization and Computer Graphics

Volume

Number

Pages 1 - 11

Notes w recenzjach

Abstract The presented work focuses on obtaining clear and readable facial expression animations by choosing the most appropriate 3D face models and motion capture setup. First, a background is provided, introducing and explaining technical aspects of face motion capture with regards to maximization of expression capture accuracy. The prior work is presented and discussed, and key aspects influencing facial expression readability are explained. The problem of emotional clarity is discussed. Our own approach is aimed at verifying effects of the chosen configuration of face markers used for motion capture recording, and the realism level of a virtual face to obtain clear and readable facial expressions. First, an optimal number of markers to balance the simplicity of the setup and motion quality is determined experimentally. Then, animations are created containing four basic emotions projected on different virtual face models, varying in the level of realism. Subjective tests of emotion recognition and expression clarity were conducted, and their results analyzed. Dependencies between model realism and emotion recognition and between frame rate and clarity are discussed.

Entry No. 726

Entry type conference paper

Authors A. Czyżewski, A. Kurowski, Sz Zaporowski

English title Application of autoencoder to traffic noise analysis

Polish title

Conference 178th Meeting of the Acoustical Society of America

Preprint

Number

Volume

Pages

Conference site San Diego, USA

Conference date 2.12.2019- 6.12.2019

Notes Plakat

Abstract The aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to the automatic annotation of road traffic-related events based on noise analysis. Two-dimensional representation of traffic sounds based on Mel Frequency Cepstral Coefficients (MFCC) was fed the autoencoder neural network, and after that classified with k-nearest neighbors algorithm, Support Vector Machines, and random forests. Obtained results show that sound recordings can help determine the number of vehicles passing on the road. However, instead of being treated as independent, this method output should be combined with another source of data, e.g., video processing results or microwave radar data readings. Comparative results of vehicle counting obtained with the use of autoencoder and different classifiers are shown in the paper.

Entry No. 727

Entry type journal paper

Authors A. Czyżewski, A. Kurowski, Sz Zaporowski

English title Application of autoencoder to traffic noise analysis

Polish title

Journal J. Acoust. Soc. Amer.

Volume 146

Number 4

Pages 2958 - 2958

Abstract The aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to the automatic annotation of road traffic-related events based on noise analysis. Two-dimensional representation of traffic sounds based on Mel Frequency Cepstral Coefficients (MFCC) was fed the autoencoder neural network, and after that classified with k-nearest neighbors algorithm, Support Vector Machines, and random forests. Obtained results show that sound recordings can help determine the number of vehicles passing on the road. However, instead of being treated as independent, this method output should be combined with another source of data, e.g., video processing results or microwave radar data readings. Comparative results of vehicle counting obtained with the use of autoencoder and different classifiers are shown in the paper. [The Polish National Centre finances the project for Research and Development (NCBR) from the European Regional Development Fund No. POIR.04.01.04-00-0089/16 entitled: "INZNAK: Intelligent Road Signs with V2X Interface for Adaptive Traffic Controlling."]

Entry No. 728

Entry type conference paper

Authors S. Stefaniak, S. Cygert, A. Czyżewski

English title Recovering sound and vibrations produced by wind turbine structures employing video motion magnification

Polish title

Conference International Conference on Digital Image & Signal Processing

Preprint

Number

Volume

Pages

Conference site Oxford, Wielka Brytania

Conference date 29.4.2019- 30.4.2019

Notes plakat

Abstract This paper presents the experimental setup engineered for the purpose ofrecovering sound produced by wind turbine structures employing videomotion magnification. The paper contains a description of the implementedmethod for counting the number of propeller rotations in the same videoimage of wind turbines. Motion-magnified video recordings of wind turbineson a wind farm were made for the purpose of building a damage predictionsystem. An idea was to use video to recover sound and vibrations in orderto obtain a contactless diagnostic method for wind turbines.

Entry No. 729

Entry type conference paper

Authors A. Czyżewski, S. Cygert, G. Szwoch, J. Kotus, D. Weber, M. Szczodrak

English title Comparative study on the effectiveness of various types of road traffic intensity detectors

Polish title

Conference 6th Int. Conf. Models and Technologies for Intelligent Transportation Systems (MT-ITS)

Preprint

Number

Volume

Pages

Conference site Kraków, Polska

Conference date 5.6.2019- 7.6.2019

Notes D. Koszewski, A. Sroczyński, T. Śmiałkowski, K. Jamroz, W, Kustra, P. Hoffmann

Abstract Vehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring stations. In order to improve the accuracy of traffic monitoring, several modalities, complementing each other, may be used in the monitoring stations. In this paper, we propose a multimodal approach to traffic monitoring, using sensors and signal processing algorithms developed specifically for the described task. The aim of the work described here is to test each modality in a real-life scenario, assess their accuracy and to evaluate their usefulness for multimodal traffic monitoring stations. The modalities described in the paper are: Doppler sensor with custom signal processing, video analysis based on cameras and neural networks (employing deep learning algorithms), audio monitoring based on an acoustic vector sensor developed by the authors, as well as LiDAR and Bluetooth as supplementary means of traffic monitoring. Additionally, road tubes and a commercial video-based monitoring system were used in order to provide reference data. Consequently, we can present in this paper a comparative study on the effectiveness of traffic sensors operating based on different principles of work.

Entry No. 730

Entry type journal paper

Authors A. Czyżewski, B. Kostek

English title Remembrance about Marianna Sankiewicz and Gustaw Budzyński – our teachers and scientific mentors

Polish title

Journal Archives of Acoustics

Volume 44

Number 3

Pages 615 - 615

Notes 150859 w moja.pg, 66th Open Seminar on Acoustics Boszkowo, Poland, September 18 – 20, 2019

Entry No. 731

Entry type journal paper

Authors B. Kostek, A. Czyżewski

English title Sound engineering as our commitment to its creators in Poland

Polish title

Journal Archives of Acoustics

Volume 44

Number 3

Pages 617 - 617

Notes 150860 w moja.pg, 66th Open Seminar on Acoustics Boszkowo, Poland, September 18 – 20, 2019

Entry No. 732

Entry type conference paper

Authors A. Czyżewski

English title

Polish title Projekt INZNAK: aktywne znaki drogowe

Conference Polski Kongres Drogowy. VI Warmińsko-Mazurskie Forum Drogowqe

Preprint

Number

Volume

Pages

Conference site

Conference date 22.9.2019- 24.9.2019

Notes nr rekordu w MojaPG 150864

Entry No. 733

Entry type conference paper

Authors A. Czyżewski, D. Grabowski, . Czyzewski

English title

Polish title Automatyczna ocena śliskości nawierzchni drogowej z wykorzystaniem splotowych sieci neuronowych

Conference XII Polski Kongres ITS

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 22.5.2019- 23.5.2019

Notes nr rekordu w MojaPG 150866

Entry No. 734

Entry type conference paper

Authors A. Czyżewski, P. Hoffmann

English title

Polish title Projekt INZNAK: Inteligentne znaki drogowe do adaptacyjnego sterowania ruchem pojazdów

Conference "Drzwi Otwarte Centrum Sterowania Ruchem m.st. Warszawy"

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date 10.1.2019- 10.1.2019

Notes tej prezentacji nie odnotowałem w MojaPG, ale warto ją umieścić na liście publikacji w raporcie z realizacji projektu INZNAK

Entry No. 735

Entry type journal paper

Authors A. Kwiatkowska, M. Lech, P. Odya, A. Czyżewski

English title Post-comatose patients with minimal consciousness tend to preserve reading comprehension skills but neglect syntax and spelling

Polish title

Journal Sci. Rep.

Volume 9

Number 19929

Pages 1 - 12

Abstract Modern eye tracking technology provides a means for communication with patients suffering from disorders of consciousness (DoC) or remaining in locked-in-state. However, being able to use an eye tracker for controlling text-based contents by such patients requires preserved reading ability in the first place. To our knowledge, this aspect, although of great social importance, so far has seemed to be neglected. In the paper, we presented the possibility of using an eye-tracking technology for assessing reading comprehension skills in post-comatose patients with minimal consciousness. We prepared various syllable-, word- and sentence-based tasks, controlled by gaze, used for assessing the reading comprehension skills. The obtained results showed that people with minimal consciousness preserved the reading comprehension skills, in most cases to a high extent, but had difficulties with recognizing errors in the written text. The ability to maintain attention during performing the tasks was in statistically significant correlation with motivation, and that one was in a statistically significant correlation with the reading ability. The results indicate that post-comatose patients with minimal consciousness can read words and sentences, hence some useful hints may be provided for the development of gaze tracking-based human-computer interfaces for these people.

Entry No. 736

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Vehicle detector training with minimal supervision

Polish title

Conference The International Conference on Digital Image and Signal Processing (DISP 2019)

Preprint

Number

Volume

Pages

Conference site

Conference date 29.4.2019- 30.4.2019

Notes nr rekordu w MojaPG 150878, ISBN: 978‐1‐912532‐09‐4

Abstract Recently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not available at all or they are limited in scope, e.g. many robotics applications. However, it is usually possible to obtain a large set of unlabelled data which contain useful information. The above mentioned possibility justifies a development of methods that exploit unlabelled data and they work with a minimal number of required annotations. In this work we follow recent self-supervised learning paradigm. Large unlabelled dataset of traffic monitoring was acquired by the authors. Then CNN was trained in order to perform moving objects segmentation based on labels obtained from a unsupervised motionbased segmentation algorithm. Even though collected labels are not perfect, they still allow CNN to learn an efficient feature representation. In the next step we fine-tuned the CNN algorithm on a limited set of manually labelled ground-truth data for object detection. Subsequently, we investigated the relation between the number of labels used for fine-tuning and final detection performance on the test set. We also compared the results with CNN pretrained on ImageNet which is now a common technique. Vehicle detection results obtained on our custom dataset are presented in the paper. The obtained results are promising, because they demonstrate that even when only limited ground-truth data are available, it is still possible to learn efficient feature representation given large collection of unlabelled data. The presented approach seems applicable to any object detector setting where there is an access to a large set of unlabelled data with moving objects of interest.

Entry No. 737

Entry type conference paper

Authors D. Grabowski, A. Czyżewski

English title Road slippery detection based on CCTV cameras using convolutional neural networks

Polish title

Conference The International Conference on Digital Image and Signal Processing (DISP 2019)

Preprint

Number

Volume

Pages

Conference site

Conference date 29.4.2019- 30.4.2019

Notes nr rekordu w MojaPG 150884

Abstract One of the important causes of road accidents is the surface slippery. The problem causes an increased braking distance and a worse wheels grip. Drivers often do not adjust the speed of the vehicle to conditions on the road, posing a threat to themselves and other road users. Therefore, it is important to quickly inform users about wet, snowy or icy road to enhance safety. Currently in Poland, in the system operated by the General Management for National Roads and Motorways, more than 700 CCTV cameras are installed along national roads and motorways. Basing on camera images, the operator decides whether to send a message about the danger to the drivers. Unfortunately, with the expanding number of devices, the system’s reaction time is increasing. Therefore, for the sake of safety, it is necessary to create a way to detect and to assess road conditions faster through automatic operation mode. To solve this problem we decided to use convolutional neural networks (CNN). As a part of work, the software was created to periodically download and to aggregate road images in the database. In addition, weather condition data from the place where a photo was taken, were assigned to each record. Thanks to this, a base of 1 million photographs was obtained with labels defining the condition of the road surfaces. Unfortunately, camera images are not standardized throughout the system. They are characterized by different resolution, lighting, noise occurrence, and other factors. Therefore, a work is underway to measure the quality of the photo and choose the best ones to use in the learning process. As an effect, using the Tensor Flow library, the CNN model will be created and trained, allowing the classification of road photos with the division into categories: dry, wet, icy, and snowy. It will be possible on this basis to automatically generate an alert about the danger for road users.

Entry No. 738

Entry type journal paper

Authors A. Czyżewski, A. Kurowski, S. Zaporowski

English title Application of autoencoder to traffic noise analysis

Polish title

Journal J. Acoust. Soc. Amer.

Volume 146

Number 2

Pages 2958 - 2958

Notes nr rekordu w MojaPG 150902

Abstract The aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to the automatic annotation of road traffic-related events based on noise analysis. Two-dimensional representation of traffic sounds based on Mel Frequency Cepstral Coefficients (MFCC) was fed the autoencoder neural network, and after that classified with k-nearest neighbors algorithm, Support Vector Machines, and random forests. Obtained results show that sound recordings can help determine the number of vehicles passing on the road. However, instead of being treated as independent, this method output should be combined with another source of data, e.g., video processing results or microwave radar data readings. Comparative results of vehicle counting obtained with the use of autoencoder and different classifiers are shown in the paper. [The Polish National Centre finances the project for Research and Development (NCBR) from the European Regional Development Fund No. POIR.04.01.04-00-0089/16 entitled: "INZNAK: Intelligent Road Signs with V2X Interface for Adaptive Traffic Controlling."]

Entry No. 739

Entry type journal paper

Authors A. Czyżewski, J. Kotus, G. Szwoch

English title Estimating traffic intensity employing passive acoustic radar and enhanced microwave Doppler radar sensor

Polish title

Journal Remote Sensing

Volume 12

Number 1

Pages 110 - 110

Notes Special Issue: Radar and Sonar Imaging and Processing

Abstract Innovative road signs that can autonomously display the speed limit in cases where the traffic situation requires it are under development. The autonomous road sign contains many types of sensors, of which the subject of interest in this article is the Doppler sensor we have improved and the constructed and calibrated acoustic probe. An algorithm to perform vehicle detection and tracking, as well as vehicle speed measurement, in a signal acquired with a continuous wave Doppler sensor, is discussed. A method is also presented and studied experimentally for counting vehicles and for determining their movement direction by means of acoustic vector sensor application. The assumptions of the method employing spatial distribution of sound intensity determined with the help of an integrated 3D sound intensity probe are discussed. The enhanced Doppler radar and the developed sound intensity probe were used for the experiments which are described and analyzed in the paper.

Entry No. 740

Entry type journal paper

Authors S. Zaporowski, A. Czyzewski

English title Audio Feature Analysis for Precise Vocalic Segments Classification in English for Security Purposes

Polish title

Journal Multimedia Tools and Applications

Volume

Number

Pages

Notes Zgłoszenie po konferencji MCSS

Abstract An approach to identifying the most meaningful Mel-Frequency Cepstral Coeffi-cients representing selected allophones and vocalic segments for their classifica-tion is presented in the paper. For this purpose, experiments were carried out us-ing algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal recorded employing 7 speakers who spoke English at the native or near-native speaker level withing a Standard Southern British English variety accent. The recordings were analyzed by special-ists from the field of phonology in order to extract vocalic segments and selected allophones. Then parameterization was made using Mel Frequency Cepstral Co-efficients, Delta MFCC, and Delta Delta MFCC. In the next stage, feature vectors were passed to the input of individual algorithms utilized to reduce the size of the vector by previously mentioned algorithms. The vectors prepared in this way have been used for classifying allophones and vocalic segments employing sim-ple Artificial Neural Network (ANN) and Support Vector Machine (SVM). The classification results using both classifiers and methods applied for reducing the number of parameters were presented. Also, clusterization was conducted using K-means algorithm. The results of the reduction are also shown explicitly by in-dicating parameters proven to be significant and those rejected by particular algo-rithms. Factors influencing the obtained results were discussed. Difficulties asso-ciated with obtaining the data set, labeling, and research on allophones classifica-tion were also pointed-out. Allophone-based speech analysis was also considered in the context of forensic speaker recognition.

Entry No. 741

Entry type journal paper

Authors S. Cygert, A Czyżewski

English title Toward Robust Pedestrian Detection With Data Augmentation

Polish title

Journal IEEE Access

Volume 8

Number

Pages 13667 - 13668

Abstract In this article, the problem of creating a safe pedestrian detection model that can operate in the real world is tackled. While recent advances have led to significantly improved detection accuracy on various benchmarks, existing deep learning models are vulnerable to invisible to the human eye changes in the input image which raises concerns about its safety. A popular and simple technique for improving robustness is using data augmentation. In this work, the robustness of existing data augmentation techniques is evaluated to propose a new simple augmentation scheme where during training, an image is combined with a patch of a stylized version of that image. Evaluation of pedestrian detection models robustness and uncertainty calibration under naturally occurring corruption and in realistic cross-dataset evaluation setting is conducted to show that our proposed solution improves upon previous work. In this paper, the importance of testing the robustness of recognition models is emphasized and it shows a

Entry No. 742

Entry type journal paper

Authors A. Czyżewski, A. Kurowski, P. Odya, P. Szczuko

English title Multifactor consciousness level assessment of participants with acquired brain injuries employing human–computer interfaces

Polish title

Journal BioMedical Engineering OnLine

Volume 19

Number

Pages 1 - 26

Abstract Background A lack of communication with people suffering from acquired brain injuries may lead to drawing erroneous conclusions regarding the diagnosis or therapy of patients. Information technology and neuroscience make it possible to enhance the diagnostic and rehabilitation process of patients with traumatic brain injury or post-hypoxia. In this paper, we present a new method for evaluation possibility of communication and the assessment of such patients’ state employing future generation computers extended with advanced human–machine interfaces. Methods First, the hearing abilities of 33 participants in the state of coma were evaluated using auditory brainstem response measurements (ABR). Next, a series of interactive computer-based exercise sessions were performed with the therapist’s assistance. Participants’ actions were monitored with an eye-gaze tracking (EGT) device and with an electroencephalogram EEG monitoring headset. The data gathered were processed with the use of data clustering techniques. Results Analysis showed that the data gathered and the computer-based methods developed for their processing are suitable for evaluating the participants’ responses to stimuli. Parameters obtained from EEG signals and eye-tracker data were correlated with Glasgow Coma Scale (GCS) scores and enabled separation between GCS-related classes. The results show that in the EEG and eye-tracker signals, there are specific consciousness-related states discoverable. We observe them as outliers in diagrams on the decision space generated by the autoencoder. For this reason, the numerical variable that separates particular groups of people with the same GCS is the variance of the distance of points from the cluster center that the autoencoder generates. The higher the GCS score, the greater the variance in most cases. The results proved to be statistically significant in this context. Conclusions The results indicate that the method proposed may help to assess the consciousness state of participants in an objective manner.

Entry No. 743

Entry type journal paper

Authors A. Kurowski, S. Zaporowski, A. Czyzewski

English title SIMULATION OF SPEED RECOMMENDATION ALGORITHM IMPACT ON MINIMUM DISTANCE BETWEEN VEHICLES

Polish title

Journal Transport Problems

Volume

Number

Pages

Notes Zgłoszenie po konferencji GAMBIT

Abstract An approach to recommendation systems suggesting safe speed on the road is presented. Real data obtained on roads in the Pomeranian Voivodeship in Poland were used for the simulation. As part of the INZNAK project, a number of measurements were carried out both on local roads and expressways. Based on the data obtained, a speed recommendation model was created based on the SUMO traffic simulator. The proposed system, depending on the volume of traffic and atmospheric conditions prevailing on the road and the surface condition, recommends the safe speed for a passing vehicle influencing the distance from the preceding vehicle to a safe distance. The observed effect of the system application was indeed an increase in the minimal distance between vehicles in most simulations.

Entry No. 744

Entry type conference paper

Authors A. Kurowski, S. Zaporowski, A. Czyzewski

English title SIMULATION OF SPEED RECOMMENDATION ALGORITHM IMPACT ON MINIMUM DISTANCE BETWEEN VEHICLES

Polish title

Conference GAMBIT 2020

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 7.9.2020- 8.9.2020

Notes Referat na konferencję GAMBIT - brak oficjalnej monografii pokonferencyjnej - zgłoszenie do Transport Problems

Abstract An approach to recommendation systems suggesting safe speed on the road is presented. Real data obtained on roads in the Pomeranian Voivodeship in Poland were used for the simulation. As part of the INZNAK project, a number of measurements were carried out both on local roads and expressways. Based on the data obtained, a speed recommendation model was created based on the SUMO traffic simulator. The proposed system, depending on the volume of traffic and atmospheric conditions prevailing on the road and the surface condition, recommends the safe speed for a passing vehicle influencing the distance from the preceding vehicle to a safe distance. The observed effect of the system application was indeed an increase in the minimal distance between vehicles in most simulations.

Entry No. 745

Entry type conference paper

Authors S. Zaporowski, A. Czyzewski

English title Audio Feature Analysis for Precise Vocalic Segments Classification in English

Polish title

Conference MCSS 2020 - Multimedia Communications, Services and Security

Preprint

Number

Volume

Pages

Conference site

Conference date 8.10.2020- 9.10.2020

Abstract An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal recorded employing 7 speakers who spoke English at the native or near-native speaker level withing a Standard Southern British English variety accent. The recordings were analyzed by specialists from the field of phonology in order to extract vocalic segments and selected allophones. Then parameterization was made using Mel Frequency Cepstral Coefficients, Delta MFCC, and Delta Delta MFCC. In the next stage, feature vectors were passed to the input of individual algorithms utilized to reduce the size of the vector by previously mentioned algorithms. The vectors prepared in this way have been used for classifying allophones and vocalic segments employing simple Artificial Neural Network (ANN) and Support Vector Machine (SVM). The classification results using both classifiers and methods applied for reducing the number of parameters were presented. The results of the reduction are also shown explicitly, by indicating parameters proven to be significant and those rejected by particular algorithms. Factors influencing the obtained results were discussed. Difficulties associated with obtaining the data set, its labeling, and research on allophones were also analyzed.

Entry No. 746

Entry type conference paper

Authors M. Stefaniak, A. Czyzewski

English title Comparison of two methods of sound extraction from guitar string video recordings

Polish title

Conference IEEE SPA 2020

Preprint

Number

Volume

Pages 146 - 150

Conference site Poznań, Polska

Conference date 23.9.2020- 25.9.2020

Abstract A comparison of two sound extraction methods from guitar string video recordings is presented in the paper. A brief overview of highframe rate camera technology and possible applications are included. The method using the image analysis from two such cameras is presented. The cameras are placed at the angle of 90 degrees for recording the image in three planes. The results achieved with the setup proposed by ourselvesare comparedto the results of recording with a single highframe rate camera used for the Visual Microphone method developed by scientists from MIT. Spectrograms and signal spectra of recordings were compared and discussed, revealing that both methods of sound extraction from video brought the ability to reproduce sound, but with some distortions.Finally, the options for future experiments are considered.

Entry No. 747

Entry type journal paper

Authors S. Cygert, A. Czyżewski

English title Vehicle Detection with Self-Training for Adaptative Video Processing Embedded Platform

Polish title

Journal Applied Sciences-Basel

Volume 10

Number 17

Pages

Abstract Traffic monitoring from closed-circuit television (CCTV) cameras on embedded systems is the subject of the performed experiments. Solving this problem encounters difficulties related to the hardware limitations, and possible camera placement in various positions which affects the system performance. To satisfy the hardware requirements, vehicle detection is performed using a lightweight Convolutional Neural Network (CNN), named SqueezeDet, while, for tracking, the Simple Online and Realtime Tracking (SORT) algorithm is applied, allowing for real-time processing on an NVIDIA Jetson Tx2. To allow for adaptation of the system to the deployment environment, a procedure was implemented leading to generating labels in an unsupervised manner with the help of background modelling and the tracking algorithm. The acquired labels are further used for fine-tuning the model, resulting in a meaningful increase in the traffic estimation accuracy, and moreover, adding only minimal human effort to the process allows for further accuracy improvement. The proposed methods, and the results of experiments organised under real-world test conditions are presented in the paper.

Entry No. 748

Entry type conference paper

Authors A. Kurowski, S. Zaporowski, A. Czyzewski

English title 1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

Polish title

Conference IEEE SPA 2020

Preprint

Number

Volume

Pages 142 - 145

Conference site Poznań, Polska

Conference date 23.9.2020- 25.9.2020

Abstract A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence. The neural network architecture employed for these tasks is a 1D convolutional network. Two types of classifiers are tested: one analyzing only the current audio frame and one analyzing three consecutive audio frames that allow us to take into account the context of the middle frame occurrence. The neural network is trained on datasets derived for four frame lengths, namely 50 ms, 100 ms, 200 ms, and 400 ms. Results of statistical analysis of both network classification accuracy are presented. The context-aware variant of a neural network performed better in a statistically significant manner for three out of four investigated frame lengths

Entry No. 749

Entry type journal paper

Authors P. Szczuko, A. Kurowski, P. Odya, A. Czyżewski, B. Kostek, B. Graff

English title Granularity Concept Applied to Respiratory Rate Quantification and Abnormal Pattern Prediction

Polish title

Journal Cognitive Computation - Granular Computing and Three-Way Decisions for Cognitive Analytics

Volume

Number

Pages

Notes w przygotowaniu

Abstract W przygotowaniu

Entry No. 750

Entry type conference paper

Authors A. Kurowski, Sz Zaporowski, A. Czyżewski

English title Simulation of the impact of the speed recommendation algorithm on the minimum distance between vehicles

Polish title Symulacja oddziaływania algorytmu zalecanych prędkości na minimalny odstęp pomiędzy pojazdami

Conference XIII Międzynarodowa Konferencja Bezpieczeństwa Ruchu Drogowego GAMBIT 2020

Preprint

Number

Volume

Pages

Conference site Gdańsk, Polska

Conference date 7.9.2020- 8.9.2020

Notes referat został wstępnie zakwalifikowany do publikacji w czasopiśmie Transport Problems

Abstract This article presents the approach to recommendation systems suggesting safe speed on the road based on real data obtained on roads in the Pomeranian Voivodeship in Poland. As part of the INZNAK project, several measurements were carried out both on local roads and expressways. Based on the data obtained, a speed recommendation model was created based on the SUMO traffic simulator. The proposed system, depending on the volume of traffic and atmospheric conditions prevailing on the road and the condition of the surface, recommends a safe speed for a passing vehicle, increasing the distance from the preceding vehicle to a safe distance. The observed effect of the system was increasing the distance between vehicles in most simulations.

Streszczenie W artykule przedstawiono podejście do systemów rekomendacji sugerujących bezpieczną prędkość na drodze na podstawie rzeczywistych danych uzyskanych na drogach województwa pomorskiego w Polsce. W ramach projektu INZNAK wykonano kilka pomiarów zarówno na drogach lokalnych, jak i drogach ekspresowych. Na podstawie uzyskanych danych powstał model doboru prędkości oparty na symulatorze ruchu SUMO. Proponowany system, w zależności od natężenia ruchu i warunków atmosferycznych panujących na drodze oraz stanu nawierzchni, zaleca bezpieczną prędkość dla przejeżdżającego pojazdu, wymuszając zwiększenie odległości od poprzedzającego pojazdu do bezpiecznej wartości. Obserwowanym efektem działania systemu było zwiększenie odległości między pojazdami w większości symulacji.

Entry No. 751

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Evaluating calibration and robustness of pedestrian detectors

Polish title

Conference Multimedia Communications, Services & Security (MCSS'20)

Preprint

Number

Volume

Pages

Conference site

Conference date 8.10.2020- 9.10.2020

Abstract In this work robustness and calibration of modern pedestrian detectors are evaluated. Pedestrian detection is a crucial perception com- ponent in autonomous driving and here we study its performance under different image corruptions. Furthermore, we provide analysis of classifi- cation calibration of pedestrian detectors and we show a positive effect of using style-transfer augmentation technique. Our analysis is aimed as a step towards understanding and improving current safety-critical detection systems.

Entry No. 752

Entry type conference paper

Authors S. Cygerrt, F. Górski, P. Juszczyk, S. Lewalski, K. Pastuszak, A Czyżewski

English title Towards Cancer Patients Classification Using Liquid Biopsy

Polish title

Conference Predictive Intelligence in Medicine, MICCAI 2021 workshop

Preprint

Number

Volume

Pages 221 - 230

Conference site Strasburg, Francja

Conference date 1.2021- 1..2021

Abstract (zgodnie z językiem wydania) Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier based on the Convolutional Neural Network (CNN), allowed significantly improving the classification accuracy. In this work, we take a closer look at this approach and find that similar results can be obtained using significantly smaller models. Additionally, competitive results were achieved using gradient boosting. Since it has another advantage of adding interpretability to the model, we further analyze it in this work

Entry No. 753

Entry type journal paper

Authors M. Lech, A. Czyzewski, MT Kucewicz

English title CyberEye: New Eye-Tracking Interfaces for Assessment and Modulation of Cognitive Functions beyond the Brain

Polish title

Journal Sensors

Volume 21

Number 22

Pages 1 - 7

Abstract The emergence of innovative neurotechnologies in global brain projects has accelerated research and clinical applications of BCIs beyond sensory and motor functions. Both invasive and noninvasive sensors are developed to interface with cognitive functions engaged in thinking, communication, or remembering. The detection of eye movements by a camera offers a particularly attractive external sensor for computer interfaces to monitor, assess, and control these higher brain functions without acquiring signals from the brain. Features of gaze position and pupil dilation can be effectively used to track our attention in healthy mental processes, to enable interaction in disorders of consciousness, or to even predict memory performance in various brain diseases. In this perspective article, we propose the term ‘CyberEye’ to encompass emerging cognitive applications of eye-tracking interfaces for neuroscience research, clinical practice, and the biomedical industry. As CyberEye technologies continue to develop, we expect BCIs to become less dependent on brain activities, to be less invasive, and to thus be more applicable.

Entry No. 754

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Robustness in Compressed Neural Networks for Object Detection

Polish title

Conference 2021 International Joint Conference on Neural Networks (IJCNN)

Preprint

Number

Volume

Pages

Conference site Shenzhen, Chiny

Conference date 2021- .2021

Abstract Model compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving setting, which is considered in this work. It was shown in the paper that the sensitivity of compressed models to different distortion types is nuanced, and some of the corruptions are heavily impacted by the compression methods (i.e., additive noise), while others (blur effect) are only slightly affected. A common way to improve the robustness of models is to use data augmentation, which was confirmed to positively affect models' robustness, also for highly compressed models. It was further shown that while data imbalance methods brought only a slight increase in accuracy for the baseline model (without compression), the impact was more striking at higher compression rates for the structured pruning. Finally, methods for handling data imbalance brought a significant improvement of the pruned models' worst-detected class accuracy.

Entry No. 755

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Robust Object Detection with Multi-input Multi-output Faster R-CNN

Polish title

Conference International Conference on Image Analysis and Processing ICIAP 2022

Preprint

Number

Volume

Pages

Conference site LEcce, Włochy

Conference date 23.5.2022- 27.5.2022

Notes https://link.springer.com/chapter/10.1007/978-3-031-06427-2_48

Abstract Recent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work, a generalization of the MIMO approach is applied to the task of object detection using the general-purpose Faster R-CNN model. It was shown that using the MIMO framework allows building strong feature representation and obtains very competitive accuracy when using just two input/output pairs. Furthermore, it adds just 0.5% additional model parameters and increases the inference time by 15.9% when compared to the standard Faster R-CNN. It also works comparably to or outperforms the Deep Ensemble approach in terms of model accuracy, robustness to out-of-distribution setting, and uncertainty calibration when the same number of predictions is used. This work opens up avenues for applying the MIMO approach in other high-level tasks such as semantic segmentation and depth estimation.

Entry No. 756

Entry type journal paper

Authors K. Marciniuk, B. Kostek, A. Czyżewski

English title Classifying type of vehicles on the basis of data extracted from audio signal characteristics

Polish title

Journal J. Acoust. Soc. Amer.

Volume 141

Number 5

Pages 3883 - 3883

Notes http://acousticalsociety.org/sites/default/files/docs/Boston_Full_Week.pdf

Abstract he aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox, designed for music parameter extraction, is used to obtain a vector of parameters. Correlation analyses are performed to check whether extracted parameters enable to separate selected types of vehicle-associated noise, e.g.: car, truck and motorcycle. Behrens-Fisher statistics is used to find the most suitable parameters that may be contained in the optimized feature vector. The last step is to build a decision system that allows for the automatic classification of a vehicle type. The results of automatic classification of prepared vehiclenoise related samples are shown and discussed.

Entry No. 757

Entry type journal paper

Authors A. Czyżewski, M. Piotrowska, B. Kostek

English title Analysis of allophones based on audio signal recordings and parameterization

Polish title

Journal J. Acoust. Soc. Amer.

Volume 141

Number 5

Pages 3521 - 3521

Abstract The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping in time demands a precise determination of allophonic pronunciation in the context of phonemic transcription. The presented study is focused on creation of speech recordings that may serve for the analysis of allophone variation. Two sets of recordings are prepared. The first one consists of words read by the non-native speakers. Tempo of reading is forced by a teleprompter. In the second case, every word is played back from the recordings of the phonology expert and then the speaker repeats a particular word. The last stage is the assessment of recordings by the same expert. Scores assigned by the expert are included as a reference for signal analysis and parametrization.

Entry No. 758

Entry type conference paper

Authors S. Cygert, B. Wróblewski, K. Woźniak, R. Słowiński, A. Czyżewski

English title Closer Look at the Uncertainty Estimation in Semantic Segmentation under Distributional Shift

Polish title

Conference 2021 International Joint Conference on Neural Networks (IJCNN)

Preprint

Number

Volume

Pages

Conference site Shenzhen, Chiny

Conference date

Notes https://ieeexplore.ieee.org/document/9533330

Abstract While recent computer vision algorithms achieve impressive performance on many benchmarks, they lack robustness - presented with an image from a different distribution, (e.g. weather or lighting conditions not considered during training), they may produce an erroneous prediction. Therefore, it is desired that such a model will be able to reliably predict its confidence measure. In this work, uncertainty estimation for the task of semantic segmentation is evaluated under a varying level of domain shift: in a cross-dataset setting and when adapting a model trained on data from the simulation. It was shown that simple color transformations already provide a strong baseline, comparable to using more sophisticated style-transfer data augmentation. Further, by constructing an ensemble consisting of models using different backbones and/or augmentation methods, it was possible to improve significantly model performance in terms of overall accuracy and uncertainty estimation under the domain shift setting. The Expected Calibration Error (ECE) on challenging GTA to Cityscapes adaptation was reduced from 4.05 to the competitive value of 1.1. Further, an ensemble of models was utilized in the self-training setting to improve the pseudo-labels generation, which resulted in a significant gain in the final model accuracy, compared to the standard fine-tuning (without ensemble).

Entry No. 759

Entry type journal paper

Authors K. Łopatka, R. Rybacki, B. Kunka, A. Czyżewski, B. Kostek

English title Virtual Keyboard controlled by eye gaze employing speech synthesis

Polish title Wirtualna Klawiatura sterowana wzrokiem, wykorzystująca syntezę mowy

Journal Elektronika

Volume 52

Number 1

Pages 39 - 42

Bibliographic No. 11

Abstract The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents an algorithm of concatenative speech synthesis used in the engineered solution. Both modules of the system described were created by the Multimedia Systems Department. The work of the entire system was verified in real conditions. Conclusions focusing on the usefulness of this approach are provided.

Streszczenie W artykule przedstawiono zastosowanie syntezy mowy w zintegrowanym w systemie śledzenia punktu fiksacji wzroku. Takie podejście w znaczący sposób może przyczynić się do poprawy jakości życia osób niepełnosprawnych fizycznie, które nie mają możliwości komunikowania się. Interfejsem umożliwiającym wprowadzanie do syntetyzera mowy tekstu jest wirtualna klawiatura z rozkładem klawiszy QWERTY. W pierwszej części artykułu przedstawiono sposób wyznaczania punktu fiksacji wzroku na monitorze komputerowym za pomocą stworzonego w Katedrze Systemów Multimedialnych systemu o nazwie Cyber-Oko. W drugiej części zaprezentowano algorytm syntezy mowy konkatenacyjnej, który jest wykorzystywany w zaproponowanym rozwiązaniu. Sprecyzowano odpowiednie wnioski na temat użyteczności takiego podejścia oraz zweryfikowano pracę systemu w warunkach rzeczywistych.

Entry No. 760

Entry type book

Authors A. Ciarkowski, A. Czyżewski

English title

Polish title System komunikacji operacyjnej i dostepu do strumieni multimedialnych dla terminali mobilnych

Editor PPBW

Pages

Notes (w druku)

Abstract Developed multimedia communications framework optimized for operational use by law enforcement and security forces is presented. A special attention was paid to wireless access to multimedia streams originating from “intelligent surveillance” cameras. The requirements were analyzed and the assumptions leading to the system design are discussed. The use of PDA-class devices as mobile communication terminals is proposed. The idea of using XMPP protocol as signaling medium is described. The issue of real-time multimedia transmission with Jingle/XMPP extension is discussed. Technical aspects of establishing multimedia communications session in the presence of intermediary network devices are raised and a solution dedicated for surveillance system, which combines intelligent relay with transmission recorder is proposed. The topic of transmission of metadata specific to video and audio streams originating from surveillance system stations is discussed.

Streszczenie Przedstawiono opracowany system komunikacji multimedialnej zoptymalizowany pod kątem jego wykorzystania w warunkach operacyjnych przez służby odpowiedzialne za ochronę obiektów i bezpieczeństwo. Szczególną uwagę poświęcono funkcjonalności bezprzewodowego dostępu do strumieni multimedialnych pochodzących z kamer systemu „inteligentnego monitoringu”. Przeanalizowano wymagania i omówiono założenia, na których opiera się projekt tego systemu. Zaproponowano wykorzystanie urządzeń klasy PDA w roli mobilnych terminali komunikacyjnych. Przedstawiono ideę wykorzystania protokołu XMPP jako medium komunikacji sygnalizacyjnej. Przedyskutowano zagadnienie transmisji multimediów w czasie rzeczywistym z wykorzystaniem rozszerzenia Jingle/XMPP. Zwrócono także uwagę na techniczne aspekty związane z nawiązaniem sesji komunikacji multimedialnej w obecności urządzeń pośredniczących typowych dla sieci rozproszonych. Omówiono zagadnienie transmisji metadanych charakterystycznych dla strumieni wizyjnych i fonicznych pochodzących ze stacji systemu monitoringowego.

Entry No. 761

Entry type journal paper

Authors P. Odya, A. Kurowski, A. Czyżewski

English title Study of the impact of audio-visual stimulation on progress in English language learning

Polish title

Journal Computer Assisted Language Learning

Volume

Number

Pages

Notes w recenzji (po poprawkach)

Abstract The experiments aimed to verify whether the use of auditory and visual stimulation can improve the effectiveness of learning selected skills in a foreign language. The main idea was to perform stimulation of the sight and hearing senses using the content processed with some special digital signal processing (DSP) techniques. The auditory stimulation is based on adjusting the pace of speech without changing the pitch of the recorded spoken material. The visual stimulation consists of displaying on the computer monitor the text heard in headphones with the highlighted word or phrase - synchronized with the speech. The experiment was supposed to compare the effectiveness of English language learning of participants related to four primary skills: understanding the meaning of speech, assimilation of new contextual vocabulary, pronunciation skills, the ability to spell correctly. The most promising results were obtained for the group using only the visual stimulation for which statistically significant improvement is observed in the vocabulary-related and pronunciation skills. Auditory stimulation did not bring the expected positive effects, except for only one exercise, namely vocabulary-related skill. No improvement and even some degradation were observed for the bimodal stimulation. The achieved results of the experimental study are discussed, and general conclusions on the impact of the proposed auditory-visual stimulation on English learning progress are drawn.

Entry No. 762

Entry type conference paper

Authors A. Ciarkowski, P. Mroczkiewicz, A. Czyżewski

English title Concept of Distributed Multimedia Processing in Mobile Networks Utilizing Web Services

Polish title

Conference XXII Polish National Symposium on Telecommunications

Preprint

Number

Volume

Pages

Conference site Bydgoszcz, Polska

Conference date

Notes Dane preprintu zostaną uzupełnione

Abstract Concept of distributed multimedia processing in mobile networks is presented as an effective method for dealing with hardware limitations of portable devices. The DeSyME environment is depicted as a convenient platform for development and running of mobile services implementing described architecture. An innovative approach to mobile web service invocation based on semantical mechanisms is presented. Semantical web service “wrapper” for BPEL-initiated invocation is described.

Entry No. 763

Entry type

Authors A. Czyżewski, B. Kostek, P. Odya

English title Method for the visual performance of sound works on musical instruments using the musical notation and the system for execution of this method

Polish title Sposób wzrokowego wykonywania utworów dźwiękowych na instrumentach muzycznych z wykorzystaniem zapisu nutowego oraz układ do realizacji tego sposobu

Notes nr zgł. 407812, data zgł. 2014-04-07, nr BUP 21/2015, data pub. BUP 2015-10-12,

Streszczenie Przedmiotem zgłoszenia jest sposób wzrokowego wykonywania utworów dźwiękowych na instrumentach muzycznych z wykorzystywaniem zapisu nutowego, który charakteryzuje się tym, że w pamięci urządzenia komputerowego zaimplementowuje się sygnały z informacjami o elementach zapisu nutowego, zaś na ekranie wyświetlacza wyświetla się obraz z zapisem nutowym. Na oczy użytkownika kieruje się światło podczerwone i mierzy się w znany sposób fiksację wzroku użytkownika, a na podstawie pomiarów punktu fiksacji wzroku określa się jednostkowy element zapisu nutowego, na którym skupiony jest wzrok użytkownika. Następnie przesyła się pierwszy sygnał sterujący do sterownika mikroprocesorowego, przy pomocy którego porównuje się informacje zawarte w pierwszym sygnale sterującym z informacjami o jednostkowych elementach zapisu nutowego, zawartymi w pamięci urządzenia komputerowego. Generuje się drugi sygnał sterujący z zapisami dźwiękowymi, odpowiadającymi poszczególnym jednostkowym elementom zapisu nutowego, po czym drugi sygnał sterujący przesyła się do układu syntezy dźwięku, przy pomocy którego generuje się trzeci sygnał sterujący, zawierający informacje dźwiękowe, odtwarzane przez układ odtwarzania dźwięku. Przedmiotem wynalazku jest również układ do realizacji sposobu.

Entry No. 764

Entry type conference paper

Authors A. Czyżewski, P. Bratoszewski, P. Hoffmann, M. Lech, M. Szykulski, M. Szczodrak

English title

Polish title Rozproszone laboratorium zastosowań biometrii w bankowości

Conference

Preprint

Number

Volume

Pages

Conference site Warszawa, Polska

Conference date

Streszczenie W referacie plakatowym zaprezentowano budowany wielomodalny system weryfikacji klienta bankowego, którego pierwszoplanowym celem jest pozyskanie praktycznej wiedzy na temat efektywności i akceptowalności technologii identyfikacji biometrycznej, wykorzystujących m. in. oryginalne propozycje oparte na wielowymiarowej analizie dynamicznego podpisu elektronicznego, składanego z pomocą specjalnie oczujnikowanego długopisu, nowych zastosowań fotogrametrii laserowej, a także implementacji znanych metod weryfikacji tożsamości, takich, jak: biometria głosu, analiza w podczerwieni rozkładu naczyń żylnych w dłoniach (Hand Vain), rozpoznawanie twarzy w obrazie wizyjnym. Prezentacja plakatowa zawiera fotografie prototypowego stanowiska do wielomodalnej weryfikacji tożsamości wraz z jego opisem, wykazem funkcjonalności i uzyskanymi parametrami. Opracowane stanowisko zostanie powielone w 100 egzemplarzach i w tej formie posłuży do utworzenia laboratoriom eksperymentalnej biometrii rozproszonego pomiędzy 60 oddziałów bankowych, którego utworzenie jest przewidziane w celu zebrania za pośrednictwem centralnego serwera wyników eksperymentalnych przy udziale ok. 10.000 klientów oraz przeprowadzenia analizy poprzedzającej planowane wdrożenie systemów biometrycznych w największym polskim banku.

Entry No. 765

Entry type conference paper

Authors A. Czyżewski, M. Lech, P. Hoffmann, P. Bratoszewski

English title

Polish title Multimodalny biometryczny system weryfikacji tożsamości klienta bankowego

Conference XII KONFERENCJA NAUKOWA BIOMETRIA 2016

Preprint

Number

Volume

Pages

Conference site Wrszawa, Polska

Conference date

Notes Prezentacja konferencyjna

Streszczenie W prezentacji przedstawiono przegląd rozwiązań wykorzystywanych w bankach do weryfikacji tożsamości klientów. Ponadto zawarto opis metod biometrycznych aktualnie wykorzystywanych w placówkach bankowych wraz z odniesieniem do skuteczności i wygody korzystania z dostępnych rozwiązań. Zaproponowano rozszerzenie zakresu wykorzystania technologii biometrycznych, wskazując kierunek rozwoju systemów bezpieczeństwa dla poprawy dostępu do usług i zwiększenia bezpieczeństwa transakcji. Prezentacja zawiera informacje opisujące projekt IDENT, realizaowany w ramach Programu Badań Stosowanych NCBR, który ma na celu poprawę skuteczności weryfikacji klientów bankowych z użyciem technologii biometrycznych.

Entry No. 766

Entry type book

Authors A. Czyżewski, A. Korzeniewski, P. Odya, P. Szczuko

English title

Polish title METODY BADANIA ODDZIAŁYWANIA PRZYDROŻNYCH REKLAM NA KIEROWCÓW Z ZASTOSOWANIEM TECHNOLOGII MULTIMEDIALNEJ

Editor w druku, wydawca niznany

Pages

Notes rozdział w książce: NOWOCZESNE TECHNOLOGIE NA RZECZ BEZPIECZEŃSTWA

Streszczenie Istotnym problemem z punktu widzenia bezpieczeństwa ruchu drogowego jest właściwa lokalizacja reklam statycznych i dynamicznych w otoczeniu pasa drogowego. Celem niniejszej publikacji, nakierowanym na wspomaganie rozwiązywania wynikających z tego tytułu problemów, jest przedstawienie zakresu możliwych do wykonania, szeroko zakrojonych, wielopłaszczyznowych badań, wykorzystują- cych nowoczesne rozwiązania technologiczne, pozwalające na obiektywną ocenę zagrożeń wynikających z obecności reklam w obrębie dróg. Jako narzędzia badawcze zostały zaproponowane systemy śledzenia reakcji kierowców, oparte na wykorzystaniu zaawansowanej technologii multimedialnej. Systemy te mogą zostać zintegrowane w rzeczywistym pojeździe, umożliwiając badania w warunkach rzeczywistych lub jako element rozbudowanego symulatora jazdy. Ponadto elementem proponowanych badań jest sprawdzenie opinii kierowców z użyciem ankietyzacji oraz analiza wypadkowości w ruchu drogowym odbywającym się w sąsiedztwie reklam drogowych.

Entry No. 767

Entry type conference paper

Authors B. Kostek, P. Szczuko, J. Kotus, M. Szczodrak, A. Czyżewski

English title Guitar String Sound Retrieved from Moving Pixels

Polish title Drgania struny gitary ekstrahowane z rejestracji obrazu

Conference 171st Acoust. Soc. of Amercia Meeting

Preprint

Number

Volume

Pages

Conference site Salt Lake City, USA

Conference date

Notes referat wybrany jako lay-paper, Apple News 5aMU5; http://acoustics.org/5amu5-guitar-string-sound-retrieved-from-moving-pixels-bozena-kostek/

Abstract The aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing the vibration image into to the sound recording. The sound reconstructed in this way is compared with the sound recorded synchronously with the reference measurement microphone.

Streszczenie Celem prezentowanych badań było opracowanie metody wizyjnej rejestracji i analizy drgań struny gitary z wykorzystaniem szybkich kamer i dedykowanych algorytmów przetwarzania obrazu. W referacie przedstawiono metodykę nagrań wizyjnych, mających na celu rejestrację wideo drgań struny gitary oraz ekstrakcję dźwięku z nagrań wideo. W dalszej kolejności przedstawiono analizy uzyskanych dźwięków gitar i porównano je z dźwiękiem nagranym synchronicznie za pomocą referencyjnego mikrofonu pomiarowego.

Entry No. 768

Entry type journal paper

Authors M. Szczodrak, A. Czyżewski, J. Kotus

English title Dynamic noise mapping in the city of Gdansk

Polish title

Journal Acta Acoustica Supplement

Volume 95

Number 1

Pages 66 - 66

Entry No. 769

Entry type journal paper

Authors J. Cichowski, K. Lisowski, P. Szczuko, A. Czyżewski

English title

Polish title Zdalny zintegrowany moduł nadzoru radiowo-wizyjnego

Journal Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

Volume

Number 8-9

Pages 1 - 6

Notes Abstrakt w czasopiśmie "Przegląd Telekomunikacyjny" , No. 8-9, .2015.Treść referatu na dołączonej do czasopisma płycie CD-R.

Streszczenie Przedstawiono prace koncepcyjne, badawcze oraz implementacyjne skoncentrowane na praktycznej realizacji sys-temu lokalizacji i śledzenia obiektów z wykorzystaniem kamer wizyjnych i identyfikacji radiowej. Zaproponowano metodę konkatenacji danych w celu zwiększenia dokładno-ści i skuteczności detekcji obiektów. Omówiono założenia projektowe oraz technologie opracowane w ramach rozwi-janego multimodalnego modułu nadzoru. Zaproponowano i przedyskutowano praktyczne zastosowania opisanego systemu.

Entry No. 770

Entry type conference paper

Authors P. Żwan, A. Czyżewski

English title Further developments of parameterization methods of audio stream analysis for security purposes

Polish title Dalszy rozwój metod parametryzacji zdarzeń dźwiękowych związanych z niebezpieczeństwem

Conference 126th Audio Enineering Society Convention

Preprint

Number

Volume

Pages

Conference site Monachium, Niemcy

Conference date

Abstract The paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses of sounds of a broken window, gunshot and scream are performed and parameterization methods are proposed and discussed. Moreover, a sound classification system based on the Support Vector Machines (SVM) algorithm is presented and its accuracy is discussed. The practical application of the system with the use of a monitoring station is shown. The plan of further experiments is presented and the conclusions are derived.

Streszczenie W artykule przedstawiono analizę parametrów służących do automatycznego rozpoznawania zdarzeń dźwiękowych związanych z niebezpieczeństwem. Zbadano skuteczność działania klasyfikacji przy pomocy algorytmu Maszyny Wektorów Wspierających. Przedstawiono praktyczne zastosowanie systemu w systemie monitoringu zdarzeń dźwiękowych jako części projektu INDECT.

Entry No. 771

Entry type journal paper

Authors Ł. Kulasek, B. Kunka, A. Czyżewski

English title Face recognition by humans with gaze-tracking system Cyber-Eye

Polish title Badanie rozpoznawania twarzy przez człowieka z wykorzystaniem systemu śledzenia fiksacji wzroku Cyber-Oko

Journal Elektronika

Volume 1

Number 2011

Pages

Bibliographic No. 9

Abstract In order to understand the way humans memorize and recognize faces, we conducted research experiments employing a group of 20 people using the previously prepared gaze-tracking system Cyber-Eye. Cyber-Eye’s dedicated software coupled with infrared diodes and a camera allow tracking fixation point on the screen. Several videos samples containing face images were presented each individual participating in the experiment. Those videos were made separately for the two different stages: face memorizing and face recognition. Then, Cyber-Eye system rendered them with heat maps that presented the position of fixation point at every moment. The analysis of the videos showed which face regions are significant in recognizing and memorizing faces and in which order they are processed. The results of this paper can help to improve face recognition algorithms running on machines.

Streszczenie W celu dokładniejszego zrozumienia sposobu rozpoznawania i zapamiętywania twarzy przez człowieka przeprowadzono doświadczenie na grupie 20 osób z wykorzystaniem wcześniej opracowanego systemu śledzenia fiksacji wzroku Cyber-Oko. Wykorzystując diody i kamerę podczerwieni wraz z dedykowanym oprogramowaniem Cyber-Oko, które pozwala na śledzenie punktu skupienia wzroku na ekranie. Każdej osobie biorącej udział w doświadczeniu pokazano plik filmowy zbudowany w oparciu o zdjęcia twarzy osób. Filmy wideo zostały przygotowane oddzielnie dla etapów rozpoznawania i zapamiętywania twarzy. Następnie system Cyber-Oko umożliwił połączenie ich z mapami ciepła przedstawiającymi pozycję skupienia wzroku w danej chwili. Analizując otrzymane w ten sposób filmy wideo udało się zaobserwować które regiony twarzy są znaczące przy rozpoznawaniu i zapamiętywaniu twarzy przez człowieka, oraz w jakiej kolejności są analizowane. Wyniki niniejszej pracy mogą pozwolić na ulepszenie algorytmów rozpoznawania twarzy.

Entry No. 772

Entry type journal paper

Authors P. Szczuko, B. Kostek, A. Czyżewski

English title Comparison between natural movements and automatically generated animated motion employing motion capture and fuzzy logic techniques

Polish title Porównanie pomiędzy naturalnym a generowanym automatycznie ruchem postaci z wykorzystaniem przechwytywania ruchu Motion Capture i logiki rozmytej

Journal Intelligent Automation and Soft Computing

Volume

Number

Pages

Notes W recenzjach

Abstract The paper describes a new method for automatic generation of animated motion with quality comparable to natural motion. First the reference motion data are gathered utilizing a motion capture system. Then these data are reduced and only main poses of the action are left. The resulting motion is simplified and its quality is considerably decreased. Then, utilizing the automatic motion enhancement system, ANIMATOR, a new version of the action is generated, based on input poses and subjective descriptors given by the user. Various degrees of motion fluency and naturalness are possible to achieve this way. The proposed algorithm of the animation enrichment is based on fuzzy description of motion parameters and motion subjective features. The first step consists in creating fuzzy rules for the algorithm using based on subjective evaluation of the animated movement. The second stage utilizes input descriptors for the new motion phases calculation, which are finally added to the animation. It is assumed that such processing increases naturalness and quality of motion, and this is verified by subjective evaluation tests. Finally a comparison between the original and the recreated motion is performed. Scores obtained in evaluation tests suggest that a substantial increase in quality between reduced and recreated versions is obtained, matching the original one. The method for motion enhancement is useful for automatic motion generation and can be paired with motion data reduction procedure for regaining naturalness. Moreover the reduced version can easily be edited in the ANIMATOR system, and in this way a new action can be created.

Streszczenie Przedstawiono nową metodę automatycznego generowania ruchu animowanej postaci o jakości zbliżonej do ruchu naturalnego uzyskiwanego metodami Motion Capture. W tym celu gromadzone są dane referencyjne, które są następnie parametryzowane i upraszczane do opisu wyłącznie kluczowych elementów ruchu. Utrata danych prowadzi to do spadku jakości, który jest ostatecznie korygowany poprzez zastosowanie przetwarzania rozmytego zaimplementowanego w formie systemu ANIMATOR. Udostępniane są użytkownikowi różne poziomy płytnności i naturalności ruchu. Jakość animacji uzyskiwanych z aplikacji porówywana jest z jakością referencyjnego ruchu oryginalnego. Potwierdzono znaczącą poprawę jakości i duże subiektywne podobieństwo do oryginału. Zredukowane wersje animacji (przed przetarzaniem rozmytym) dobrze nadają się do ręcznej edycji, co odróżnia ten system od typowych aplikacji rejestrujących ruch.

Entry No. 773

Entry type conference paper

Authors K. Łopatka, A. Czyżewski

English title Detection of dialogue in movie soundtrack for speech intelligibility enhancement

Polish title Detekcja dialogów w ścieżce dźwiękowej filmu dla potrzeb poprawy zrozumiałości mowy

Conference International Conference on Acoustics, Speech and Signal Processing

Preprint

Number

Volume

Pages

Conference site Florencja, Włochy

Conference date

Notes w recenzji

Abstract A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channels signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility. The techniques for reduction of artifacts in the processed signal are also introduced. The results of objective tests are provided, which prove that increased dialogue intelligibility is achieved with the aid of the proposed algorithm.

Streszczenie Przedstawiono metodę wykrywania dialogu w wielokanałowej ścieżce dźwiękowej filmu w formacie 5.1 w oparciu o analizę dysparycji kanałów. Sygnały w przednich kanałach (lewy, środkowy, prawy) są analizowane w dziedzinie widma. Wybrane składowe sygnału w kanale środkowym, które znacznie różnią się od składowych w kanałach bocznych, są identyfikowane jako dialog. Składowe związane z dialogami są wzmacniane w celu zwiększenia zrozumiałości dialogów. Przedstawiono również techniki redukcji artefaktów w przetworzonym sygnale. Omówiono wyniki testów obiektywnych, które potwierdzają, że proponowany algorytm pozwala na osiągnięcie znacznego wzrostu zrozumiałości dialogów.

Entry No. 774

Entry type conference paper

Authors M. Szczodrak, P. Dalka, A. Czyżewski

English title Performance evaluation of video object tracking algorithm in autonomous surveillance system

Polish title

Conference 2nd International Conference on Information Technology (ICIT) 2010

Preprint

Number

Volume

Pages 31 - 34

Conference site

Conference date

Notes Materiały w IEEE Xplore Digital Library

Abstract Results of a performance evaluation of a video object tracking algorithm are presented. The method of moving object detection and tracking is based on background modelling with mixtures of Gaussian and Kalman filters. An emphasis is put on algorithm's efficiency with regards to its settings. Utilized methods of a performance evaluation based on a comparison of the algorithm output to manually prepared reference data are introduced. The experiments aimed at examining the performance achieved with various object detection algorithm parameter settings are presented and discussed.

Entry No. 775

Entry type

Authors A. Czyżewski, J. Cichowski, M. Lech

English title

Polish title Matryca do przetworników elektromechanicznych, zwłaszcza do analizy percepcji czuciowej dłoni

Notes ZGŁOSZENIE - WZÓR UŻYTKOWY

Abstract Matryca do przetworników elektromechanicznych, zwłaszcza do analizy percepcji czuciowej dłoni pozwalająca na obiektywną analizę percepcji czuciowej oraz interakcję z maszynami.

Entry No. 776

Entry type conference paper

Authors M. Szczodrak, P. Dalka, A. Czyżewski

English title Performance evaluation of video object tracking algorithm in autonomous surveillance system

Polish title

Conference Technologie Informacyjne 2010

Preprint

Number

Volume

Pages

Conference site

Conference date

Abstract Results of performance evaluation of a video object tracking algorithm are presented. The method of moving objects detection and tracking is based on background modelling with mixtures of Gaussians and Kalman filters. An emphasis is put on algorithm’s efficiency with regards to its settings. Utilized methods of performance evaluation based on comparison of algorithm output to manually prepared reference data are introduced. The experiments aimed at examining the performance achieved with various object detection algorithm parameter settings are presented and discussed.

Entry No. 777

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Style Transfer for Detecting Vehicles with Thermal Camera

Polish title

Conference Signal Processing: Algorithms, Architectures, Arrangements, and Applications 2019

Preprint

Number

Volume

Pages

Conference site Poznań, Polska

Conference date

Abstract In this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images from thermal cameras has significantly improved. As a side effect, we noticed that Style Transfer can be also used to improve detection accuracy from standard RGB camera, which has potential for various applications.

Entry No. 778

Entry type conference paper

Authors B. Kostek, A. Czyżewski

English title Music Archive Metadata Processing Based on Flow Graphs

Polish title Przetwarzanie meta opisu plików muzycznych z zastosowaniem grafów przepływowych

Conference 116th Audio Engineerig Society

Preprint 6138

Number

Volume

Pages 1 - 7

Conference site

Conference date

Entry No. 779

Entry type journal paper

Authors A. Czyżewski, J. Kotus, G. Szwoch

English title Estimating Trac Intensity Employing Passive Acoustic Radar and Enhanced Microwave Doppler Radar Sensor

Polish title

Journal Remote Sensing

Volume

Number

Pages

Notes nr rekordu w MojaPG 150847

Abstract Innovative road signs that can autonomously display the speed limit in cases where the trac situation requires it are under development. The autonomous road sign contains many types of sensors, of which the subject of interest in this article is the Doppler sensor that we have improved and the constructed and calibrated acoustic probe. An algorithm for performing vehicle detection and tracking, as well as vehicle speed measurement, in a signal acquired with a continuous wave Doppler sensor, is discussed. A method is also experimentally presented and studied for counting vehicles and for determining their movement direction by means of acoustic vector sensor application. The assumptions of the method employing spatial distribution of sound intensity determined with the help of an integrated three-dimensional (3D) sound intensity probe are discussed. The enhanced Doppler radar and the developed sound intensity probe were used for the experiments that are described and analyzed in the paper.

Entry No. 780

Entry type

Authors A. Czyżewski

English title

Polish title Znak towarowy CyberOko

Abstract W 2014 r. zgłoszono do Urzędu Patentowego RP znak literowo-graficzny CyberOko

Entry No. 781

Entry type journal paper

Authors K. Lisowski, A. Czyżewski

English title Modeling of temporal dependencies between cameras in multi-camera surveillance systems

Polish title

Journal Pattern Recognition Letters

Volume

Number

Pages

Notes (w recenzji)

Abstract A method of modeling the time of transition between given pairs of the cameras based on the Gaussian Mixture Model (GMM) is proposed in the paper. The probabilistic model was obtained through matching the GMM to the histogram of transition times between particular pair of cameras. The matching procedure utilizes Expectation Maximization (EM) algorithm and modified particle swarm optimization (mPSO). The way of using models of transition time in object re-identification is also presented. Experiments with proposed methods of modeling the transition time were carried out, and a comparison of them is also presented.

Entry No. 782

Entry type journal paper

Authors D. Grabowski, M. Szczodrak, A. Czyżewski

English title Economical methods for measuring road surface roughness

Polish title

Journal Metrology and Measurement Systems

Volume 25

Number 3

Pages 533 - 549

Abstract Two low-cost methods of estimating the road surface condition are presented in the paper, the first one based on the use of accelerometers and the other on the analysis of images acquired from cameras installed in a vehicle. In the first method, miniature positioning and accelerometer sensors are used for evaluation of the road surface roughness. The device designed for installation in vehicles is composed of a GPS receiver and a multi-axis accelerometer. The measurement data were collected from recorded ride sessions taken place on diversified road surface roughness conditions and at varied vehicle speeds on each of examined road sections. The data were gathered for various vehicle body types and afterwards successful attempts were made in constructing the road surface classification employing the created algorithm. In turn, in the video method, a set of algorithms processing images from a depth camera and RGB cameras were created. A representative sample of the material to be analysed was obtained and a neural network model for classification of road defects was trained. The research has shown high effectiveness of applying the digital image processing to rejection of images of undamaged surface, exceeding 80%. Average effectiveness of identification of road defects amounted to 70%. The paper presents the methods of collecting and processing the data related to surface damage as well as the results of analyses and conclusions.

Entry No. 783

Entry type journal paper

Authors A. Czyżewski, A. Kwiatkowska, P. Odya

English title Testing reading skills of people with consciousness disorders employing gaze tracking

Polish title

Journal Journal of Communication Disorders

Volume

Number

Pages

Notes w recenzji

Abstract The opportunity to assess cognitive functions of people with consciousness disorders employing the preserved functions of gaze fixation is analysed in the paper. The Alexia test is a proprietary research tool prepared for the needs of research employing human-system interaction. The part of the organized tests related to Alexia (the term meaning deep difficulties of reading) is presented in this paper. 50 people (15 women and 35 men) who had been awakened from coma after craniocerebral trauma or hypoxia of the brain were recruited to participate in the study. People with neurogenic vision disorders were eliminated at the first stage of qualification. The method of data acquisition and analysis together with achieved results of the experimental study are discussed and general conclusions on the ability of reading in coma patients were drawn. The obtained results proved that people with reduced consciousness have got preserved global reading, especially for single words and simple sentences. They provide hints for further speech therapy and psychological therapy for people in this state.

Entry No. 784

Entry type conference paper

Authors S. Cygert, A. Czyżewski

English title Vehicle detector training with labels derived from background subtraction algorithms in video surveillance

Polish title

Conference SPA 2018 Signal Processing algorithms, architectures, arrangements, and applications

Preprint

Number

Volume

Pages 98 - 103

Conference site Poznań, Polska

Conference date

Notes https://ieeexplore.ieee.org/document/8563368

Abstract Vehicle detection in video from a miniature station- ary closed-circuit television (CCTV) camera is discussed in the paper. The camera provides one of components of the intelligent road sign developed in the project concerning the traffic control with the use of autonomous devices being developed. Modern Convolutional Neural Network (CNN) based detectors need big data input, usually demanding their manual labeling. In the presented research approach the weakly-supervised learning paradigm is used for the training of a CNN based detector em- ploying labels obtained automatically through an application of video background subtraction algorithm. The proposed method is evaluated on GRAM-RTM dataset and a CNN fine-tuned with labels from the background subtraction algorithm. Even though obtained representation in the form of labels may include many false positives and negatives, a reliable vehicle detector was trained employing them. The results are presented showing that such a method can be applied to traffic surveillance systems.