Jorge Abraham Martinez Castaneda
32 records found
1
Reverberation is a key aspect when designing the interior of buildings, and must be carefully considered in the context of the function of the room. Defined by the reverberation time (RT), it is known to have a big influence on the intelligibility and quality of audio in closed s
...
Estimating reverberation time (RT60) accurately is crucial for enhancing the acoustic quality of various environments as it decides how you feel the sound fades away subjectively. Traditional methods, such as Sabine's equation, require extensive prior knowledge and assume ideal c
...
In building design, it is important to consider certain materials for certain acoustical properties. Specifically, the time it takes for an audio signal to decrease in volume by 60 dB is important. This can be estimated with Sabine's and Eyring's formula's, which both make use of
...
Evaluation of Perceptual Accuracy in Simulated Room Impulse Responses
Designing and Implementing a Subjective Testing Methodology for the Perceptual Evaluation of Simulated Room Impulse Responses
The accurate simulation of Room Impulse Responses (RIRs) is important in a variety of applications in acoustics such as automatic speech recognition, speech enhancement, and architectural acoustic design. While objective metrics for evaluating RIRs have been researched extensivel
...
Automatic Speech Recognition (ASR) systems are becoming increasingly popular in this day and age. Unfortunately, due to inherent biases within these systems, performance disparities exist among specific demographic groups. Bias metrics can be used to measure this bias. Within ASR
...
Exploring the Relationship Between Bias and Speech Acoustics in Automatic Speech Recognition Systems
An Experimental Investigation Using Acoustic Embeddings and Bias Metrics on a Dataset of Spoken Dutch
Automatic Speech Recognition (ASR) systems have become an integral part of daily lives. Despite their widespread use, these systems can exhibit biases that express themselves in the differences in their accuracy and performance across different demographic groups. Methods quantif
...
Dutch State-of-the-art Automatic Speech Recognition (ASR) systems do not perform equally well for different speaker groups. Existing metrics to quantify this bias rely on demographic metadata, which is often unavailable. Recent advances in the field use machine learning to find g
...
How to measure bias in automatic speech recognition systems?
A bias metric without a reference group
This paper presents a novel approach to measuring bias in Automatic Speech Recognition (ASR) systems by proposing a metric that does not use the conventional approach of a reference group. Current methods typically measure bias through comparison with a ’norm’ or minimum error gr
...
Automated Processing of scanned historic watermarks
A Comparison of Feature Extraction Techniques for Binarized Content-Based Image Retrieval
Feature extraction techniques for content-based image retrieval are explored, focusing on black-and-white images in the context of historical watermarks. Orthogonal moments and texture features are found to be most applicable. Seven methods are evaluated: four different orthogona
...
Pre-Trained Models on Scanned Historic Watermarks
A Comparative Analysis Exploring Pre-Trained Models on Scanned Historic Watermarks
This paper tackles the problem of evaluating the task of finding similar scanned historical watermarks - small images embedded in historical paper that have been digitized to be processed on a computer - using pre-trained neural networks. This research aims to identify an efficie
...
Binarization of Historical Watermarks
A Review of Thresholding Techniques Applied to Historical Watermark Images
A watermark image is a scan of a historical paper document that contains a watermark, which is a motif embedded in the paper that provides valuable information on the origins of a document. Developing tools to automatically identify watermarks can make this information more acces
...
Curve Reconstruction and Approximation in Binarised Scanned Historic Watermark Images
A Study of Techniques Aiding Binarisation for an Automated Watermark Similarity-matching Pipeline
A curve is a continuously bending line with no angles that can be found anywhere in the real world, forming shapes and outlines. They are also the building blocks of historic watermarks, imprinted images on paper that may be used to identify its manufacturers. Their shapes consis
...
Text Removal Using Wavelet Transform and Morphological Operations
An Approach for the Removal of Text and Ink Artifacts from Historical Watermark Images
Watermarks have an essential role in identifying the origins and age of specific documents. However, this is often a laborious process. One of the main issues in automatic watermark segmentation is the presence of text that obstructs it, making it difficult to properly reconstruc
...
In the task of music style transfer, the symbolic music representation based on Musical Instrument Digital Interface (MIDI) files has always been a popular research medium. By using such representation, some mature models for image style transfer can also be applied to this scena
...
In today’s world, accurate location sensing is impossible to think away. One of the most prominent and most used techniques for determining location is GPS. In the outside world, GPS is capable of pinpointing a location with only a few meters error. But inside buildings, GPS ofte
...
The work presented in this thesis investigates the creation of virtual sound sources in a room equipped with a limited number of loudspeakers. This limited number of loudspeakers is typical for consumer loudspeaker systems. Ideally, these systems can provide a listening experienc
...
Low complexity crosstalk cancellation algorithm for consumer audio systems
Optimizing crosstalk cancellation from a human sound perception perspective
Over the past decade, spatial audio awareness evolved into an in-demand feature in audio entertainment. The addition of sound source locations to, for instance, movies or music adds a level of auditory envelopment and spatial awareness to the audio experience. Expensive setups pr
...
The placement of a subwoofer has significant impact on the quality of its sound reproduction. This paper present a method to optimise subwoofer placement in a room. Resonances caused by room boundaries (called "room modes") cause localised peaks and lows for specific lower freque
...
A novel approach for determining the optimal location of a sound source within an acoustic environment is proposed. This approach involves the application of Importance Sampling to improve the efficiency of the existing method of acoustic ray-tracing for finding the frequency res
...
Technology is playing an increasingly important role in education, partially thanks to emerging teaching methods such as hybrid education. Smart classrooms equipped with technology can make the lives of the educators and students easier, and aid in the switch to hybrid education.
...