Research Overview

Our research spans multiple areas of language technology, from speech recognition to natural language processing and machine learning. Here is an overview of our current research focus areas.

Speech Technology

Developing automatic speech recognition, text-to-speech synthesis, and speaker identification systems for Yoruba language and its dialects.

Current Projects

  • Building ASR models for Yoruba with support for multiple dialects
  • Developing natural-sounding TTS systems using neural networks
  • Creating speaker identification systems for forensic and security applications

Key Datasets

Speech Recognition Dataset (50K+ hours)TTS Voice Dataset

Natural Language Processing

Building NLP tools for Yoruba dialects including machine translation, sentiment analysis, and named entity recognition.

Current Projects

  • Developing Yoruba-English machine translation systems
  • Creating named entity recognition systems for information extraction

Key Datasets

Text Corpus (2M+ samples)Parallel Translation (500K+ pairs)

Machine Learning

Developing models and frameworks specifically designed for Yoruba dialects and low-resource languages.

Current Projects

  • Adapting multilingual models for Yoruba using transfer learning
  • Optimizing models for resource-constrained environments
  • Developing few-shot learning techniques for low-resource scenarios

Publications & Papers

Check back soon for our latest publications, or contact us to learn more about our research.