All Stories
Speaker Entrainment
In this post, we will discuss the phenomenon of speaker entrainment and the insights we gained when designing a voice-bot that entrai...
In Machine Learning, Feb 04, 2022
Speech-First Conversational AI
We often get asked about the differences between voice and chat bots. The most common perception is that the voice bot problem can be...
In Machine Learning, Feb 02, 2022
Evaluating an ASR in a Spoken Dialogue System
An ASR (automatic speech recognition) is an integral component of any voice bot. The most popular metric that is used to evaluate the...
In Machine Learning, Jan 21, 2022
Complexity of Conversations - I
Consider a restaurant booking voice bot built using a frames and slots approach. While this can easily solve the problem of booking w...
In Machine Learning, Jan 18, 2022
On using ASR Alternatives for a Better SLU
This blog discusses some concepts from the recently published paper by members of the ML team at Skit (formerly Vernacular.ai). The p...
In Machine Learning, Nov 29, 2021
Authentication in gRPC
In gRPC, there are a number of ways you can add authentication between client and server. It is handled via Credentials Objects.
In Engineering, Oct 31, 2021
Seminar - Code Mixing in NLP and Speech
Below are some pointers and insights from the papers that we covered in the recently concluded seminar on Code-mixing in NLP and Spee...
In Machine Learning, Aug 24, 2021
Code Mixing Metrics
We at Skit recently concluded a seminar series on code-mixing, where we covered research papers that looked at approaches to deal wi...
In Machine Learning, Aug 09, 2021
Normalizing Flows - Part 2
In Part-1, we introduced the concept of normalizing flows. Here, we discuss the different types of normalizing flows. In most blogs t...
In Machine Learning, May 08, 2021
What's New in Kaldi-Serve 1.0
Kaldi-Serve is our open-source, high-performance speech recognition server framework capable of serving Kaldi ASR models in production...
In Machine Learning, Mar 25, 2021