Tags

grpc

Building Fast and Efficient Microservices with gRPC

Skit.ai processes millions of speech recognition requests every day, and to handle such a load we have focused on building a highly s...

In Engineering, Feb 05, 2020

load balancing

Building Fast and Efficient Microservices with gRPC

Skit.ai processes millions of speech recognition requests every day, and to handle such a load we have focused on building a highly s...

In Engineering, Feb 05, 2020

microservices

Building Fast and Efficient Microservices with gRPC

Skit.ai processes millions of speech recognition requests every day, and to handle such a load we have focused on building a highly s...

In Engineering, Feb 05, 2020

interservice communication

Building Fast and Efficient Microservices with gRPC

Skit.ai processes millions of speech recognition requests every day, and to handle such a load we have focused on building a highly s...

In Engineering, Feb 05, 2020

diarization

Speaker Diarization

This blog post is based on the work done by Anirudh Dagar as an intern at Skit.ai

In Machine Learning, Jul 21, 2020

classification

Bad Audio Detection

This blog will be a short one, where we’ll talk about our approach on filtering out inscrutable audios from VASR.

In Machine Learning, Jul 29, 2020

work

Google Summer of Code, 2022

Google Summer of Code - 2022

In Machine Learning, Feb 18, 2022

Reading Sessions

Studying researches and building on top of them is an important part of what a team of ML Engineers do on a regular basis. Usually, t...

In Machine Learning, Nov 30, 2020

Interspeech 2020

We recently attended the all remote Interspeech 2020. Each of us made notes on what they did overall. But instead of posting those or...

In Machine Learning, Dec 01, 2020

conference

EMNLP 2020

Individual summary notes from EMNLP 2020.

In Machine Learning, Dec 21, 2020

Interspeech 2020

We recently attended the all remote Interspeech 2020. Each of us made notes on what they did overall. But instead of posting those or...

In Machine Learning, Dec 01, 2020

normalizing-flows

Normalizing Flows - Part 2

In Part-1, we introduced the concept of normalizing flows. Here, we discuss the different types of normalizing flows. In most blogs t...

In Machine Learning, May 08, 2021

Normalizing Flows - Part 1

Normalizing flows, popularized by (Rezende, & Mohamed, 2015), are techniques used in machine learning to transform simple probabi...

In Machine Learning, Dec 19, 2020

tts

Seminar - Code Mixing in NLP and Speech

Below are some pointers and insights from the papers that we covered in the recently concluded seminar on Code-mixing in NLP and Spee...

In Machine Learning, Aug 24, 2021

Code Mixing Metrics

We at skit, recently concluded a seminar series on code-mixing, where we covered research papers that looked at approaches to deal wi...

In Machine Learning, Aug 09, 2021

Normalizing Flows - Part 2

In Part-1, we introduced the concept of normalizing flows. Here, we discuss the different types of normalizing flows. In most blogs t...

In Machine Learning, May 08, 2021

Normalizing Flows - Part 1

Normalizing flows, popularized by (Rezende, & Mohamed, 2015), are techniques used in machine learning to transform simple probabi...

In Machine Learning, Dec 19, 2020

speech recognition

What's New in Kaldi-Serve 1.0

Kaldi-Serve is our open source high performance Speech Recognition server framework capable of serving Kaldi ASR models in production...

In Machine Learning, Mar 25, 2021

framework

What's New in Kaldi-Serve 1.0

Kaldi-Serve is our open source high performance Speech Recognition server framework capable of serving Kaldi ASR models in production...

In Machine Learning, Mar 25, 2021

new release

What's New in Kaldi-Serve 1.0

Kaldi-Serve is our open source high performance Speech Recognition server framework capable of serving Kaldi ASR models in production...

In Machine Learning, Mar 25, 2021

code-mixing

Seminar - Code Mixing in NLP and Speech

Below are some pointers and insights from the papers that we covered in the recently concluded seminar on Code-mixing in NLP and Spee...

In Machine Learning, Aug 24, 2021

Code Mixing Metrics

We at skit, recently concluded a seminar series on code-mixing, where we covered research papers that looked at approaches to deal wi...

In Machine Learning, Aug 09, 2021

asr

Seminar - Code Mixing in NLP and Speech

Below are some pointers and insights from the papers that we covered in the recently concluded seminar on Code-mixing in NLP and Spee...

In Machine Learning, Aug 24, 2021

Code Mixing Metrics

We at skit, recently concluded a seminar series on code-mixing, where we covered research papers that looked at approaches to deal wi...

In Machine Learning, Aug 09, 2021

authentication

Authentication in gRPC

In gRPC, there are a number of ways you can add authentication between client and server. It is handled via Credentials Objects.

In Engineering, Oct 31, 2021

gRPC

Authentication in gRPC

In gRPC, there are a number of ways you can add authentication between client and server. It is handled via Credentials Objects.

In Engineering, Oct 31, 2021

ASR

Speaker Entrainment

In this post, we will discuss the phenomenon of speaker entrainment and the insights we gained when designing a voice-bot that entrai...

In Machine Learning, Feb 04, 2022

Evaluating an ASR in a Spoken Dialogue System

An ASR (automatic speech recognition) is an integral component of any voice bot. The most popular metric that is used to evaluate the...

In Machine Learning, Jan 21, 2022

On using ASR Alternatives for a Better SLU

This blog discusses some concepts from the recently published paper by members of the ML team at Skit (formerly Vernacular.ai). The p...

In Machine Learning, Nov 29, 2021

SLU

On using ASR Alternatives for a Better SLU

This blog discusses some concepts from the recently published paper by members of the ML team at Skit (formerly Vernacular.ai). The p...

In Machine Learning, Nov 29, 2021

WER

Evaluating an ASR in a Spoken Dialogue System

An ASR (automatic speech recognition) is an integral component of any voice bot. The most popular metric that is used to evaluate the...

In Machine Learning, Jan 21, 2022

sticky

Evaluating an ASR in a Spoken Dialogue System

An ASR (automatic speech recognition) is an integral component of any voice bot. The most popular metric that is used to evaluate the...

In Machine Learning, Jan 21, 2022

speech

Speech LLMs for Conversations

With LLMs making conversational systems has become easier. You no longer need to focus on the low-level details of categorizing seman...

In Machine Learning, May 09, 2024

Speech-First Conversational AI Revisited

Around last year, we shared our views on how nuances of spoken conversations make voicebots different than chatbots. With the recent ...

In Machine Learning, May 11, 2023

Speech-First Conversational AI

We often get asked about the differences between voice and chat bots. The most common perception is that the voice bot problem can be...

In Machine Learning, Feb 02, 2022

Turn-taking

Turn Taking Dynamics in Voice Bots

One of the challenges in building an interactive voice bots is accounting for turn taking behaviour. Turn-taking is a difficult probl...

In Machine Learning, Mar 07, 2022

barge-in

Turn Taking Dynamics in Voice Bots

One of the challenges in building an interactive voice bots is accounting for turn taking behaviour. Turn-taking is a difficult probl...

In Machine Learning, Mar 07, 2022

duplex conversations

Turn Taking Dynamics in Voice Bots

One of the challenges in building an interactive voice bots is accounting for turn taking behaviour. Turn-taking is a difficult probl...

In Machine Learning, Mar 07, 2022

TTS

TTS Enhancement

Problem Statement

In Machine Learning, Mar 09, 2022

speech-enhancement

TTS Enhancement

Problem Statement

In Machine Learning, Mar 09, 2022

end-of-utterance

End of Utterance Detection

This blog post is based on the work done by Anirudh Thatipelli as an ML research fellow at Skit.ai

In Machine Learning, Apr 24, 2022

turn-taking

End of Utterance Detection

This blog post is based on the work done by Anirudh Thatipelli as an ML research fellow at Skit.ai

In Machine Learning, Apr 24, 2022

conversational ai

Theory of Mind and Implications for Conversational AI

When a diplomat says yes, he means ‘perhaps’; When he says perhaps, he means ‘no’; When he says no, he is not a diplomat.

In Machine Learning, Theory of Mind, May 19, 2022

voicebot

Theory of Mind and Implications for Conversational AI

When a diplomat says yes, he means ‘perhaps’; When he says perhaps, he means ‘no’; When he says no, he is not a diplomat.

In Machine Learning, Theory of Mind, May 19, 2022

chatbot

Theory of Mind and Implications for Conversational AI

When a diplomat says yes, he means ‘perhaps’; When he says perhaps, he means ‘no’; When he says no, he is not a diplomat.

In Machine Learning, Theory of Mind, May 19, 2022

voice assistant

Theory of Mind and Implications for Conversational AI

When a diplomat says yes, he means ‘perhaps’; When he says perhaps, he means ‘no’; When he says no, he is not a diplomat.

In Machine Learning, Theory of Mind, May 19, 2022

label-noise

slu

Improving consumer verification using confidence calibration and thresholding

In the past year, our team’s current focus has shifted to building robust and scalable voice-bots for US companies. In particular, we...

In Machine Learning, Jan 09, 2024

Incorporating context to improve SLU

Introduction In task-oriented dialogue systems, the spoken language understanding, or SLU, refers to the task of parsing the natural ...

In Machine Learning, Aug 04, 2022

context

Incorporating context to improve SLU

Introduction In task-oriented dialogue systems, the spoken language understanding, or SLU, refers to the task of parsing the natural ...

In Machine Learning, Aug 04, 2022

nlp

Improving consumer verification using confidence calibration and thresholding

In the past year, our team’s current focus has shifted to building robust and scalable voice-bots for US companies. In particular, we...

In Machine Learning, Jan 09, 2024

Incorporating context to improve SLU

Introduction In task-oriented dialogue systems, the spoken language understanding, or SLU, refers to the task of parsing the natural ...

In Machine Learning, Aug 04, 2022

llm

Speech LLMs for Conversations

With LLMs making conversational systems has become easier. You no longer need to focus on the low-level details of categorizing seman...

In Machine Learning, May 09, 2024

Speech-First Conversational AI Revisited

Around last year, we shared our views on how nuances of spoken conversations make voicebots different than chatbots. With the recent ...

In Machine Learning, May 11, 2023

compliance

Improving consumer verification using confidence calibration and thresholding

In the past year, our team’s current focus has shifted to building robust and scalable voice-bots for US companies. In particular, we...

In Machine Learning, Jan 09, 2024

conversations

Speech LLMs for Conversations

With LLMs making conversational systems has become easier. You no longer need to focus on the low-level details of categorizing seman...

In Machine Learning, May 09, 2024