voice-ai react

This is a React application that allows users to engage in voice-based conversations with AI personalities. Users can create their own distinct AI personality and experience real-time, interactive conversations with AI.

Voice-based conversations with AI
Customizable AI personality prompt
Real-time, interactive communication
Responsive and user-friendly interface
Compatible with various devices and screen sizes

[Video: 2023-05-11.13-11-11.mp4]

Prerequisites

Ensure that you have the following prerequisites installed and set up on your system:

Node.js: The application is built using Node.js, so you need to have it installed on your machine. You can download the latest version of Node.js from the official website.
Amazon Polly (AWS) credentials: The voice output feature of the application uses Amazon Polly, the AWS text-to-speech service. Sign up for an AWS account, create an IAM user with permission to use Polly, and add that user's access key and secret key to the .env file of the application. Refer to the official Amazon Polly documentation for more information.
OpenAI API Key: The application requires an OpenAI API key for AI integration (ChatGPT and Whisper transcription). Sign up for an OpenAI account and obtain an API key from the OpenAI Developer Dashboard. Once you have your API key, add it to the .env file of the application.

Getting Started

# Clone repository
git clone https://github.com/darrylschaefer/voice-ai-react

# Change directory
cd voice-ai-react

# Install dependencies
npm install

# Add API keys to .env in root folder
AMAZON_AWS_POLLY_ACCESS_KEY=
AMAZON_AWS_POLLY_SECRET_KEY=
OPENAI_API_KEY=

# Build app
npm run build

# Start app
npm start

# Open client
Open your web browser and go to http://localhost:3000
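
Before building, it can help to confirm that all three keys from the .env step are actually set. The following is a minimal sketch and not part of the repository: the helper name `findMissingKeys` is hypothetical, and it assumes the values from .env are already available in an object such as `process.env`.

```javascript
// Hypothetical helper (not from the repository): report which of the
// required keys are missing or blank in an environment-like object.
const REQUIRED_KEYS = [
  "AMAZON_AWS_POLLY_ACCESS_KEY",
  "AMAZON_AWS_POLLY_SECRET_KEY",
  "OPENAI_API_KEY",
];

function findMissingKeys(env) {
  return REQUIRED_KEYS.filter((key) => {
    const value = env[key];
    // Treat undefined, empty, and whitespace-only values as missing.
    return typeof value !== "string" || value.trim() === "";
  });
}

// Example usage with a plain object standing in for process.env.
const missing = findMissingKeys({
  AMAZON_AWS_POLLY_ACCESS_KEY: "example-access-key",
  AMAZON_AWS_POLLY_SECRET_KEY: "",
});
console.log(missing); // [ 'AMAZON_AWS_POLLY_SECRET_KEY', 'OPENAI_API_KEY' ]
```

In a real setup, passing `process.env` to the helper at startup would surface a blank key before the first API call fails.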

Getting Started with the Application

Configure API keys: Make sure your API keys are properly set up in the application.
Set your prompt (optional): Open the prompt dialog box to modify the default prompt.
Initiate a session: Type your message in the text input and press Enter, or click the microphone icon to send a voice message.
Understand the three phases:
- User Input Phase: The microphone is ready. Once you begin talking, it captures your voice input and sends it to the AI for processing.
- AI Processing Phase: Your recording is being handled by the APIs.
- AI Playing Phase: The generated response is played back to you.
Engage with the AI: By default, the application automatically cycles through the phases. Be ready to respond to the AI when it's your turn.
Control the pace (optional): If you need more time to respond, disable Automatic Detection and manually put the application into standby mode by clicking the microphone button when it's your turn.
Speak in complete sentences: For better transcription and understanding, use complete sentences when engaging with the AI.
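
The phase cycle described above can be pictured as a small state machine. This is an illustrative sketch only, not the application's code: the phase names and the `nextPhase` function are hypothetical, and the extra `waitingForClick` state is an assumption used to model what happens when Automatic Detection is turned off.

```javascript
// Hypothetical model of the phase cycle; all names are illustrative.
// After the AI finishes playing, the cycle returns to user input only if
// automatic detection is on. Otherwise it waits for a microphone click.
function nextPhase(phase, automaticDetection = true) {
  switch (phase) {
    case "userInput":
      return "aiProcessing"; // recording finished, send it to the APIs
    case "aiProcessing":
      return "aiPlaying"; // response generated, play it back
    case "aiPlaying":
      return automaticDetection ? "userInput" : "waitingForClick";
    case "waitingForClick":
      return "userInput"; // user clicked the microphone button
    default:
      throw new Error(`Unknown phase: ${phase}`);
  }
}

// Example usage: one full cycle with automatic detection enabled.
let phase = "userInput";
const visited = [phase];
for (let i = 0; i < 3; i++) {
  phase = nextPhase(phase);
  visited.push(phase);
}
console.log(visited.join(" -> "));
// userInput -> aiProcessing -> aiPlaying -> userInput
```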

Options Menu

The options menu, located below the console, offers the following settings:

Abandon Session: Reset the session
Voices: Change the AWS Polly voice
Personality Type: The type of personality that the AI will have.
Automatic Detection: Sets the microphone to standby after the AI finishes playing its audio response. If you turn this off, you must click the microphone button after the AI has spoken to set the microphone to standby and begin a recording.
Voice Threshold: This slider sets the minimum volume level needed to trigger a recording from standby mode. Adjust it based on background noise and experiment for best results.
Mic Pause Timer: This slider sets how long the volume must stay below the Voice Threshold before the user recording is completed automatically.
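
To make the interaction between Voice Threshold and Mic Pause Timer concrete, here is a minimal sketch of threshold-based recording over a list of volume readings. It is not the application's implementation: the function `detectRecording`, the sample shape `{ timeMs, volume }`, and the choice to treat a volume exactly equal to the threshold as quiet are all assumptions made for illustration.

```javascript
// Hypothetical sketch of threshold-based recording; not the app's code.
// `samples` is an array of { timeMs, volume } readings, `voiceThreshold` is
// the volume that must be exceeded to start recording, and `pauseMs` plays
// the role of the Mic Pause Timer.
function detectRecording(samples, voiceThreshold, pauseMs) {
  let startMs = null; // when the recording was triggered
  let quietSinceMs = null; // when the volume last dropped to or below the threshold

  for (const { timeMs, volume } of samples) {
    const isLoud = volume > voiceThreshold;
    if (startMs === null) {
      // Standby: wait for the volume to surpass the threshold.
      if (isLoud) startMs = timeMs;
      continue;
    }
    if (isLoud) {
      quietSinceMs = null; // speech resumed, reset the pause timer
    } else {
      if (quietSinceMs === null) quietSinceMs = timeMs;
      // Complete the recording once the quiet stretch lasts long enough.
      if (timeMs - quietSinceMs >= pauseMs) {
        return { startMs, endMs: timeMs };
      }
    }
  }
  return null; // no completed recording in these samples
}

// Example usage: a short dip at 300 ms does not end the recording,
// but the quiet stretch starting at 500 ms does.
const readings = [
  { timeMs: 0, volume: 0.1 },
  { timeMs: 100, volume: 0.6 },
  { timeMs: 200, volume: 0.7 },
  { timeMs: 300, volume: 0.2 },
  { timeMs: 400, volume: 0.8 },
  { timeMs: 500, volume: 0.1 },
  { timeMs: 600, volume: 0.1 },
  { timeMs: 700, volume: 0.1 },
  { timeMs: 800, volume: 0.1 },
];
const recording = detectRecording(readings, 0.5, 300);
console.log(recording); // { startMs: 100, endMs: 800 }
```

A longer pause value tolerates longer gaps between words before the recording is cut off, while a higher threshold makes it less likely that background noise starts a recording.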

Microphone Statuses:

Orange Border: Standby mode - awaiting voice to surpass the Voice Threshold and initiate recording.
Red Border: Recording in progress - triggered when the volume exceeds the Voice Threshold; stops when the volume stays below the threshold for the Mic Pause Timer duration.
Grey Border: Inactive - AI is currently processing the conversation.
White Border: Inactive - no session in progress.

