Daily Diary Transcription App

A full-stack web application that allows users to record audio diary entries, automatically transcribe them using OpenAI’s Whisper model, and conveniently review them through a calendar interface.

Project Overview

This application allows users to:

Register and log in securely using JWT authentication.
Record audio diary entries directly from their browser.
Automatically transcribe audio entries via an integrated Whisper AI model.
View and manage transcripts through an interactive calendar.

Technology Stack

Frontend: React 19.1, Bootstrap 5
Backend: Node.js (Express 5), Mongoose (MongoDB)
Database: MongoDB Atlas (Cloud)
Microservice: FastAPI (Python 3.12), Whisper model (Hugging Face Transformers)
Authentication: JWT, bcrypt (password hashing)
Audio Handling: mic-recorder-to-mp3 (Web Audio API)

Prerequisites & System Requirements

Operating System: Windows (special FFmpeg setup required), macOS, Linux
Node.js: Version 18.15.0
npm: Check using npm -v
Python: Version 3.12 (due to torchaudio compatibility)
MongoDB Atlas Account
FFmpeg: Essential for audio processing

Installation Steps

1. Clone Repository

git clone https://github.com/EmilHerzberg/AI-Dairy-Web-App.git
cd AI-Dairy-Web-App

2. Environment Variables

Create .env in the backend folder:

PORT=5000
MONGODB_URI=mongodb+srv://<DB_USER>:<DB_PASSWORD>@<cluster-url>/<DB_NAME>?retryWrites=true&w=majority
JWT_SECRET=randomLongString
TOKEN_EXPIRES_IN=3600

Do not commit .env files to GitHub.
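
Before starting the backend, it can help to confirm that the file defines every variable listed above. The following is a minimal Python sketch and is not part of the repository: the helper names parse_env and missing_keys are hypothetical, the path backend/.env is assumed from the step above, and the parser only handles simple KEY=VALUE lines.

```python
# Hypothetical helper, not part of the repository: checks that a .env file
# defines the four variables this README lists.
from pathlib import Path

REQUIRED_KEYS = ("PORT", "MONGODB_URI", "JWT_SECRET", "TOKEN_EXPIRES_IN")


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only: the MongoDB URI itself contains "=".
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(text: str) -> list[str]:
    """Return the required keys that are absent or empty."""
    values = parse_env(text)
    return [key for key in REQUIRED_KEYS if not values.get(key)]


if __name__ == "__main__":
    env_path = Path("backend/.env")  # assumed location, per the step above
    if env_path.exists():
        print("Missing keys:", missing_keys(env_path.read_text()) or "none")
    else:
        print(f"{env_path} not found")
```

Run it from the repository root. An empty result means all four variables are present; it does not check that their values are valid.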

3. Backend Setup

cd backend
npm install
node src/index.js

Backend listens on http://localhost:5000.

4. Frontend Setup

cd ../frontend
npm install
npm start

Frontend available at http://localhost:3000.

5. Whisper Microservice Setup

cd ../whisper_service
python -m venv .venv312

Activate virtual environment:

Windows: .venv312\Scripts\activate
macOS/Linux: source .venv312/bin/activate

Install dependencies:

pip install fastapi uvicorn torch torchaudio transformers pydub

Run the microservice:

uvicorn main:app --host 0.0.0.0 --port 8000

Service runs at http://localhost:8000.

FFmpeg Installation & Configuration (Windows)

Install Chocolatey by following the official Chocolatey installation guide.
Install FFmpeg:

choco install ffmpeg

Add FFmpeg to System Path (if not added automatically):

Open System Properties → Environment Variables
Edit Path → Add: C:\ProgramData\chocolatey\bin

Workaround if PATH issues persist: add these lines to the Python script (main.py):

import os
os.environ['FFMPEG_BINARY'] = r'C:\ProgramData\chocolatey\bin\ffmpeg.exe'
os.environ['FFPROBE_BINARY'] = r'C:\ProgramData\chocolatey\bin\ffprobe.exe'

macOS/Linux: FFmpeg is typically installed via Homebrew (brew install ffmpeg) or apt-get (sudo apt-get install ffmpeg).
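
To confirm that the installation worked on any platform, you can check whether ffmpeg and ffprobe resolve on PATH from the same virtual environment that runs the microservice. This is a small diagnostic sketch using only the Python standard library. The function names find_tool and ffmpeg_report are hypothetical and not part of the repository.

```python
# Hypothetical diagnostic, not part of the repository.
import shutil
import subprocess


def find_tool(name: str):
    """Return the full path of an executable on PATH, or None if absent."""
    return shutil.which(name)


def ffmpeg_report() -> dict:
    """Map each required tool to its resolved path (None when missing)."""
    return {tool: find_tool(tool) for tool in ("ffmpeg", "ffprobe")}


if __name__ == "__main__":
    for tool, path in ffmpeg_report().items():
        print(f"{tool}: {path or 'NOT FOUND on PATH'}")
    ffmpeg = find_tool("ffmpeg")
    if ffmpeg:
        # "ffmpeg -version" prints a build banner; show only its first line.
        result = subprocess.run([ffmpeg, "-version"], capture_output=True, text=True)
        print(result.stdout.splitlines()[0] if result.stdout else "no version output")
```

If either tool is reported as not found, reopen the terminal after editing PATH, since already running shells keep the old value.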

MongoDB Atlas Setup

Sign up at MongoDB Atlas.
Create a Cluster and Database.
Set Database Access: create user credentials.
In Network Access, whitelist the IP address you will use to connect.
Obtain the connection URI and set it as MONGODB_URI in the backend .env file:

mongodb+srv://<DB_USER>:<DB_PASSWORD>@<cluster-url>/<DB_NAME>?retryWrites=true&w=majority
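
Usernames and passwords that contain characters such as @, : or / must be percent-encoded before they go into the URI, otherwise the connection string is parsed incorrectly. The sketch below fills in the template with encoded credentials. The function name build_mongodb_uri and the sample values are hypothetical placeholders, not details from the repository.

```python
# Hypothetical helper, not part of the repository.
from urllib.parse import quote_plus


def build_mongodb_uri(user: str, password: str, cluster_url: str, db_name: str) -> str:
    """Fill in the URI template above, percent-encoding the credentials."""
    return (
        f"mongodb+srv://{quote_plus(user)}:{quote_plus(password)}"
        f"@{cluster_url}/{db_name}?retryWrites=true&w=majority"
    )


if __name__ == "__main__":
    # Placeholder values only; substitute your own Atlas details.
    print(build_mongodb_uri("diary_user", "p@ss:word/1", "cluster0.example.mongodb.net", "diary"))
```

Paste the printed value after MONGODB_URI= in the .env file.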

Testing the App

Ensure all components run simultaneously:

Backend at http://localhost:5000
Frontend at http://localhost:3000
Whisper service at http://localhost:8000
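
A quick way to check that all three services are accepting connections is to attempt a TCP connection to each port. The following is a rough sketch using the standard library; the name is_listening and the SERVICES table are illustrative, with the ports taken from the list above. A successful connection only shows that something is listening on the port, not that it is the expected service.

```python
# Hypothetical diagnostic, not part of the repository.
import socket

# Ports as listed above.
SERVICES = {"backend": 5000, "frontend": 3000, "whisper service": 8000}


def is_listening(port: int, host: str = "localhost", timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and resolution failures.
        return False


if __name__ == "__main__":
    for name, port in SERVICES.items():
        status = "up" if is_listening(port) else "not reachable"
        print(f"{name} (port {port}): {status}")
```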

Test the complete flow:

Register/Login
Record and upload audio
View transcribed entries in calendar

Troubleshooting & Common Issues

FFmpeg Not Found: Ensure FFmpeg is installed and on PATH, or use the Python workaround.
JWT Issues: Verify .env configuration (JWT_SECRET).
MongoDB Connection Issues: Confirm correct URI and IP whitelist.
Port Conflicts: Ensure ports 3000, 5000, and 8000 are free.
Python Compatibility: Check PyTorch and Torchaudio compatibility with Python 3.12.
CORS Issues: The backend is configured with cors(); verify the allowed origins.
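
When debugging JWT issues, it can be useful to look inside a token the backend issued. A JWT consists of three base64url segments (header, payload, signature) separated by dots, so the payload can be read without the secret. The sketch below decodes it for inspection only and does not verify the signature. The function name decode_jwt_payload is hypothetical, and which claims appear (for example an exp expiry time) depends on how the backend signs its tokens, which this README does not specify.

```python
# Hypothetical debugging helper, not part of the repository.
import base64
import json
import sys


def decode_jwt_payload(token: str) -> dict:
    """Decode the payload segment of a JWT WITHOUT verifying its signature."""
    parts = token.split(".")
    if len(parts) != 3:
        raise ValueError("a JWT has three dot-separated segments")
    payload = parts[1]
    # base64url segments omit padding; restore it before decoding.
    payload += "=" * (-len(payload) % 4)
    return json.loads(base64.urlsafe_b64decode(payload))


if __name__ == "__main__":
    if len(sys.argv) == 2:
        # Usage: python inspect_token.py <token copied from the browser>
        print(json.dumps(decode_jwt_payload(sys.argv[1]), indent=2))
    else:
        print("usage: pass a single JWT as the only argument")
```

Never use this for authentication decisions. It only shows what the token claims, not whether the token is genuine.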

Enjoy using the Daily Diary Transcription App!
