BoltAI Blog
BoltAI HomepageBlogDocumentation
  • BoltAI Blog
  • What is ChatGPT o1? Understanding Its Features, Benefits, and Uses
  • Claude 3.5 Sonnet vs GPT-4o: A Comprehensive Comparison
  • ChatGPT API Cost: Features, Plans, Pros and Cons
  • How to Use ChatGPT API: A Comprehensive Guide
  • Top AI Tools for Developers: Boost Productivity and Code Smarter
  • How to Run LLM Locally on Mac: A Step-by-Step Guide
  • How to Use ChatGPT as a Search Engine: A Complete Guide
  • ChatGPT vs Claude: Which AI Tool Fits Your Needs?
  • ChatGPT vs Gemini: Which AI Tool Is Right For You?
  • Perplexity vs. ChatGPT: Our In-Depth Comparison
  • How to Train ChatGPT on Your Own Data: Enhance AI Accuracy & Relevance
  • DeepSeek vs. ChatGPT: Which AI Model Is Right for You?
  • Exploring the Top 10 ChatGPT Alternatives for Better AI Conversations in 2025
  • Top 7 AI Tools for Students to Boost Productivity and Success in 2025
  • How to Get a ChatGPT API Key: Step-by-Step Guide
  • Tech Stack Analysis for a Cross-Platform Offline-First AI Chat Client
  • BoltAI Projects, DeepSeek support and more
  • A Developer’s Guide to Bard vs. ChatGPT for Coding
  • ChatGPT Keyboard Shortcuts for Mac: Enhance Your Workflow with Quick Commands
  • ChatGPT for Programmers: How to Boost Productivity and Efficiency
  • Here’s Our Step-by-Step Guide on How to Use Mistral 7B
  • Claude vs. ChatGPT for Coding: Which AI Assistant is Best for You?
  • Amazon Bedrock & xAI support, cache breakpoint and more
  • Advanced Voice Mode, Improved Document Analysis and more
  • How to use local Whisper instance in BoltAI
  • Optimize Ollama Models for BoltAI
  • How to use xAI in BoltAI?
  • How BoltAI handles your API keys
  • How to build an AI Coding Assistant with BoltAI
  • Best Black Friday Deals 2024 for Mac
  • A simple A/B testing setup with Simple Analytics
Powered by GitBook
On this page
  • Advanced Voice Mode
  • Better Voice UIs
  • Improved Document Analysis
  • Improved AI Command
  • Better Inline Whisper
  • And that's it for now 👋

Was this helpful?

Advanced Voice Mode, Improved Document Analysis and more

BoltAI November Update

PreviousAmazon Bedrock & xAI support, cache breakpoint and moreNextHow to use local Whisper instance in BoltAI

Last updated 5 months ago

Was this helpful?

First of all, Happy Thanksgiving! I wish you and your family a wonderful day filled with joy, gratitude, and delicious food.

The last couple of month has been productive for me. I’ve shipped a few useful features that you will love.

Let’s dive in!

TL;DR:

  • Advanced Voice Mode & better voice UIs

  • New in Document Analysis: native PDF capabilities, OCR support and more

  • Improved AI Command & better keyboard handling

  • Improved Inline Whisper with local inference support

Advanced Voice Mode

The most notable change since the last update is the new Advanced Voice Mode (AVM). It works similarly to the AVM in the official ChatGPT app. BoltAI utilizes OpenAI’s Realtime API to let you have real-time voice conversation with the GPT 4o model.

BoltAI supports both OpenAI and Azure OpenAI Service.

Better Voice UIs

I've made a few of improvements to the Voice Input UI:

  • Better audio visualization

  • Supports custom keyboard shortcut

  • Added the ability to cancel the Voice Input session

  • Retry if the transcription fails

The Read Aloud feature now streams and plays audio in real time. It's much faster and more robust.

Improved Document Analysis

Document Analysis with o1 model family

The OpenAI's new o1 model doesn't support custom System Instruction. In BoltAI, I put your custom system instruction and document content into the user prompt. Enjoy!

Claude's native PDF capabilities

Note that when native PDF capabilities option is enabled, Prompt Caching will be temporarily disabled.

OCR support

For PDFs with images and tables, you can "reprocess" the document using OCR for better accuracy. Currently, BoltAI only support OCR for PDF documents.

Improved AI Command

I listented to your feedback and added a few improvements to the AI Command feature:

New Behavior: Open in a new temporary window

When using an AI Command, you can choose to open it in a new temporary window. No need to clean up inline chats aftereward.

Improved performance & keyboard handling

  • You can now use control+n or control+p to navigate.

  • Improved input handling for Japanese input source

  • Improved AI Command performance, faster search & scrolling

Better Inline Whisper

The Inline Whisper feature allows you to use Whisper transcription within any textfield. Press a shortcut keyboard, speak and press the same keyboard shortcut again.

Now you can customize it even more: use a different AI provider, custom prompt or copy transcription to clipboard...

And that's it for now 👋

Happy Thanksgiving!

If you're using Claude Sonnet 3.5, you can use the new in BoltAI. Find the option in the right sidebar (screenshot below).

To use a local Whisper instance, follow .

native PDF capabilities
Read more...
this guide
Read more…