Changelog

November 9th, 2024

v1.27.0

NewFixed

The Document Analysis feature just got improved.

CleanShot 2024-11-09 at 11

  • New: Claude's native PDF capabilities
  • New: BoltAI supports OCR now. Right-click on a PDF and select "Reprocess with OCR"
  • New: Choose where to attach documents in the chat (attach on individual messages or to the System Prompt)
  • Fixed: Fixed the issue where BoltAI would not open a new chat when the sidebar is collapsed

Claude's native PDF capabilities

If you're using Claude Sonnet 3.5, you can use the new native PDF capabilities in BoltAI. Find the option in the right sidebar (screenshot below).

Note that when native PDF capabilities option is enabled, Prompt Caching will be temporarily disabled.

CleanShot 2024-11-07 at 23

OCR support

For PDFs with images and tables, you can "reprocess" the document using OCR for better accuracy. Currently, BoltAI only support OCR for PDF documents.

CleanShot 2024-11-09 at 21

More fine-grain settings for Document Analysis feature

You can choose to attach document content to individual messages or to the System Prompt.

  • When attaching to the System Prompt, you can also choose to apply Prompt Caching automatically.
  • When attaching to each individual message, Prompt Caching is temporarily disabled.

November 1st, 2024

v1.26.0

NewImproved

CleanShot 2024-11-01 at 01

  • New: Gemini models support native Google Search Tool.
  • Advanced Voice Mode supports more voices and with caching
  • Custom keyboard shortcuts for Voice Input & Advanced Voice Mode
  • Improved: Better Voice Input UI & error handling

What's new?

Grounding with Google Search

BoltAI now supports the Grounding with Google Search feature for Gemini Models. To enable it, open the plugin list and find the "Google Search (Gemini Only)".

Note that it might be more expensive than using the current Web Search plugins provided by BoltAI ($35 per 1K grounding request).

BoltAI doesn't support mode and dynamic_threshold settings in this release yet.

New realtime voices

Great news. You can now more realtime voices. It's also cheaper now as OpenAI applied prompt caching automatically.

Enjoy!

CleanShot 2024-11-01 at 17

Better Voice Input UI

I've made a few of improvements to the Voice Input UI:

  • Support custom keyboard shortcut
  • Added the ability to cancel the Voice Input session
  • Retry if the transcription fails

CleanShot 2024-11-01 at 17

And that's it for now 👋

Happy belated halloween 🎃


October 29th, 2024

v1.25.3

New
  • New AI Command Behavior: Open in a new temporary window
  • New: New chat window is floating by default
  • New: Open chat messages in browser

What's New?

New Behavior: Open in a new temporary window

When using an AI Command, you can choose to open it in a new temporary window. No need to clean up inline chats aftereward.

You can bulk edit the command's behavior to this one.

CleanShot 2024-10-29 at 18

Floating Chat Windows

A single chat window is floating by default. You can access it in any Space.

Open chat messages in browser

If you have a custom markdown viewer, you can use this to view the message in your favorite web browser.

CleanShot 2024-10-29 at 18

  • Thanks Adam, AR, Hans, Anıl and others for the feature suggestion.

October 28th, 2024

v1.25.2

ImprovedFixed

CleanShot 2024-10-28 at 15

  • Improved: Setting spell check & auto-correct options in the chat input field
  • Fixed: Inline Whisper not working for some users

October 25th, 2024

v1.25.0 (Advanced Voice Mode)

New

Advanced Voice Mode is here.

CleanShot 2024-10-25 at 01

  • New: Advanced Voice Mode using OpenAI's Realtime API.
  • New: New UI for the Voice Input
  • New: Read Aloud streams and plays audio in real time

What's new?

Advanced Voice mode

The long awaited Advanced Voice Mode (AVM) is released. You can now have a voice conversation with GPT-4o via Realtime API.

BoltAI supports both OpenAI and Azure OpenAI Service.

Learn more about AVM

New UI for Voice Input

Voice Input UI just got an upgrade. Much better than the previous version, right?

CleanShot 2024-10-26 at 02

Better Read Aloud

The Read Aloud feature now streams and plays audio in real time. It's much faster and more robust.

And that's it for now. Have a wonderful weekend 👋


October 16th, 2024

v1.24.0 (build 113)

NewImproved

This release focuses on improving the voice input & Inline Whisper experience.

CleanShot 2024-10-16 at 20

  • New: Use Whisper from Groq with the new model "whisper-large-v3-turbo". Go to Settings > Advanced > Voice Settings to configure.
  • New: Added suppport for custom Whisper prompt
  • New: Whisper AI plugin now supports custom host & Organization ID
  • New (Inline Whisper): Added the option to copy transcription to clipboard
  • Improved: You can configure the keyboard shortcut for toggling the chat configuration pane (Toggle Inspector)

October 13th, 2024

v1.23.0 (build 112)

NewImproved

boltai-doc-analysis-o1

  • New: The o1 series models now support Document Analysis
  • New: Reworked the chat input field. Fixed layout issues when pasting long prompts.
  • New: Estimate token & cost for Gemini models
  • Improved: Keyboard shortcut for toggling chat configuration pane (Toggle Inspector: cmd+I)
  • Fixed: Cannot set up custom OpenAI-compatible server
  • Fixed: Cannot use emojis in chat titles
  • Fixed: Fixed an issue where the app could hang when loading user's profile picture

What's new?

This release adds some great improvement.

You can use o1 series models for Document Analysis now.

The OpenAI's new o1 model doesn't support custom System Instruction. In this release, I put your custom system instruction and document content into the user prompt.

The chat input field got a huge upgrade. I switched to AppKit's NSTextView and this fixes all the layout issues when pasting long prompts. It's also a lot faster too!

Estimate costs for Gemini models. Gemini flash 8B is my new favorite small model. It's cheap and is quite capable for simple tasks such as summarization or grammar fixes...

In this release, I've added token & cost estimation for Gemini models.

CleanShot 2024-10-13 at 13

New BoltAI Workspace.

This is not related to mac app, but I've been working on the new BoltAI Workspace. It's the cloud version of BoltAI with Cloud Sync & other collaboration features.

It's not ready yet but you can join the waitlist here: https://app.youform.com/forms/kyxy1r54

CleanShot 2024-10-09 at 11

Have a nice weekend. See you in the next update 👋


September 27th, 2024

v1.22.0 (build 109)

NewImprovedFixed

CleanShot 2024-09-27 at 19

  • New: AI Command supports new variable {clipboard}, use content from your clipboard
  • New: Move chat configuration to the right pane
  • New: Custom prompt for generating chat title
  • New: Added support for *.tpl file
  • Improved: added cost estimates for o1 models
  • Improved: filter only chat models for TogetherAI
  • Fixed: cannot use o1 models from GitHub due to stream parameter
  • Fixed: search doesn’t highlight content within code blocks
  • Fixed: activate the chat input when typing / within a different textview
  • Fixed: cannot rename archived chats
  • Fixed: cannot clone / move inline chats
  • Fixed: Anthropic’s tool_use error when disabling plugins
  • Fixed: quotes and other symbols in chat titles

What's New?

This release adds many quality-of-life (QoL) improvements.

1. Move chat configuration to the right pane

The current Popover has an UX issue: it's hard to test the chat paramters. Going back-and-forth between the popover and the chat list is annoying.

I listened to your feedback and move it to the right pane (Inspector Pane). By default, it's collapsed so it won't get in your way.

2. New variable in AI Command

BoltAI now supports a new variable: {clipboard}. With this, you can choose to prompt on your clipboard content instead of the highlighted text (variable {input}).

Protip: if you don't use the placeholder {input}, BoltAI would ignore the highlighted text.

CleanShot 2024-09-27 at 20

3. Custom prompt for generating chat title

You can customize the prompt for generating chat title now. Go to Settings > General, click the gear button next to the option.

CleanShot 2024-09-27 at 20

4. Other bug fixes and improvements

I also improved the overall app in this release, especially bugs related to the new macOS Sequoia.

Let me know if it works well for you.

P.S: I'm still working hard on the new iOS app and Cloud Sync. Sign up for early access here: https://app.youform.com/forms/yplbdvx7

Annnnnd that's it for now 👋


September 19th, 2024

v1.21.0 "New Command"

NewImprovedFixed

CleanShot 2024-09-19 at 21

  • New: AI Command now supports AI Plugins (only works with the “Open in new window” behavior for now)
  • New: A new inline implementation: better compatibility & support cancellation.
  • New: Customize how you want the AI Command & AI Inline to work within an app. Go to Settings > Advanced > App Specific Settings.
  • Improved: Improve chat list performance (pagination)
  • Improved: Added support for custom keyboard layout like Dvorak
  • Fixed: OpenRouter integration fails with o1 models
  • Fixed: Main Window shows twice on macOS 15
  • Fixed: Error related to non-whitespace text with Claude

What's New:

The most notable change in this release is the new implementation for the AI Command feature.

  1. AI Command now support AI Plugins. Here is a demo video where I use the Instant Command on a video in Finder:
  1. New implementation. Previously, BoltAI uses the Accessibility C APIs to get the highlighted text on the active app. In this release, I've switched to a more modern APIs and it should be more robust.

  2. More advanced settings. In this release, you can choose how BoltAI reads the highlighted text. Go to Settings > Advanced > App Specific Settings to see the default settings. If you're having troubles with an app, try to use a different strategy.

  • Accessibility: BoltAI would use the Accessibility API to fetch the highlighted text. This works best for native apps such as Apple Notes, Xcode or TextEdit...
  • Clipboard: BoltAI would emulate Copy event to get the content of the highlighted text. This works best for a web browser or web-based apps such as Discord, Slack, Notion...
  • Smart (default): BoltAI checks if the app is web-based or native and would use the proper strategy.
  1. Other improvements:
  • BoltAI supports custom input keyboard layout like Dvorak
  • You can cancel the current AI Command or AI Inline request

CleanShot 2024-09-19 at 21

I've added more than 30 features in the last 3 month. With this release, I'm going to increase the price soon. If you're still on the fence, now is the best time to lock in the current price.

That's it for now 👋


September 16th, 2024

v1.20.5 (build 106)

NewFixed

CleanShot 2024-09-16 at 17

  • New: new import & export UI with more logging
  • New: Added support for OpenAI's latest models: o1-preview & o1-mini
  • Fix: fixed a bug with Data Import & Export
  • Fix: fixed an issue with iCloud sync for default plugins

What's new?

This release improves the Data Import & Export feature. You can see the detail logs on the UI and on the Console app.

BoltAI also supports exporting Memory in this version.