Any good openrouter interface which is private and secure?

Privacy Guides Community [Unofficial] April 10, 2026

Dellsam1:

The only problem now is that I can’t seem to find a way to limit the messages sent to the API

Here you go:

docs.openwebui.com

❓ FAQ / Open WebUI

Q: Why am I seeing multiple API requests when I only send one message? Why is my token usage higher than expected?

A: Open WebUI uses Task Models to power background features that enhance your chat experience. When you send a single message, additional API calls may be made for:

  • Title Generation: Automatically generating a title for new chats
  • Tag Generation: Auto-tagging chats for organization
  • Query Generation: Creating optimized search queries for RAG (when you attach files or knowledge)
  • Web Search Queries: Generating search terms when web search is enabled
  • Autocomplete Suggestions: If enabled

By default, these tasks use the same model you’re chatting with. If you’re using an expensive API model (like GPT-4 or Claude), this can significantly increase your costs.
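
To make the multiplication concrete, here is a rough sketch that counts how many requests a single message could trigger. The feature names and the one-call-per-feature rule are assumptions made for illustration, not Open WebUI's actual internals; real counts depend on your version and settings.

```python
# Hypothetical model: each enabled background feature adds one extra API call.
# The keys below are illustrative labels, not real Open WebUI setting names.
BACKGROUND_FEATURES = {
    "title_generation": True,
    "tag_generation": True,
    "query_generation": False,    # only relevant when files/knowledge are attached
    "web_search_queries": False,  # only relevant when web search is enabled
    "autocomplete": False,
}


def calls_per_message(features: dict[str, bool]) -> int:
    """Return 1 (the chat reply itself) plus one call per enabled feature."""
    return 1 + sum(1 for enabled in features.values() if enabled)


print(calls_per_message(BACKGROUND_FEATURES))
```

Under these assumptions, turning off a feature removes one request from the total, which is why disabling auto-title and auto-tags is the quickest way to cut the number of calls.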

To reduce API costs:

  1. Go to Admin Panel > Settings > Interface (for title/tag generation settings)
  2. Configure a Task Model under Admin Panel > Settings > Models to use a smaller, cheaper model (like GPT-4o-mini) or a local model for background tasks
  3. Disable features you don’t need (auto-title, auto-tags, etc.)

Cost-Saving Recommendation

Set your Task Model to a fast, inexpensive model (or a local model via Ollama) while keeping your primary chat model as a more capable one. This gives you the best of both worlds: smart responses for your conversations, cheap/free processing for background tasks.
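
As a back-of-the-envelope check on that recommendation, the sketch below compares running everything on one expensive model against routing background tasks to a cheaper one. Every price and token count here is a made-up placeholder, so substitute the numbers from your own provider.

```python
# All prices and token counts are hypothetical placeholders, not real pricing.
def cost_usd(tokens: int, usd_per_million_tokens: float) -> float:
    """Cost of processing `tokens` at a given per-million-token price."""
    return tokens / 1_000_000 * usd_per_million_tokens


CHAT_TOKENS = 2_000      # assumed tokens for the actual chat reply
TASK_TOKENS = 1_500      # assumed tokens across background tasks (title, tags)
EXPENSIVE_PRICE = 10.0   # hypothetical USD per million tokens, chat model
CHEAP_PRICE = 0.5        # hypothetical USD per million tokens, task model

# Default behaviour: background tasks run on the same model as the chat.
same_model = cost_usd(CHAT_TOKENS + TASK_TOKENS, EXPENSIVE_PRICE)

# With a separate Task Model: only the chat reply uses the expensive model.
split_models = cost_usd(CHAT_TOKENS, EXPENSIVE_PRICE) + cost_usd(TASK_TOKENS, CHEAP_PRICE)

print(f"same model:   ${same_model:.5f} per message")
print(f"split models: ${split_models:.5f} per message")
```

With a local task model the second term drops to zero in API fees, leaving only the cost of the chat reply.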

For more optimization tips, see the Performance Tips Guide.

Dellsam1:

I was also hoping for a ChatGPT-style memory

Memory is currently beta/experimental:

docs.openwebui.com

Memory & Personalization / Open WebUI

The Memory system is currently in Beta/Experimental stage. You may encounter inconsistencies in how models store or retrieve information, and storage formats may change in future updates.
