This really captures something I've been experiencing with Gemini lately. The mo...

nico · 2025-09-26T00:55:18 1758848118

Another issue: Gemini can’t do tool calling and (forced) json output at the same time

If you want to use application/json as the specified output in the request, you can’t use tools

So if you need both, you either hope it gives you correct json when using tools (which many times it doesn’t). Or you have to do two requests, one for the tool calling, another for formatting

At least, even if annoying, this issue is pretty straightforward to get around

mattnewton · 2025-09-26T06:14:44 1758867284

Back before structured outputs were common among model providers, I used to have a “end result” tool the model could call to get the structured response I was looking for. It worked very reliably.

It’s a bit of a hack but maybe that reliably works here?

nico · 2025-09-26T16:17:07 1758903427

You can definitely build an agent and have it use tools like you mention. That’s the equivalent of making 2 requests to Gemini, one to get the initial answer/content, then another to get it formatted as proper json

The issue here is that Gemini has support for some internal tools (like search and web scraping), and when you ask the model to use those, you can’t also ask it to use application/json as the output (which you normally can when not using tools)

Not a huge issue, just annoying

KoolKat23 · 2025-09-26T17:02:02 1758906122

I think this might be also something to do with their super specific outputting requirements when you do use search (has to be displayed in predefined Google format).

behnamoh · 2025-09-26T03:14:46 1758856486

Does any other provider allow that? what use cases are there for JSON + tool calling at the same time?

chrisweekly · 2025-09-26T03:41:12 1758858072

Please correct my likely misunderstanding here, but on the surface, it seems to me that "call some tools then return JSON" has some pretty common use cases.

victorbjorklund · 2025-09-26T07:56:56 1758873416

Let's say you wanna build an app that gives back structured data after a web search. First a tool call to a search api. Then do some reasoning/summar/etc on the data returned by the tool. And finally return JSON.

ayende · 2025-09-26T05:39:16 1758865156

OpenAI, Ollama, DeepSeek all do that.

And wanting to programmatically work with the result + allow tool calls is super common.

shijithpk · 2025-09-30T18:05:02 1759255502

Suppose there's a pdf with lots of tables i want to scrape. I mention the pdf url in my message and with gemini's url context tool, i now have access to the pdf.

I can ask gemini to give me the pdf's content as a json and it complies most of the time. But at times, there's an introductory line like "Here's your json:". Those introductory lines interfere with programmatically using the output. They're sometimes there, sometimes not.

If I could have structured output at the same time as tool use, I can reliably use what gemini spits out as it'll be in a json, no annoying intro lines.

wahnfrieden · 2025-09-26T03:41:54 1758858114

OpenAI

golfer · 2025-09-25T19:51:36 1758829896

Unfortunately Gemini isn't the only culprit here. I've had major problems with ChatGPT reliability myself.

mguerville · 2025-09-25T20:42:28 1758832948

I only hit that problem in voice mode, it'll just stop halfway and restart. It's a jarring reminder of its lack of "real" intelligence

patrickmcnamara · 2025-09-25T21:28:24 1758835704

I've heard a lot that voice mode uses a faster (and worse) model than regular ChatGPT. So I think this makes sense. But I haven't seen this in any official documentation.

Narciss · 2025-09-25T21:52:42 1758837162

This is more because of VAD - voice activity detection

SilverElfin · 2025-09-25T21:38:35 1758836315

I think what I am seeing from ChatGPT is highly varying performance. I think this must be something they are doing to manage limitations of compute or costs. With Gemini, I think what I see is slightly different - more like a lower “peak capability” than ChatGPT’s “peak capability”.

Fade_Dance · 2025-09-26T01:53:26 1758851606

I'm fairly sure there's some sort of dynamic load balancing at work. I read an anecdote from someone had a test where they asked it to draw a little image (something like an ascii cat, but probably not exactly that since it seems a bit basic), and if the result came back poor they didn't bother using it until a different time of day.

Of course it could all be placebo, but when you intuitively think about it, somewhere on the road the the hundreds of billions in datacenter capex, one would think that there will be periods where compute and demand are out of sync. It's also perfectly understandable why now would be a time to be seeing that.

driese · 2025-09-25T21:09:27 1758834567

Small things like this or the fact that AI studio still has issues with simple scrolling confuse me. How does such a brilliant tool still lack such basic things?

victorbjorklund · 2025-09-26T07:59:43 1758873583

It's crazy how Google can create so many really amazing products technically but they fall short just because of basic UI/UX issues.

normie3000 · 2025-09-25T21:27:15 1758835635

I see Gemini web frequently break its own syntax highlighting.

brap · 2025-09-25T22:44:00 1758840240

The scrolling in AI Studio is an absolute nightmare and somehow they managed to make it worse.

It’s so annoying that you have this super capable model but you interact with it using an app that is complete ass

SXX · 2025-09-26T05:47:34 1758865654

App was likely built my same LLM...

Spooky23 · 2025-09-26T14:18:12 1758896292

Because they are moving fast and breaking shit.

Ask ChatGPT to output markdown or PDF on iOS or Mac app and the web experience. The web is often better - the apps will return nothing.

SkyPuncher · 2025-09-26T02:48:46 1758854926

This is my perception as well.

Gemini 2.5 Pro is _amazing_ for software architecture, but I just get tired of poking it along. Sonnet does well enough.

dorianmariecom · 2025-09-25T18:47:23 1758826043

chatgpt also has lots of reliability issues

diego_sandoval · 2025-09-25T18:58:59 1758826739

If anyone from OpenAI is reading this, I have two complaints:

1. Using the "Projects" thing (Folder organization) makes my browser tab (on Firefox) become unusably slow after a while. I'm basically forced to use the default chats organization, even though I would like to organize my chats in folders.

2. After editing a message that you already sent,you get to select between the different branches of the chat (1/2, and so on), which is cool, but when ChatGPT fails to generate a response in this "branched conversation" context, it will continue failing forever. When your conversation is a single thread and a ChatGPT message fails with an error, re trying usually works and the chat continues normally.

porridgeraisin · 2025-09-25T20:16:06 1758831366

And 3)

On mobile (android) opening the keyboard scrolls the chat to the bottom! I sometimes want to type referring something from the middle of the LLMs last answer.

Sabinus · 2025-09-25T22:26:35 1758839195

Projects should have their own memory system. Perhaps something more interactive than the existing Memories but projects need their own data (definitions, facts, draft documents) that is iterated on and referred to per project. Attached documents aren't it, the AI needs to be able to update the data over multiple chats.

zarmin · 2025-09-25T19:25:53 1758828353

It would also be nice if ChatGPT could move chats between projects. My sidebar is a nightmare.

throwaway240403 · 2025-09-25T20:52:20 1758833540

You can drag and drop chats between projects

zarmin · 2025-09-25T23:24:30 1758842670

i know. i want the assistant to do it. shouldn't it be able to do work on its own platform?

m101 · 2025-09-25T20:36:47 1758832607

I wonder if this is because a memory cap was reached at that output token. Perhaps they route conversations to different hardware depending on how long they expect it to be.

smittywerben · 2025-09-26T05:43:45 1758865425

When this happened to me it was because, I can only guess, it was the Gemini servers were overloaded. Symptoms: Gemini model, Opaque API wrapper error, truncated responses. To be fair the Anthropic servers are overloaded a lot too but they have a clear error. I gave Gemini a few days on the bench and it fixed itself without any client side changes. YMMV.

tschillaci · 2025-09-26T08:38:18 1758875898

Half my requests get retried because they fail, I've contributed to a ticket in June, with no fix yet.

mattmanser · 2025-09-25T19:14:35 1758827675

That used to happen a lot in ChatGPT too.

simlevesque · 2025-09-25T19:33:17 1758828797

The latest comment on that issue is someone saying there's a fix available for you to try.

tanvach · 2025-09-25T21:09:05 1758834545

Yes agree, it was totally broken when I tested the API two months ago. Lots of failed to connect and very slow response time. Hoping the update fixes these issues.

KoolKat23 · 2025-09-26T17:04:32 1758906272

It's been a lot better lately. Nothing like two months ago at all.

qnleigh · 2025-10-02T07:33:35 1759390415

What happens if you ask it to please continue? Does it start over?

drgoogle · 2025-09-26T02:01:56 1758852116

> I've been running into it consistently, responses that just stop mid-sentence

I’ve seen that behavior when LLMs of any make or model aren’t given enough time or allowed enough tokens.

reissbaker · 2025-09-25T21:22:25 1758835345

FWIW, I think GLM-4.5 or Kimi K2 0905 fit the bill pretty well in terms of complete and consistent.

(Disclosure: I'm the founder of Synthetic.new, a company that runs open-source LLMs for monthly subscriptions.)

noname120 · 2025-09-25T21:38:48 1758836328

That’s not a “disclosure”, that’s an ad.