Telegram: voice/audio-only messages don't trigger a response

Fix Telegram voice notes that are ignored by enabling inbound audio transcription (tools.media.audio) and ensuring media limits allow downloading/transcribing.

By CoClaw Team

Symptoms

  • Sending a Telegram voice note (audio-only) produces no reply.
  • If you add any text (for example, a "?" alongside the voice note), the bot replies and the transcription works.

Cause

Audio-only messages need a transcription step to produce a usable text prompt. If inbound audio transcription is disabled (or blocked by limits/scope), the gateway may treat the message as “no text” and never start the normal reply flow.

Fix

1) Enable inbound audio transcription

In your config, enable tools.media.audio and pick a provider (or rely on auto-detection).

Example (OpenAI transcription):

{
  tools: {
    media: {
      audio: {
        enabled: true,
        models: [{ provider: "openai", model: "gpt-4o-mini-transcribe" }],
      },
    },
  },
}

Restart the gateway after changes.

2) Ensure media limits allow voice notes

  • Telegram media cap: channels.telegram.mediaMaxMb (default 5MB)
  • Audio transcription cap: tools.media.audio.maxBytes (default 20MB)

If your voice notes are larger than the Telegram cap, they may never be downloaded/transcribed.
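
A minimal sketch of raising the Telegram cap so it no longer sits below the transcription cap, using the two keys named above. The values are illustrative assumptions, not recommendations; adjust them to the size of the voice notes you expect.

```json5
{
  channels: {
    telegram: {
      // Illustrative value (assumption): raise the Telegram download cap, in MB
      mediaMaxMb: 20,
    },
  },
  tools: {
    media: {
      audio: {
        enabled: true,
        // Illustrative value (assumption): transcription cap in bytes (20 * 1024 * 1024)
        maxBytes: 20971520,
      },
    },
  },
}
```

Restart the gateway after changing either limit.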

3) Probe and check logs

On the gateway host:

openclaw channels status --probe
openclaw logs --follow

Look for audio transcription messages (or errors about media size/timeout/auth).

Verify

  • Send an audio-only voice note; the bot replies with a transcript-based answer.
  • The logs show an audio transcription decision/output.

Verification & references

  • Reviewed by: CoClaw Editorial Team
  • Last reviewed: March 14, 2026
  • Verified on: Linux · macOS · Windows