{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreib376y4l3d4rkt5kt2kns2kfyn2rscierhpba4xv4cvwoizwdcsia",
    "uri": "at://did:plc:mz7h4r2iyp2egghuqaolnsev/app.bsky.feed.post/3mlmic7kzyez2"
  },
  "path": "/forum/ios-ipados/describe-images-using-local-llms-shortcuts-do",
  "publishedAt": "2026-05-11T22:54:09.000Z",
  "site": "https://applevis.com",
  "textContent": "Hey guys!\nI would like to try using the iPhone to play. For this, since using Gemini uses up a lot of tokens quickly, I would like to ask how good local models are for describing images and if it is possible to make a shortcut for this.\nThe idea would be the following:\nI press a button on the controller. The ption change. I make a gesture on the iPhone screen with VoiceOver. silently, it takes a screenshot of the screen, sends it to llm with a specific prompt, speaks and deletes the prompt.\nDo you think it would work? Find out which option is in focus, player status, among others.",
  "title": "describe images using local llms and shortcuts to do this"
}