{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreib376y4l3d4rkt5kt2kns2kfyn2rscierhpba4xv4cvwoizwdcsia",
"uri": "at://did:plc:mz7h4r2iyp2egghuqaolnsev/app.bsky.feed.post/3mlmic7kzyez2"
},
"path": "/forum/ios-ipados/describe-images-using-local-llms-shortcuts-do",
"publishedAt": "2026-05-11T22:54:09.000Z",
"site": "https://applevis.com",
"textContent": "Hey guys!\nI would like to try using the iPhone to play. For this, since using Gemini uses up a lot of tokens quickly, I would like to ask how good local models are for describing images and if it is possible to make a shortcut for this.\nThe idea would be the following:\nI press a button on the controller. The ption change. I make a gesture on the iPhone screen with VoiceOver. silently, it takes a screenshot of the screen, sends it to llm with a specific prompt, speaks and deletes the prompt.\nDo you think it would work? Find out which option is in focus, player status, among others.",
"title": "describe images using local llms and shortcuts to do this"
}