{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreihess5yxzxojec7wd4m45h67h2vt7rnmqlrulabexanqd3vj5bbau",
    "uri": "at://did:plc:xxrzfynfiasdpbxteqxi4jgq/app.bsky.feed.post/3ml5fi67pmhh2"
  },
  "description": "Microsoft Purview Data Security Investigations will add OCR support by mid-2026, enabling automatic text extraction from images to enhance detection of sensitive information. This feature is enabled by default, requires no workflow changes, and improves investigation accuracy while respecting exi...",
  "path": "/m365-message-center/message/mc1301831/",
  "publishedAt": "2026-05-06T00:00:08.000Z",
  "site": "https://blog.tophhie.cloud",
  "tags": [
    "561489",
    "Use AI analysis in Data Security Investigations | Microsoft Learn",
    "Learn about Data Security Investigations | Microsoft Learn"
  ],
  "textContent": "🚨\n\n**Major Update:** This post contains a significant change that may impact your organisation.\n\n**[Introduction]**\n\nMicrosoft Purview **Data Security Investigations (DSI)** is expanding its AI-powered investigation capabilities by adding**optical character recognition (OCR)**. This enhancement enables DSI to extract and analyze text from images, helping organizations identify sensitive information embedded in visual content. This improves the accuracy and depth of data security investigations.\n\nThis message is associated with Microsoft 365 Roadmap ID 561489.\n\n**[When this will happen:]**\n\n  * Public Preview (Worldwide): We will begin rolling out in**late May 2026** and expect to complete by **early June 2026**.\n  * General Availability (Worldwide): We will begin rolling out in **mid-July 2026** and expect to complete by**late July 2026**.\n\n\n\n**[How this affects your organization:]**\n\n**Who is affected:**\n\n  * Admins and analysts using **Microsoft Purview Data Security Investigations (DSI)**\n  * Organizations investigating data security risks using Purview\n\n\n\n**What will happen:**\n\n  * OCR will be **enabled by default** in Data Security Investigations.\n  * DSI will automatically extract text from image-based content (for example: images, screenshots, embedded visuals in files).\n  * Extracted text will be incorporated into investigation datasets to improve search, analysis, and risk detection.\n  * Existing investigation workflows will require no changes.\n  * This can help improve detection of sensitive information that may be embedded in visual content.\n  * Existing Purview policies and controls (such as sensitivity labels and DLP) continue to be respected.\n\n\n\n**[What you can do to prepare:]**\n\n\n\n\nNo action is required prior to rollout.\n\nYou may consider the following:\n\n  * Inform your security and compliance teams about improved detection capabilities involving image-based content.\n  * Review internal investigation procedures to account for insights derived from OCR.\n  * Update any internal documentation or training materials that reference Data Security Investigations capabilities.\n\n\n\n**Learn more:**\n\n  * Use AI analysis in Data Security Investigations | Microsoft Learn\n  * Learn about Data Security Investigations | Microsoft Learn\n\n\n\n**[Compliance considerations:]**\n\n**Consideration** | **Explanation**\n---|---\nAlters how existing customer data is processed | OCR introduces additional processing of image-based content in Data Security Investigations by extracting text for analysis.\nIntroduces or modifies AI/ML capabilities | AI-powered OCR is added to analyze visual content and enhance investigation insights.\nAlters admin monitoring, reporting, or compliance visibility | OCR-enriched data improves investigation depth, which may impact reporting and how compliance activities are reviewed.",
  "title": "MC1301831: Microsoft Purview: Data Security Investigations – Introducing optical character recognition (OCR) support",
  "updatedAt": "2026-05-06T00:00:08.647Z"
}