Close Menu
  • Home
  • Articles
    • Attacks
      • BEC
      • Data Breach
      • DDoS
      • Evasion Attacks
      • Injection
      • Malware
      • MITM
      • Phishing
      • Ransomware
      • RCE
      • Social Engineering
      • Spoofing
      • Spyware
    • Business and Policy
      • BCP and DRP
      • GRC
      • Regulations
    • Data Protection
      • DLP
      • DRM
      • Encryption
      • IAM
    • Future, Trends and Insight
      • AI
      • Events & Community
      • Emerging Tech
      • Expert Panel
      • Interviews With Experts
      • Insights
      • Study & Research
    • Resources
      • Guides
      • Tools
      • Training & Education
    • Security
      • API
      • Apps
      • Cloud
      • Critical Infrastructure
      • Endpoint
      • Hardware
      • IoT
      • Mobile
      • Network
      • OT
      • Port Security
      • Security Architecture
      • Software Development
      • Supply Chain
      • Zero Trust
    • Threats and Vulnerabilities
      • Emerging Threats
      • Insider Threats
      • Risk Management
      • Threat Intelligence
      • Zero Day
  • News and Exclusives
    • Latest News
    • ISB Exclusive
    • Positive News
  • Who We Are
    • About Us
    • Information Security Buzz Expert Panel​
    • Write for Us
    • Media Pack
  • Contact Us
  • Newsletter
Facebook X (Twitter) LinkedIn
Facebook X (Twitter) LinkedIn
Information Security BuzzInformation Security Buzz
  • Home
  • Articles
    • Attacks
      • BEC
      • Data Breach
      • DDoS
      • Evasion Attacks
      • Injection
      • Malware
      • MITM
      • Phishing
      • Ransomware
      • RCE
      • Social Engineering
      • Spoofing
      • Spyware
    • Business and Policy
      • BCP and DRP
      • GRC
      • Regulations
    • Data Protection
      • DLP
      • DRM
      • Encryption
      • IAM
    • Future, Trends and Insight
      • AI
      • Events & Community
      • Emerging Tech
      • Expert Panel
      • Interviews With Experts
      • Insights
      • Study & Research
    • Resources
      • Guides
      • Tools
      • Training & Education
    • Security
      • API
      • Apps
      • Cloud
      • Critical Infrastructure
      • Endpoint
      • Hardware
      • IoT
      • Mobile
      • Network
      • OT
      • Port Security
      • Security Architecture
      • Software Development
      • Supply Chain
      • Zero Trust
    • Threats and Vulnerabilities
      • Emerging Threats
      • Insider Threats
      • Risk Management
      • Threat Intelligence
      • Zero Day
  • News and Exclusives
    • Latest News
    • ISB Exclusive
    • Positive News
  • Who We Are
    • About Us
    • Information Security Buzz Expert Panel​
    • Write for Us
    • Media Pack
  • Contact Us
  • Newsletter
Subscribe
Information Security BuzzInformation Security Buzz
Home - Articles - Redacting PDFs – A Simple Task With Lots Of Consequences
Articles

Redacting PDFs – A Simple Task With Lots Of Consequences

ISBuzz TeamBy ISBuzz TeamJune 15, 2020Updated:March 13, 20235 Mins Read
Share LinkedIn Twitter Facebook Copy Link Email
Share
Facebook Twitter LinkedIn Email Copy Link
Quick AI Summary
ChatGPTClaudeGeminiGrokPerplexityDeepSeekCopilot

In 2019, lawyers representing the former political advisor Paul Manafort filed a response to special counsel Robert Mueller’s claims that Manafort violated his cooperation agreement by repeatedly lying to prosecutors. Specific sections of this response were redacted before it was released to the public due to the sensitive nature of some of the content. Or at least that’s what they thought.

This trial put PDF redaction in a whole new light, showcasing the importance of doing it right.  Although parts of the public version of this filing appeared to be redacted by black bars at first glance, it quickly became apparent that anyone with Adobe Acrobat, or other PDF viewing tool, or even browser-based viewing tools, could easily copy and paste the text that still existed under the redaction blocks to another document to simply reveal the passages that had been redacted.

However, this was not the first time this happened. A similar incident happened back in 2011 with the Ministry of Defence where a technical error meant blacked-out parts of an online MoD report could be read by pasting into another document.

What probably went wrong?

PDF provides a number of different types of documents that could have played a role in how the redaction was incorrectly carried out. Typically, a document that is scanned in is referred to as a PDF–Image. A scanned document, like a fax, is made up of black and white or colour dots and does not contain any additional text for searching or copying. Redacting this type of document simply involves converting the dots that represent the image of the text to black.

Given lawyers’ workloads and the number of scanned documents required for signatures, we can imagine that whoever performed the redaction believed the document was a scanned document. By drawing black boxes over the text, it would successfully redact the ability to read what was beneath, and remove any ability to view the black and white dots below that made up the actual words.

In fact, there are two other types of PDF documents that contain both the image of text and text data placed “underneath” the text. In these PDFs, the text data is what allows searching within the document and copy-pasting of the document’s text. Such documents can be created in two ways.

Either the image document is run through an Optical Character Recognition (OCR) Module with the text embedded behind the image to enable search and other text capabilities like copy and paste.

Alternatively, the document could be created from a word processing or font capable program directly into a PDF, including text and fonts.

Making an educated guess, the document was likely created in this manner and never scanned. We can draw this conclusion because the Manafort document is very clean, which is evident in the fact that there are no stray dots typically associated with scanned documents, with a very small file size unlike the file size of images with text. In either of these cases, simply drawing a box over the words will not remove the underlying text.

What can safeguard against human error when it comes to redaction? 

While this was a very public redactions mishap which had extreme consequences, this could be something that happened to any of us. The simple act of drawing blocks over text does not suffice where redaction is required, so proper procedures should be in place to help safeguard against erroneous action. There are software tools that support various different redaction needs.

There are tools that provide ‘suggested redactions.’ Options to redact text that is based on patterns and which can be pre-configured. In this manner phone numbers, social security numbers and other consistent patterns can be identified, and the user is given a choice of whether to redact or not redact. Additionally, to cross-check individually, a ‘search and redact’ feature can provide the ability to enter a search term to select a specific word, such as PII data like names or addresses, for redaction.

In terms of targeting content for selection, ‘selective text redaction’ means any text can be selected and redacted. Users can also have the ability to draw a box around text or graphics to redact both the image and the text underneath with ‘selected box redaction.’ Sensitive information will be highlighted and users can confirm redaction when finalising the document – this efficiently exfoliates a document and permanently removes the information. It’s important to remember if you need to remove document metadata as well, which could list your name as the author for instance. You may also want to sanitise content that can alter the document’s appearance. JavaScript, actions, and form fields are types of content that can affect this.

To redact at speed, bulk redaction on case or folder capabilities means users can redact common attributes or values across documents without having to individually sift through text. Bulk redaction with search capabilities or on migration can also help redact common attributes or values across documents. When you’ve isolated all the texts that needs to be redacted, it is key to ensure best practice with file naming and content management after redaction. You can append ‘_redacted’ to the file name to distinguish between versions and delete any content that is past its sell-by date.

While many of us may not find ourselves in the middle of a trial surrounded by global media attention, dealing with sensitive data of many kinds does fall into lots of job descriptions. Understanding how to protect that information through redaction is key to ensuring privacy and data protection, but asking individuals to manually get this right every time, on possibly thousands of documents is unfair. This is where the redaction software can help.

ISBuzz Team
  • ISBuzz Team
    Air Canada Data Breach: BianLian Extortion Group Claims A Massive Heist Contrary To Airline’s Earlier Statement
  • ISBuzz Team
    Unprecedented DDoS Attack Rocks The Web: Tech Giants Reveal A Digital Tsunami
  • ISBuzz Team
    CISA Flags High-Severity Adobe Acrobat Reader Flaw Amid Active Exploits
  • ISBuzz Team
    Curl Security Alert: Patching A Critical Bug Averting Potential Cyber Catastrophe

The opinions expressed in this post belong to the individual contributors and do not necessarily reflect the views of Information Security Buzz.

Share. Facebook Twitter LinkedIn Email Copy Link

Related Posts

Visual data is the blind spot in enterprise security: that’s about to change

May 4, 20267 Mins Read

Making stolen data worthless: why security must start with the data

March 30, 20265 Mins Read

Meta’s Smart Glasses Privacy Scandal Expands After Sama Credentials Found on the Dark Web

March 10, 20264 Mins Read
ISB-Bora-Side-Bar

No se ha podido establecer conexión. Error 429

 
ISB-Bora-Side-Bar
Black ISB Logo

Information Security Buzz is an independent resource that provides the experts’ comments, analysis, and opinion on the latest Cybersecurity news and topics

X (Twitter) LinkedIn Facebook RSS

Working With Us

  • About Us
  • Advertise With Us
  • Contact Us

Write For Us

  • How To Contribute

The Pages

  • Privacy Policy
  • Cookie Policy
  • AI Policy
  • Terms & Conditions
  • Copyright Notice

Information Security Buzz and all its contents are copyright © 2014-2025. All rights reserved. All third-party trademarks are recognized.

Type above and press Enter to search. Press Esc to cancel.

Manage Consent
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
  • Manage options
  • Manage services
  • Manage {vendor_count} vendors
  • Read more about these purposes
View preferences
  • {title}
  • {title}
  • {title}