Why does the file utility identify Microsoft Word files as CDF? What is this CDF?

calendar_today Asked Feb 6, 2011
thumb_up 15 upvotes
history Updated April 14, 2026

Direct Answer

Compound Documents format is related to OLE/COM. It refers to linking and embedding objects, for example, Excel charts in Word documents. See the historical (pre-XML) document…. This is an advisory response with reference links, ranked #2nd of 32 by community upvote score, from 2011.


The Problem (Q-score 15, ranked #2nd of 32 in the Word VBA archive)

The scenario as originally posted in 2011

I have some old Microsoft Word files (probably Word 97) lying around here and noticed that the standard Unix file utility identifies such files as “CDF”. It is actually more precise, dumping detailed meta data, for example:

CDF V2 Document,
Little Endian, 
Os: Windows, 
Version 4.0, 
Code page: 1252, 
Title: ..., 
Author: ..., 
Template: Normal.dot, 
Last Saved By: ..., 
Revision Number: 1, 
Name of Creating Application: Microsoft Word 8.0, 
Create Time/Date: ..., 
Last Saved Time/Date: ..., 
Number of Pages: 1, 
Number of Words: 95, 
Number of Characters: 542, 
Security: 0

What does that CDF stand for? Is that kind of a general container format, like RIFF for media files? I can’t find anything useful on the web. “Channel Definition Format” and “Compound Document Format” are clearly not meant, as those Microsoft Word files are completely binary. For Common Data Format I can’t find a connection. I tried to find something in the sourcecode of the file util (in the version which comes with FreeBSD), but I could only find out that it has a dedicated readcdf.c which deals with this format.

Why community consensus is tight on this one

Across 32 Word VBA entries in the archive, the accepted answer here holds elite answer (top 10 %%) status — meaning voters are unusually aligned on the right fix.


The Verified Solution — elite answer (top 10 %%) (+15)

Advisory answer — community consensus with reference links

Note: the verified answer below is a reference / advisory response rather than a copy-ready snippet.

Compound Documents format is related to OLE/COM. It refers to linking and embedding objects, for example, Excel charts in Word documents.

See the historical (pre-XML) document specifications for MS Office, and the specific file format description is “Windows Compound Binary File Format Specification”.


When to Use It — vintage (14+ years old, pre-2013)

A top-10 Word VBA pattern — why it still holds up

Ranks #2nd of 32 in the Word VBA archive. The only pattern ranked immediately above it is “Version Control for word documents” — compare both if you’re choosing between approaches.

What changed between 2011 and 2026

The answer is 15 years old. The Word VBA object model has been stable across Office 2013, 2016, 2019, 2021, 365, and 2024/2026 LTSC, so the pattern still compiles. Changes that might affect you: 64-bit API declarations (use PtrSafe), blocked macros in downloaded files (Mark-of-the-Web), and the shift toward Office Scripts for web-first workflows.

help
Frequently Asked Questions

Why is this answer the top decile of Word VBA Q&A?
expand_more

Answer score +15 vs the Word VBA archive median ~5; this entry is elite. The score plus 15 supporting upvotes on the question itself (+15) means the asker and 14 subsequent voters all validated the approach.

This answer links out — what are the reference links worth following?
expand_more

Read the first external link for the canonical reference, then search this archive for a top-10 entry in the same category — advisory answers are best paired with a ranked code snippet to close the loop.

This answer is 15 years old. Is it still relevant in 2026?
expand_more

Published 2011, which is 15 year(s) before today’s Office 2026 build. The Word VBA object model has had no breaking changes in that window. Three things to re-test: (1) blocked macros on downloaded files (Mark-of-the-Web), (2) 64-bit API declarations (PtrSafe, LongPtr), (3) any shift toward Office Scripts for web scenarios.

Which Word VBA pattern ranks just above this one at #1?
expand_more

The pattern one rank above is “Version Control for word documents”. If your use case overlaps, compare both before committing.

Data source: Community-verified Q&A snapshot. Q-score 15, Answer-score 15, original post 2011, ranked #2nd of 32 in the Word VBA archive. Last regenerated April 14, 2026.