View Issue Details

IDProjectCategoryView StatusLast Update
0017585CentOS-7filepublic2020-07-15 07:55
Reporterrocketraman 
PrioritynormalSeveritymajorReproducibilityhave not tried
Status confirmedResolutionopen 
Product Version 
Target VersionFixed in Version 
Summary0017585: File does not detect Word docx files properly
DescriptionThe version of `file` in Centos-7 does not detect Word docx files properly. It reports:

```
test.docx: Zip archive data, at least v2.0 to extract
```

instead of:

```
test.docx: Microsoft Word 2007+
```

This leads to other problems with programs that rely on the output of `file` being correct. For example, this issue in an earlier version of Debian also occurs in Centos-7:

https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=717096

# cat /etc/os-release
NAME="CentOS Linux"
VERSION="7 (Core)"
ID="centos"
ID_LIKE="rhel fedora"
VERSION_ID="7"
PRETTY_NAME="CentOS Linux 7 (Core)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:7"
HOME_URL="https://www.centos.org/"
BUG_REPORT_URL="https://bugs.centos.org/"

CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"

# rpm -q --whatprovides $(which file)
file-5.11-36.el7.x86_64
TagsNo tags attached.
abrt_hash
URL

Activities

rocketraman

rocketraman

2020-07-09 22:43

reporter   ~0037348

I should have said:

> The version of `file` in Centos-7 does not detect *all* Word docx files properly.

See also this Fedora bug, which contains a reproducer:

https://bugzilla.redhat.com/show_bug.cgi?id=1014299

The problem is apparently fixed in file 5.19.
ManuelWolfshant

ManuelWolfshant

2020-07-10 08:04

manager   ~0037350

I cannot reproduce that but because CentOS is a rebuild of the sources used to create RHEL, please submit your request to Redhat via bugzilla.redhat.com. If/when RH accepts it and incorporates a fix into RHEL and releases a patched version, CentOS will pick it up automatically.
It will help them if you can provide a reproducer.
Please help us track the issue by crosslinking this bug with the one you open at bugzilla.redhat.com
rocketraman

rocketraman

2020-07-15 06:44

reporter   ~0037368

Attached reproducer doc.

docx-test-zip.docx (10,870 bytes)
rocketraman

rocketraman

2020-07-15 06:44

reporter   ~0037369

RHEL bug: https://bugzilla.redhat.com/show_bug.cgi?id=1857095

Issue History

Date Modified Username Field Change
2020-07-09 22:36 rocketraman New Issue
2020-07-09 22:43 rocketraman Note Added: 0037348
2020-07-10 08:04 ManuelWolfshant Note Added: 0037350
2020-07-15 06:44 rocketraman File Added: docx-test-zip.docx
2020-07-15 06:44 rocketraman Note Added: 0037368
2020-07-15 06:44 rocketraman Note Added: 0037369
2020-07-15 07:55 ManuelWolfshant Status new => confirmed