2017-04-29 13:33 UTC

ID: 0012971
Project: CentOS-7
Category: unzip
View Status: public
Last Update: 2017-03-15 12:30
Reporter: CraigFinch
Priority: normal
Severity: minor
Reproducibility: always
Status: confirmed
Resolution: open
Platform: x86_64
OS: CentOS 7 (core)
OS Version: 7.3.1611
Product Version: 7.3.1611
Target Version:
Fixed in Version:
Summary: 0012971: Unzip on CentOS fails to extract files from large archive; same version of unzip works fine on OS X
Description:
Problem: On CentOS 7 Linux, unzip reports CRC errors in a large (~1GB) zip file that contains 10 text files (each ~1GB uncompressed). This Linux machine is running on a VM, and the .ZIP file is located on a shared folder with the Mac OS X host. unzip works fine on the Mac, but running unzip on the *exact same* file from the CentOS VM produces CRC errors and errors in the extracted files. Therefore, I am filing this bug with CentOS, rather than upstream.

System 1: CentOS Linux release 7.3.1611 (Core)

unzip -v

UnZip 6.00 of 20 April 2009, by Info-ZIP. Maintained by C. Spieler. Send
bug reports using http://www.info-zip.org/zip-bug.html; see README for details.

Latest sources and executables are at ftp://ftp.info-zip.org/pub/infozip/ ;
see ftp://ftp.info-zip.org/pub/infozip/UnZip.html for other sites.

Compiled with gcc 4.8.5 20150623 (Red Hat 4.8.5-11) for Unix (Linux ELF) on Nov 5 2016.

UnZip special compilation options:
        COPYRIGHT_CLEAN (PKZIP 0.9x unreducing method not supported)
        SET_DIR_ATTRIB
        SYMLINKS (symbolic links supported, if RTL and file system permit)
        TIMESTAMP
        UNIXBACKUP
        USE_EF_UT_TIME
        USE_UNSHRINK (PKZIP/Zip 1.x unshrinking method supported)
        USE_DEFLATE64 (PKZIP 4.x Deflate64(tm) supported)
        UNICODE_SUPPORT [wide-chars, char coding: UTF-8] (handle UTF-8 paths)
        MBCS-support (multibyte character support, MB_CUR_MAX = 6)
        LARGE_FILE_SUPPORT (large files over 2 GiB supported)
        ZIP64_SUPPORT (archives using Zip64 for large files supported)
        USE_BZIP2 (PKZIP 4.6+, using bzip2 lib version 1.0.6, 6-Sept-2010)
        VMS_TEXT_CONV
        [decryption, version 2.11 of 05 Jan 2007]

UnZip and ZipInfo environment options:
           UNZIP: [none]
        UNZIPOPT: [none]
         ZIPINFO: [none]
      ZIPINFOOPT: [none]


System 2: Mac OS X (10.12.3)
unzip -v
UnZip 6.00 of 20 April 2009, by Info-ZIP. Maintained by C. Spieler. Send
bug reports using http://www.info-zip.org/zip-bug.html; see README for details.

Latest sources and executables are at ftp://ftp.info-zip.org/pub/infozip/ ;
see ftp://ftp.info-zip.org/pub/infozip/UnZip.html for other sites.

Compiled with gcc 4.2.1 Compatible Apple LLVM 8.0.0 (clang-800.0.34) for Unix Mac OS X on Jul 30 2016.

UnZip special compilation options:
        COPYRIGHT_CLEAN (PKZIP 0.9x unreducing method not supported)
        SET_DIR_ATTRIB
        SYMLINKS (symbolic links supported, if RTL and file system permit)
        TIMESTAMP
        UNIXBACKUP
        USE_EF_UT_TIME
        USE_UNSHRINK (PKZIP/Zip 1.x unshrinking method supported)
        USE_DEFLATE64 (PKZIP 4.x Deflate64(tm) supported)
        LARGE_FILE_SUPPORT (large files over 2 GiB supported)
        ZIP64_SUPPORT (archives using Zip64 for large files supported)
        VMS_TEXT_CONV
        [decryption, version 2.11 of 05 Jan 2007]

UnZip and ZipInfo environment options:
           UNZIP: [none]
        UNZIPOPT: [none]
         ZIPINFO: [none]
      ZIPINFOOPT: [none]
Steps To Reproduce:
Step 1: Download this publicly available ZIP file:
ftp://ftp.dos.state.fl.us/public/doc/Quarterly/Cor/cordata.zip

Step 2: Attempt to extract (or run unzip -t) with latest CentOS 7. CRC errors and data corruption result.

Step 3: Extract with unzip (or unzip -t) on latest Mac OS X. No CRC errors reported, and data is not corrupted.
Additional Information:
Output of unzip -t on CentOS:

unzip -t /vagrant/cordata.zip
Archive: /vagrant/cordata.zip
    testing: cordata0.txt bad CRC babd3206 (should be 30142f03)
    testing: cordata1.txt bad CRC 523b64fd (should be c5fd87ae)
    testing: cordata2.txt bad CRC d75f9870 (should be eae7fdd8)
    testing: cordata3.txt bad CRC ef248dd3 (should be 30e8e892)
    testing: cordata4.txt bad CRC 270d77e0 (should be 58137796)
    testing: cordata5.txt bad CRC c5f9a9ed (should be b40b3ea7)
    testing: cordata6.txt bad CRC 52beee10 (should be 571384fe)
    testing: cordata7.txt bad CRC 738dec2d (should be 487ac9d5)
    testing: cordata8.txt bad CRC 416cf7a1 (should be 38f076b0)
    testing: cordata9.txt bad CRC 25249335 (should be 2d681da3)
At least one error was detected in /vagrant/cordata.zip.

Output on OS X:

Archive: cordata.zip
    testing: cordata0.txt OK
    testing: cordata1.txt OK
    testing: cordata2.txt OK
    testing: cordata3.txt OK
    testing: cordata4.txt OK
    testing: cordata5.txt OK
    testing: cordata6.txt OK
    testing: cordata7.txt OK
    testing: cordata8.txt OK
    testing: cordata9.txt OK
No errors detected in compressed data of cordata.zip.
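
To compare the two `unzip -t` runs above without reading them by eye, the failing members can be pulled out of the captured output. The following is only a sketch: the pattern is inferred from the lines quoted in this report, and the helper name `parse_bad_crc` is made up for illustration. Other UnZip versions or locales may word the message differently.

```python
import re

# Pattern inferred from the `unzip -t` lines quoted in this report
# (assumption: other UnZip builds may format the message differently).
BAD_CRC = re.compile(
    r"^\s*testing:\s+(?P<name>.+?)\s+bad CRC (?P<actual>[0-9a-f]{8})\s+"
    r"\(should be (?P<expected>[0-9a-f]{8})\)\s*$"
)


def parse_bad_crc(output):
    """Return (member, computed CRC, stored CRC) for each failing member."""
    failures = []
    for line in output.splitlines():
        match = BAD_CRC.match(line)
        if match:
            failures.append((match["name"], match["actual"], match["expected"]))
    return failures
```

Feeding it the CentOS output would yield ten tuples, one per member; the OS X output would yield an empty list because every line ends in `OK`.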

Output from 7za on CentOS:

7-Zip (a) [64] 16.02 : Copyright (c) 1999-2016 Igor Pavlov : 2016-05-21
p7zip Version 16.02 (locale=en_US.UTF-8,Utf16=on,HugeFiles=on,64 bits,2 CPUs Intel(R) Core(TM) i5-4278U CPU @ 2.60GHz (40651),ASM,AES-NI)

Scanning the drive for archives:
1 file, 1008349434 bytes (962 MiB)

Listing archive: /vagrant/cordata.zip

--
Path = /vagrant/cordata.zip
Type = zip
Physical Size = 1008349434

   Date      Time    Attr         Size   Compressed  Name
------------------- ----- ------------ ------------  ------------------------
2017-01-08 08:43:44 ....A   1044819846    100777324  cordata0.txt
2017-01-08 08:44:30 ....A   1045083732    100827704  cordata1.txt
2017-01-08 08:45:22 ....A   1045018842    100831287  cordata2.txt
2017-01-08 08:46:08 ....A   1045007306    100864690  cordata3.txt
2017-01-08 08:47:02 ....A   1045031820    100840906  cordata4.txt
2017-01-08 08:47:46 ....A   1044853012    100828261  cordata5.txt
2017-01-08 08:48:38 ....A   1044871758    100821894  cordata6.txt
2017-01-08 08:49:24 ....A   1044902040    100858344  cordata7.txt
2017-01-08 08:50:12 ....A   1044809752    100876654  cordata8.txt
2017-01-08 08:51:02 ....A   1044760724    100821348  cordata9.txt
------------------- ----- ------------ ------------  ------------------------
2017-01-08 08:51:02        10449158832   1008348412  10 files

Tags: No tags attached.

Notes

~0028863

tru (administrator)

confirmed:
wget ftp://ftp.dos.state.fl.us/public/doc/Quarterly/Cor/cordata.zip && sha256sum cordata.zip
ddc7d1d7a821618ae72212b9dc829705b05955f99e81b687044a06b498a2744b cordata.zip

CentOS-5: works fine (unzip-5.52-3.el5.x86_64)
CentOS-6: works fine (unzip-6.0-4.el6.x86_64)
CentOS-7: failed (unzip-6.0-16.el7.x86_64)
-> regression

~0028864

tru (administrator)

from a CentOS-5/6/7 machine (unzip -l works fine)
$ unzip -lv cordata.zip
Archive: cordata.zip
     Length  Method        Size  Cmpr  Date        Time   CRC-32    Name
-----------  ------  ----------  ----  ----------  -----  --------  ----
 1044819846  Def64N   100777324   90%  01-08-2017  08:43  30142f03  cordata0.txt
 1045083732  Def64N   100827704   90%  01-08-2017  08:44  c5fd87ae  cordata1.txt
 1045018842  Def64N   100831287   90%  01-08-2017  08:45  eae7fdd8  cordata2.txt
 1045007306  Def64N   100864690   90%  01-08-2017  08:46  30e8e892  cordata3.txt
 1045031820  Def64N   100840906   90%  01-08-2017  08:47  58137796  cordata4.txt
 1044853012  Def64N   100828261   90%  01-08-2017  08:47  b40b3ea7  cordata5.txt
 1044871758  Def64N   100821894   90%  01-08-2017  08:48  571384fe  cordata6.txt
 1044902040  Def64N   100858344   90%  01-08-2017  08:49  487ac9d5  cordata7.txt
 1044809752  Def64N   100876654   90%  01-08-2017  08:50  38f076b0  cordata8.txt
 1044760724  Def64N   100821348   90%  01-08-2017  08:51  2d681da3  cordata9.txt
-----------          ----------  ----                              -------
10449158832          1008348412   90%                              10 files
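
The listing above shows every member stored as `Def64N` (Deflate64, method 9 in the ZIP specification). As a cross-check that does not depend on unzip at all, the central directory can be read with Python's standard `zipfile` module. This is a minimal sketch: the helper name `list_members` is hypothetical, and it only reads metadata. To my knowledge `zipfile` cannot decompress Deflate64 members, so it is of no use for testing the data itself.

```python
import zipfile

# Method IDs as assigned by the ZIP format specification (APPNOTE).
METHOD_NAMES = {0: "stored", 8: "deflate", 9: "deflate64", 12: "bzip2", 14: "lzma"}


def list_members(zip_path):
    """Read only the central directory; no member data is decompressed."""
    with zipfile.ZipFile(zip_path) as archive:
        return [
            (
                info.filename,
                # Fall back to the raw numeric ID for methods not named above.
                METHOD_NAMES.get(info.compress_type, str(info.compress_type)),
                format(info.CRC, "08x"),  # stored CRC-32, as unzip -lv prints it
                info.file_size,           # uncompressed size in bytes
            )
            for info in archive.infolist()
        ]
```

If the stored CRC values reported this way match the `CRC-32` column above on all three CentOS releases, the archive metadata is being read consistently and the difference lies in decompression.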

 unzip -x cordata.zip cordata0.txt ; sha256sum cordata0.txt
corrupted data on CentOS-7:
8e93970f10b28bc652c3c172257eba000b1f34815b25bb22aeb1697d287966cb cordata0.txt
original data:
d96da569f38373a902bc9b42fd5f636c0e8d5c289d6ebf8eb9d54a0eabd3e3de cordata0.txt
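
For anyone repeating the comparison above on files of this size, both checksums can be computed in one streaming pass, so a ~1GB extracted file never has to be held in memory. This is a sketch only; the function name `file_digests` and the 1 MiB chunk size are arbitrary choices, not anything taken from unzip.

```python
import hashlib
import zlib


def file_digests(path, chunk_size=1 << 20):
    """Stream a file once and return (crc32_hex, sha256_hex)."""
    crc = 0
    sha = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            crc = zlib.crc32(chunk, crc)  # running CRC-32 across chunks
            sha.update(chunk)
    return format(crc & 0xFFFFFFFF, "08x"), sha.hexdigest()
```

For an intact cordata0.txt the CRC-32 should equal the stored value 30142f03 from the listing, and the SHA-256 should equal the "original data" digest quoted above.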

~0028865

tru (administrator)

reported upstream
https://bugzilla.redhat.com/show_bug.cgi?id=1432453

Issue History
Date Modified     Username    Field                Change
2017-03-15 04:28  CraigFinch  New Issue
2017-03-15 12:17  tru         Note Added: 0028863
2017-03-15 12:24  tru         Note Added: 0028864
2017-03-15 12:30  tru         Note Added: 0028865
2017-03-15 12:30  tru         Status               new => confirmed