magic.from_buffer and magic.from_file give different outputs. · Issue #185 · ahupp/python-magic

link管理

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

相关文章推荐

体贴的板凳 · GitHub - ...· 6 月前 ·

年轻有为的双杠 · MQTT Modular Input - ...· 7 月前 ·

踢足球的火龙果 · useEffect 无限循环类型 - ...· 7 月前 ·

健身的小笼包 · 非洲爪蟾的碱性螺旋-环-螺旋转录因子的鉴定与 ...· 7 月前 ·

大力的西瓜 · 便宜买苹果产品！荷兰的Apple教育优惠来了 ...· 10 月前 ·

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement . We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I have been using magic library for detecting mime types and found that getting two different results for the same file when using from_buffer and from_file . Below is the python snippet I tried.

import magic
magic.Magic(mime=True, mime_encoding=True).from_buffer(open("Downloads/Document-magic.docx","r").read(1024))
'application/zip; charset=binary'
magic.Magic(mime=True, mime_encoding=True).from_file("Downloads/Document-magic.docx")
'application/vnd.openxmlformats-officedocument.wordprocessingml.document; charset=binary'

Python version 2.7.12
Does magic library support detecting mime types for files created on one drive or google docs.

Thanks in advance

Ya Right @v00d00 I read 2kb instead of 1kb and got the same output as seen in from_file.

Any idea on a generic number of bytes to read so that it does not fail for any type of files.

Thanks.

Some files have their "signature" bytes at the end of the file, so the most reliable way would be the entire file, at least in my experience.

But as that may be impractical, more the more bytes the better.

推荐文章

体贴的板凳 · GitHub - liuhao1946/RTT-T-Project

6 月前

年轻有为的双杠 · MQTT Modular Input - Fails to connect via SSL/TLS - Splunk Community

7 月前

踢足球的火龙果 · useEffect 无限循环类型 - Webfunny

7 月前

健身的小笼包 · 非洲爪蟾的碱性螺旋-环-螺旋转录因子的鉴定与初步分析 Genome-Wide Survey, Identification and Preliminary Analysis of Xenopus La

7 月前

大力的西瓜 · 便宜买苹果产品！荷兰的Apple教育优惠来了！具体规则进来看~ _ 荷兰生活网 _ 优惠打折 - Powered by Discuz! Archiver

10 月前