`TypeError: expected string or buffer` when .doc is converted to .docx with MS Office in Windows #219

rejuashes · 2016-06-21T06:52:46Z

pydocx.to_html behaves differently on the same .doc file depending on which tool converted it to .docx.

Scenario 1: the .doc file is converted to .docx using LibreOffice on Linux (saved as Microsoft Word 2007/2010/2013 XML). This works fine.

Scenario 2: the .doc file is converted to .docx using MS Office on Windows. This throws an error:

return re.match('^\s*([^\s]+)\s*(.*)$', self.instr)
File "/usr/lib/python2.7/re.py", line 137, in match
return _compile(pattern, flags).match(string)
TypeError: expected string or buffer

Any pointers would be helpful.

regards,

Rajith
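
For readers hitting the same traceback: `re.match` raises `TypeError` whenever its string argument is not a string, for example `None`. The sketch below reproduces that failure and shows one possible guard. It assumes (the thread does not confirm this) that `self.instr` was `None` for the Word-converted file, and that the pattern's quantifiers are `*`. The helper name `parse_instr` and the sample `HYPERLINK` instruction are hypothetical and are not taken from pydocx.

```python
import re

# Pattern modeled on the traceback; the `*` quantifiers are an assumption.
INSTR_PATTERN = re.compile(r'^\s*([^\s]+)\s*(.*)$')


def parse_instr(instr):
    """Hypothetical guarded helper: return a match object, or None if instr is missing."""
    if instr is None:
        # Assumed failure mode: the field instruction is absent.
        return None
    return INSTR_PATTERN.match(instr)


# Reproduce the error: passing None as the string raises TypeError.
try:
    re.match(r'^\s*([^\s]+)\s*(.*)$', None)
    raised = False
except TypeError:
    raised = True

# The guarded helper returns None instead of raising.
missing = parse_instr(None)

# A made-up field instruction, used only to show the two capture groups.
match = parse_instr(' HYPERLINK "http://example.com"')
field_type = match.group(1)
field_args = match.group(2)
```

Whether returning `None` is the right behaviour depends on how the caller uses the match, so treat this as an illustration of the failure, not as a proposed patch.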

kylegibson · 2016-06-21T16:36:53Z

Hi,

Thanks for the issue report! Could you attach the .docx file (the one converted from .doc using MS Office on Windows) that throws the error?

Thanks,

-Kyle

rejuashes · 2016-06-22T06:10:00Z

Hi Kyle,

Attaching the original source .doc file which was converted to .docx.

regards,

rajith

ABC.zip

winhamwr mentioned this issue Jul 29, 2016

TypeError: expected string or buffer when parsing simple field instr #199

Open

winhamwr changed the title ~~pydocx docx to html conversion error~~ TypeError: expected string or buffer when .doc is converted to .docx with MS Office in Windows Jul 29, 2016
