textract: Getting " failed with exit code 127" on windows 10.
I have heard from different sources this error is associated with windows 10 .
textract.exceptions.ShellError: The command
pdftotext …/data/input/example_resumes\Brendan_Herger_Resume.pdf - failed with exit code 127 ------------- stdout ------------- ------------- stderr -------------
But still i cant find any solution. Please help my system specs are Python 2.7 and Windows 10.
About this issue
- Original URL
- State: closed
- Created 6 years ago
- Reactions: 9
- Comments: 29 (2 by maintainers)
After a little bit of research I found out that the text extractors were working in a BASH environment, but not from the Windows command line. For me these text extractors were located at
C:\Program Files\Git\mingw64\bin
and by adding this to my PATH variables I solved the errors.For those working with .doc filetypes, antiword doesn’t work if there are spaces in your FULL file path. Removing spaces in the file path fixed all issues for me.
Yes, currently the same, for .doc
It seems an OS error. The PC couldn’t pipe to ‘pdftotext’. If you used the cmd in Windows PC, let’s try to use a bash type console like the git bash to execute a command ‘python ***.py’.