I use tabula-py to extract the pdf table content, the output for numeric as text such as 010019 or 0007 is always convert to float. Is there any way to fix it to return correct value (0007 instead 7.0)
Is there possible the tabula-py extract numeric 007 as 007 instead 7?
12 views Asked by Ray Ronnaret At
1
I just found a work around solution, instead extract to DataFrame, we can extract to json that will provide all raw info.
Output from my file is in the
'text'as below: