4 Replies Latest reply: May 15, 2014 9:57 AM by Data Kruncher RSS

    Extracting same info from multiple PDFs - a problem

    fgorelik _

      Here's the problem that I've been having lately. I am downloading certain sequential documents (day-to-day; in PDF format) and extracting a few columns of info into Excel for analysis. There is one column that is giving me problems in some instances. This is the way it's set up right now

       

      Player name

      Wallace

      Watson

      York

       

      etc

       

      All the PDFs visually look the same. However, I have seen a few instances where Monarch picks them up as follows:

       

      (space)(space)Player name

      Wallace

      Watson

      York

       

      The +- function in "verify" obviously helps, but this is daily data, so if on the 15th, say, it's in the incorrect format and I move two spots over to the left, the remaining data will be extracted with extra spaces before the actual names. This makes any VLOOKUP I want to do in Excel a tedious task since I have to manually edit the data to make it right. How could I possibly alleviate this problem? I am using V8.