6 Replies Latest reply: May 15, 2014 9:57 AM by Grant Perkins RSS

    trapping text problem from newbie

    Nupur _

      Hello,

      I'm having trouble figuring out how best to capture the following data.

       

      RPT 04                                                   PATIENT ABSTRACT                                         PAGE   1

      HOSPITAL                                                                                RUN-DATE 08/08/2005

                 

      ACCESSION NO.  1993/000001            *  MED REC NO.          *   MK:

      • PATIENT IDENTIFICATION DATA        *        PERSONAL DATA           *       EMPLOYMENT DATA     *      SECONDARY CONTACT

      GALT,JOHN                             *  DOB: mm/dd/yyyy MICHIGAN      * SOC SEC NO:xxx-xx-xxxx    * N/A                           

                                             

      • DOD: mm/dd/yyyy  UNKNOWN IF P *                           *                              

      address line 1                        *  PL DEATH: UNKNOWN             *                           *                              

      City                 St zipcode       *  MALE                          *                           *                                 

      County                                *  WHITE      NON-SPANISH        * CURR IND: N/A             *          

      (999) 999-9999                        *                                * LONG IND: N/A             *                              

      MAID NM: N/A             SFIX: N/A    *  LOST TO FOL:       LNG:       * CURR OCC: N/A             * 

      ALIAS:   N/A             PFIX: N/A    *                                * LONG OCC: N/A             *                LNG:  

      SPSE:  GALT, JANE                     *                                *                           *

      COMMENTS: H/O BLADDER CA 1976(PROV). QUIT SMOK 36Y DRINK 34Y     PT LAST UPDATE DATE  dd/mm/yyyy

                 

      DX IDENTIFICATION* SITE: C240 02  BILE DUCT - EXTRAHEPATI MOR ICDO3: 81403 ADENOCARCINOMA/AD MALIG-PRIMARY   GRD3 POORLY DIFFEREN

                        

      •                                        MOR ICDO2: 81403 ADENOCA/ ADENOMA  MALIG-PRIMARY   GRD3 POORLY DIFFEREN

                        

      • LATERALITY    - NOT A PAIRED SITE             

                        

      • CONFIRMATION  - POSITIVE HISTOLOGY                    REPORT SRC: HOSPITAL (INPT/OUTPT/CLINIC) 

                        

      • CLASS OF CASE - DX RPT FAC & ALL/PART 1ST CRS TRT AT RPT FAC     

                        

      • 1ST CONTACT   - 01/19/1993  INITIAL DX - 01/23/1993  1ST POS BX - 01/23/1993  ADM: 01/19/1993    DISCH: 02/15/1993

                        

      • PRIMARY PAYER - INSURANCE STATUS UNKNOWN           

                        

      • FAM HIST      - NO       TOBACCO HX: PREVIOUS USE            ALCOHOL HX: PAST HISTORY OF ALCOHO MARITAL: MARRIED 

                        

      • COMORBIDTIES & COMPLICATIONS: 1- 00000      2-            3-            4-            5-            6-     

                EXTENT * CS TUMOR SIZE (MM):           CS EXTENSION:            CS T EVALUATION: 

                        

      • REG NODES EX: 98     REG NODES POS: 01    CS LYMPH NODES:      CS N EVALUATION: 

                        

      • CS METASTASIS:                           CS M EVALUATION: 

                        

      • CS SS FACTORS: 1:         2:         3:         4:         5:         6:   

                        

      • CS DERIVED:      T:              N:              M:             STAGE:            

                        

      • CS SS1977:              CS SS2000:            AGE AT DX:  76

                        

      • TUMOR MARKERS - MRK1 UNKNOWN/NO INFORMA    MRK2 UNKNOWN/NO INFORMA    MRK3 UNKNOWN/NO INFORMA

                        

      • GEN STAGE     - REGIONAL BOTH DIR EXTN & LYMPH  SEER EXT: 99   LN: 9

                        

      • AJCC EDTN: 99    TUMOR SIZE:  99.9

                        

      • AJCC STAGE    - PATH  T: 3    N: 1    M: 0    GRP: 4A   DESC: 0    STAGE BY: UNKNOWN, NOT STATED   

                                         CLIN  T: X    N: X    M: X    GRP: 99   DESC: 0    STAGE BY: UNKNOWN, NOT STATED     

                        

      • DISTANT SITES - 1: NONE                     2:  NONE                    3: NONE                   

                        

      • PED SYS       - 88  STAGE: 88       STAGE BY: NOT STAGED              

           TREATMENT 1 * 1ST TRT:01/23/1993 SYSTEMIC TRT:00/00/0000 1ST SURG PROC:01/23/1993 MOST DEF SPS:01/23/1993 SURG DISCH:99/99/9999

                        

      • DAYS FROM DX TO 1ST TRT: 000

                        

      • READM SAME HOSP WI 30 DAYS SURG DISCH: UNKNOWN SURG PRIM SITE/UN

            TREATMENTS * SURGERY       - 1 01/23/1993  N 9999     60 RADICAL SURGERY                                          I/O:UNKNOWN      

                        

      • LYM NODE SRG  - 1 01/23/1993  N 9999     0  NONE; DX AT AUTOPSY                                      I/O:UNKNOWN      

                        

      • REG/DIST SRG  - 1 01/23/1993  N 9999     0  NONE; DX AT AUTOPSY                                      I/O:UNKNOWN      

           TREATMENT 2 * SURG MARGINS: NO RESID TUM                                RAD/SURG SEQ: NO RT AND/OR SURG; DX AT

                        

      • RSN NO SURG- PERFORMED/ADMINISTERED; DX           RSN NO RAD- NOT PLAN/ADMINISTERED AS 1S

                        

      • RSN NO CHEMO- NOT ADMINISTERED AS 1ST CR          RSN NO HORMONE- NOT ADMINISTERED AS 1ST CRS

                        

      • RSN NO IMMUNO- NOT ADMINISTERED AS 1ST CR         RSN NO HT/E- NOT ADMINISTERED AS 1ST CRS

                        

      • RSN NO OTHER- NONE; DX AT AUTOPSY      

                        

      • PROTOCOL      - ELIG STATUS: UNKNOWN IF ON PROTOCOL                     PARTICIPATION: UNKNOWN       DATE: N/A   

                        

      •               - PROTOCOL TYPE: UNKNOWN                  REMARKS:          

          DX MISC DATA * SURGEON       - LastName, FirstName      *    MANAGING: AVAILABLE, NOT           * REFERRING: AVAILABLE, NOT           

                        

      • FOLLOWING     - LastName, FirstName      *        ALT2: AVAILABLE, NOT           *      ALT3: AVAILABLE, NOT          

      RPT 04                                                   PATIENT ABSTRACT                                         PAGE   2

      HOSPITAL                                                                                RUN-DATE 08/08/2005

                 

      • PATIENT IDENTIFICATION DATA        *

      ACCESSION NO. 1993/000001(CONT'D)     * PATIENT NAME: GALT,JOHN                              * SITE: C240 02 BILE DUCT - EXTRAHEPATI

                        

      • ALT4          - AVAILABLE, NOT          

                        

      • LETTER SEQ: 1ST: F   2ND: P  3RD: P  4TH: F

                        

      • NEXT FOLLOW-UP SOURCE: NOT FOLLOWED                         FOLLOWING REG- 0099999999

                        

      • CA CONF1      - UNKNOWN                     DATE: N/A        

                        

      • CA CONF2      - UNKNOWN                     DATE: N/A        

                        

      • CA CONF3      - UNKNOWN                     DATE: N/A        

                        

      • REFERRED FROM - 0000000000                                              

                        

      • REFERRED TO   - 0000000000                                              

                        

      • COMMENTS      - PT HAD WHIPPLE PROCEDURE.   3/4/93 XYZ                                       

                        

      • CASE STATUS   - C   QA REVIEW: UNKNOWN           DATE: N/A           INITIALS:   

                 

      STATUS DISPLAY  * PATIENT STATUS:  DATE NEXT FOL: N/A          DATE LAST FOL: 08/15/1993 CA STATUS:EVID THIS TUMOR    

                        

      • 1ST RECURR -07/07/1993    TYPE 1ST REC:REGION REC, INSUFF INFO TO CODE 21-2

                        

      • RECUR DIST SITES- 1: NONE                     2: NONE                               3: NONE                   

                        

      • ICD CAUSE OF DEATH: 7797           ICD REV NO: ICD-9                          CAUSE OF EXP: UNKNOWN                      

                        

      • RESID AT DX   -                              ADDRESS LINE  1             CITY                ST 12345   

                        

      • CNTY/CNTRY:county              CENSUS:              TYPE: 

                        

      • ANNIV DATE    - 04/16/1993   COMPLETE DATE: 04/16/1993   LAST UPDATE: 06/14/1994    CR INIT SUB: 07/20/2001

                        

      • DATA ENTRY: TG    ABSTR: XY                                                         CR LAST SUB: 07/20/2001

                        

      • USER FIELDS   - TRT LETTER:                   

                        

      • STATE FIELDS  -         

                 

      SUPPLEMENTAL: CT SCAN: ABD-N PEL-N  X-RAY: CXR-N ABD-P  OTHER:    -     -   MRI:    -   BONE SCAN:    CHEMO: 1:         2:        

                    TUMOR MARK:      -      -        -      -         -      -                                     3:         4:      

                    RAD 1: SITE:     RADS       RAD 2: SITE:     RADS       RAD 3: SITE:     RADS                  5:         6:      

                    RAD 4: SITE:     RADS       RAD 5: SITE:     RADS       RAD 6: SITE:     RADS           HORM:  1:         2:      

                    COMMENT:                                                                                3:         4:                                                                               

      5:         6:      

             FOLLOW-UP *

         SUBSEQUENT TX * SDSP          - 9 07/07/1993  N 9999     06 BYPASS PROCEDURE PERFORM, BX PRIM/OTHER                  I/O:UNKNOWN      

           08/02/1993  * FLWUP SOURCE  - READMISSION (I/O)               CONTACT MD: LastName, FirstName               

                        

      • CANCER STATUS - EVID THIS TUMOR                 DATA ENTRY: TG   06/14/1994

                        

      • RECUR DATE    - 07/07/1993                            TYPE: REGION REC, INSUFF INFO TO CODE 21-27                      

                        

      • DISTANT SITES - 1: NONE                                  2: NONE                       3: NONE                  

                        

      • COMMENTS      - METS=INT TRANS COLON SM BOW. CA19-9=10 136 CEA=31                            

                 

           08/15/1993  * FLWUP SOURCE  - OTHER (FRIEND/REL)              CONTACT MD: LastName, FirstName               

                        

      • CANCER STATUS - UNK WHETH THS TUMOR PRES        DATA ENTRY: TG   06/14/1994

                        

      • RECUR DATE    - N/A                                   TYPE: DISEASE FREE FOL TRT, NO RECURRENCE                        

                        

      • DISTANT SITES - 1: NONE                                  2: NONE                       3: NONE                  

                        

      • COMMENTS      - MCF                                                                               

      (Please copy and paste to notepad or textpad to view the correct formatting.. I couldn't get it to format correctly)

       

      As you can see, this is 1 patient's record, with 1 level of detail. I tried to capture the whole record at once, but I get an error message saying that the detail line is greater than 64 lines. So I tried capturing 1 block at a time, e.g. Patient identification data, and then the DX Identification. It worked alright, but I get a blank line in the table view for the 1st row when I add a new detail.

      Also, some of the field positions can vary. E.g. "Treatments" may contain 3 rows as shown, but they may be in any order. Moreover, each row can repeat multiple times.

      I think I'm going to need extensive help. So if somebody could please email me, I'd really appreciate it.

       

      Thanks a lot in advance!

       

      Nupur

        • trapping text problem from newbie
          Grant Perkins

          Originally posted by Nupur:

           

          Hello,

          I'm having trouble figuring out how best to capture the following data.

          [font="courier"]RPT 04                                                   PATIENT ABSTRACT                                         PAGE   1

          HOSPITAL                                                                                RUN-DATE 08/08/2005

                     

          ACCESSION NO.  1993/000001            *  MED REC NO.          *   MK:

          • PATIENT IDENTIFICATION DATA        *        PERSONAL DATA           *       EMPLOYMENT DATA     *      SECONDARY CONTACT

          GALT,JOHN                             *  DOB: mm/dd/yyyy MICHIGAN      * SOC SEC NO:xxx-xx-xxxx    * N/A                           

                                                 

          • DOD: mm/dd/yyyy  UNKNOWN IF P *                           *                              

          address line 1                        *  PL DEATH: UNKNOWN             *                           *                              

          City                 St zipcode       *  MALE                          *                           *                                 

          County                                *  WHITE      NON-SPANISH        * CURR IND: N/A             *          

          (999) 999-9999                        *                                * LONG IND: N/A             *                              

          MAID NM: N/A             SFIX: N/A    *  LOST TO FOL:       LNG:       * CURR OCC: N/A             * 

          ALIAS:   N/A             PFIX: N/A    *                                * LONG OCC: N/A             *                LNG:  

          SPSE:  GALT, JANE                     *                                *                           *

          COMMENTS: H/O BLADDER CA 1976(PROV). QUIT SMOK 36Y DRINK 34Y     PT LAST UPDATE DATE  dd/mm/yyyy

                     

          DX IDENTIFICATION* SITE: C240 02  BILE DUCT - EXTRAHEPATI MOR ICDO3: 81403 ADENOCARCINOMA/AD MALIG-PRIMARY   GRD3 POORLY DIFFEREN

                            

          •                                        MOR ICDO2: 81403 ADENOCA/ ADENOMA  MALIG-PRIMARY   GRD3 POORLY DIFFEREN

                            

          • LATERALITY    - NOT A PAIRED SITE             

                            

          • CONFIRMATION  - POSITIVE HISTOLOGY                    REPORT SRC: HOSPITAL (INPT/OUTPT/CLINIC) 

                            

          • CLASS OF CASE - DX RPT FAC & ALL/PART 1ST CRS TRT AT RPT FAC     

                            

          • 1ST CONTACT   - 01/19/1993  INITIAL DX - 01/23/1993  1ST POS BX - 01/23/1993  ADM: 01/19/1993    DISCH: 02/15/1993

                            

          • PRIMARY PAYER - INSURANCE STATUS UNKNOWN           

                            

          • FAM HIST      - NO       TOBACCO HX: PREVIOUS USE            ALCOHOL HX: PAST HISTORY OF ALCOHO MARITAL: MARRIED 

                            

          • COMORBIDTIES & COMPLICATIONS: 1- 00000      2-            3-            4-            5-            6-     

                    EXTENT * CS TUMOR SIZE (MM):           CS EXTENSION:            CS T EVALUATION: 

                            

          • REG NODES EX: 98     REG NODES POS: 01    CS LYMPH NODES:      CS N EVALUATION: 

                            

          • CS METASTASIS:                           CS M EVALUATION: 

                            

          • CS SS FACTORS: 1:         2:         3:         4:         5:         6:   

                            

          • CS DERIVED:      T:              N:              M:             STAGE:            

                            

          • CS SS1977:              CS SS2000:            AGE AT DX:  76

                            

          • TUMOR MARKERS - MRK1 UNKNOWN/NO INFORMA    MRK2 UNKNOWN/NO INFORMA    MRK3 UNKNOWN/NO INFORMA

                            

          • GEN STAGE     - REGIONAL BOTH DIR EXTN & LYMPH  SEER EXT: 99   LN: 9

                            

          • AJCC EDTN: 99    TUMOR SIZE:  99.9

                            

          • AJCC STAGE    - PATH  T: 3    N: 1    M: 0    GRP: 4A   DESC: 0    STAGE BY: UNKNOWN, NOT STATED   

                                             CLIN  T: X    N: X    M: X    GRP: 99   DESC: 0    STAGE BY: UNKNOWN, NOT STATED     

                            

          • DISTANT SITES - 1: NONE                     2:  NONE                    3: NONE                   

                            

          • PED SYS       - 88  STAGE: 88       STAGE BY: NOT STAGED              

               TREATMENT 1 * 1ST TRT:01/23/1993 SYSTEMIC TRT:00/00/0000 1ST SURG PROC:01/23/1993 MOST DEF SPS:01/23/1993 SURG DISCH:99/99/9999

                            

          • DAYS FROM DX TO 1ST TRT: 000

                            

          • READM SAME HOSP WI 30 DAYS SURG DISCH: UNKNOWN SURG PRIM SITE/UN

                TREATMENTS * SURGERY       - 1 01/23/1993  N 9999     60 RADICAL SURGERY                                          I/O:UNKNOWN      

                            

          • LYM NODE SRG  - 1 01/23/1993  N 9999     0  NONE; DX AT AUTOPSY                                      I/O:UNKNOWN      

                            

          • REG/DIST SRG  - 1 01/23/1993  N 9999     0  NONE; DX AT AUTOPSY                                      I/O:UNKNOWN      

               TREATMENT 2 * SURG MARGINS: NO RESID TUM                                RAD/SURG SEQ: NO RT AND/OR SURG; DX AT

                            

          • RSN NO SURG- PERFORMED/ADMINISTERED; DX           RSN NO RAD- NOT PLAN/ADMINISTERED AS 1S

                            

          • RSN NO CHEMO- NOT ADMINISTERED AS 1ST CR          RSN NO HORMONE- NOT ADMINISTERED AS 1ST CRS

                            

          • RSN NO IMMUNO- NOT ADMINISTERED AS 1ST CR         RSN NO HT/E- NOT ADMINISTERED AS 1ST CRS

                            

          • RSN NO OTHER- NONE; DX AT AUTOPSY      

                            

          • PROTOCOL      - ELIG STATUS: UNKNOWN IF ON PROTOCOL                     PARTICIPATION: UNKNOWN       DATE: N/A   

                            

          •               - PROTOCOL TYPE: UNKNOWN                  REMARKS:          

              DX MISC DATA * SURGEON       - LastName, FirstName      *    MANAGING: AVAILABLE, NOT           * REFERRING: AVAILABLE, NOT           

                            

          • FOLLOWING     - LastName, FirstName      *        ALT2: AVAILABLE, NOT           *      ALT3: AVAILABLE, NOT          

          RPT 04                                                   PATIENT ABSTRACT                                         PAGE   2

          HOSPITAL                                                                                RUN-DATE 08/08/2005

                     

          • PATIENT IDENTIFICATION DATA        *

          ACCESSION NO. 1993/000001(CONT'D)     * PATIENT NAME: GALT,JOHN                              * SITE: C240 02 BILE DUCT - EXTRAHEPATI

                            

          • ALT4          - AVAILABLE, NOT          

                            

          • LETTER SEQ: 1ST: F   2ND: P  3RD: P  4TH: F

                            

          • NEXT FOLLOW-UP SOURCE: NOT FOLLOWED                         FOLLOWING REG- 0099999999

                            

          • CA CONF1      - UNKNOWN                     DATE: N/A        

                            

          • CA CONF2      - UNKNOWN                     DATE: N/A        

                            

          • CA CONF3      - UNKNOWN                     DATE: N/A        

                            

          • REFERRED FROM - 0000000000                                              

                            

          • REFERRED TO   - 0000000000                                              

                            

          • COMMENTS      - PT HAD WHIPPLE PROCEDURE.   3/4/93 XYZ                                       

                            

          • CASE STATUS   - C   QA REVIEW: UNKNOWN           DATE: N/A           INITIALS:   

                     

          STATUS DISPLAY  * PATIENT STATUS:  DATE NEXT FOL: N/A          DATE LAST FOL: 08/15/1993 CA STATUS:EVID THIS TUMOR    

                            

          • 1ST RECURR -07/07/1993    TYPE 1ST REC:REGION REC, INSUFF INFO TO CODE 21-2

                            

          • RECUR DIST SITES- 1: NONE                     2: NONE                               3: NONE                   

                            

          • ICD CAUSE OF DEATH: 7797           ICD REV NO: ICD-9                          CAUSE OF EXP: UNKNOWN                      

                            

          • RESID AT DX   -                              ADDRESS LINE  1             CITY                ST 12345   

                            

          • CNTY/CNTRY:county              CENSUS:              TYPE: 

                            

          • ANNIV DATE    - 04/16/1993   COMPLETE DATE: 04/16/1993   LAST UPDATE: 06/14/1994    CR INIT SUB: 07/20/2001

                            

          • DATA ENTRY: TG    ABSTR: XY                                                         CR LAST SUB: 07/20/2001

                            

          • USER FIELDS   - TRT LETTER:                   

                            

          • STATE FIELDS  -         

                     

          SUPPLEMENTAL: CT SCAN: ABD-N PEL-N  X-RAY: CXR-N ABD-P  OTHER:    -     -   MRI:    -   BONE SCAN:    CHEMO: 1:         2:        

                        TUMOR MARK:      -      -        -      -         -      -                                     3:         4:      

                        RAD 1: SITE:     RADS       RAD 2: SITE:     RADS       RAD 3: SITE:     RADS                  5:         6:      

                        RAD 4: SITE:     RADS       RAD 5: SITE:     RADS       RAD 6: SITE:     RADS           HORM:  1:         2:      

                        COMMENT:                                                                                3:         4:                                                                               

          5:         6:      

                 FOLLOW-UP *

             SUBSEQUENT TX * SDSP          - 9 07/07/1993  N 9999     06 BYPASS PROCEDURE PERFORM, BX PRIM/OTHER                  I/O:UNKNOWN      

               08/02/1993  * FLWUP SOURCE  - READMISSION (I/O)               CONTACT MD: LastName, FirstName               

                            

          • CANCER STATUS - EVID THIS TUMOR                 DATA ENTRY: TG   06/14/1994

                            

          • RECUR DATE    - 07/07/1993                            TYPE: REGION REC, INSUFF INFO TO CODE 21-27                      

                            

          • DISTANT SITES - 1: NONE                                  2: NONE                       3: NONE                  

                            

          • COMMENTS      - METS=INT TRANS COLON SM BOW. CA19-9=10 136 CEA=31                            

                     

               08/15/1993  * FLWUP SOURCE  - OTHER (FRIEND/REL)              CONTACT MD: LastName, FirstName               

                            

          • CANCER STATUS - UNK WHETH THS TUMOR PRES        DATA ENTRY: TG   06/14/1994

                            

          • RECUR DATE    - N/A                                   TYPE: DISEASE FREE FOL TRT, NO RECURRENCE                        

                            

          • DISTANT SITES - 1: NONE                                  2: NONE                       3: NONE                  

                            

          • COMMENTS      - MCF                                                                               

          /font[/quote][/quote]Hi and welcome.

           

          Quite a challenge! I expect there could be many different approaches but this is rather a large record style to extract.

           

          Do you need it all?

           

          What will happen to the data once extracted? Is it going to be exported to some other application?

           

          Do you need to break it up into fields for analysis or would one (or several) large block text fields be OK?

           

          Which version of Monarch do you have access to?

           

          UBB "CODE" button seems to work on the format.

           

          Is there any other information that you think would be of use for someone developing a model for this data?

           

          Whatever the answers to the question I think you will be making use of the multi-line field facility (Field Advanced Properties - the Help explains it better than I can. Sometimes referred to as "multi-row".) at some point! It may be worth familiarising yourself with that whilst we consider the solution to your problem.

           

          Look forward to your responses.

           

          Grant

           

          [size="1"][ August 25, 2005, 12:39 PM: Message edited by: Gareth Horton ][/size]

          • trapping text problem from newbie
            Nupur _

            Hi and thanks a lot for the quick response!

             

            Yes, I need to capture all the information in the record. The information needs to be extracted so that it can be stored into a database (SQL Server) as cleanly as possible. Also, each field needs to be separated out. But at the same time, I should be able to put it back together from the database.

             

            I am using Monarch V8.0 and I really like it from the initial read-through of the Learning Guide. I have attempted to do the following:

             

            I have used the multi-line field facility to extract all the fields till the "DX Identification" Line, e.g. Accession, Med.RecNo, Name, Address (using Address block) etc.

             

            My major troubles right now are as follows:

             

            1. I created a template with all the fields until DX Identification as detail. Then I created new append templates for the other areas (e.g. Extent, Treatment 1 etc). But in the table view (which looked good so far), the first row contains NULLs in the newly added columns. So basically, the append template is not adding data to the first row. I'm not sure why this is.

             

            2. As I mentioned in my previous email, the sub-heading "Treatments" has a lot of variation among different records. E.g. it may not exist at all in some records, others may have the fields (Surgery, Lym Node Srg, Reg/Dist Srg) in any order, or it may have repeats (e.g.,

            Surgery

            Lym Node Srg

            Reg/Dist Srg

            Surgery

            Lym Node Srg

            Reg/Dist Srg

            Also, although these are the most common fields for the sub-heading Treatments, there are records where it contains additional fields such as SDSP, Hormone, HTE, formatted exactly as shown in the demo record.

             

            3. If you scroll all the way down, there is a sub-heading called Follow-Up.

            In the demo record I gave, it contains 2 instances of follow-up. However, there may be 0, 1 or n instances of follow-up. How do I adjust?

             

            If you prefer, we can take this discussion offline. My email address is [email="npande@asterand.com"]npande@asterand.com[/email]

             

            Thank you very much.

             

            Regards,

            Nupur

            • trapping text problem from newbie
              Grant Perkins

              Hi,

               

              You have taken on a big project here! Many possible approaches some of which may be found not to work as things progress.

               

              Let's keep this on line for now to encourage other inputs from forum members, though I have no problem in dealing with details off line.

               

              I need to think about this but here are some quick observations.

               

              Appends.

               

              Have to start ABOVE the the detail record. There are ways of 'cheating' that but since you have V8 you may not need to because you can simply make the appends FOOTER type instead. (Not possible to have more than 1 footer before V8.)

               

              BUT you may have a lot of appends - and there are limits we need to work within. So ...

               

              Maybe we need to look for a different level within the data for the detail record. The potential for more than one occurrence of the same treatment (for example - I think that is what we have but ifi it was not treatment it was something else) suggests that maybe the detail shout be at that line level with an append to identify which group the detail belongs to and a higher level append to ID the patient - if you follow what I am suggesting.

               

              If necessary we could split this entire analysis into separate processes for each subsection, export the results and then bring the whole thing back together again for a final output.

               

              And a last thought for this post - if you use the 'Preceding string' property for 'Start field on' on the advance properties you can identify a field that 'floats' with in the rows  - i.e. is not always going to be on the same row and may not always exist. HOWEVER this will only bring back the first occurence of the preceding string. So, of we need to deal with vertically 'floating' fields where there may be multiples of that field within a logical data block we need to be able to break the block into small enough chunks to ensure that each chunk has only one occurrence of the field.

               

              I suppose the bottom line is that it you want this to be something that goes nto a database we have to analyse the report as if it is a database (whch it is in a way) and work out its structure and relationships as they will need to be. Then make it so for the extract.

               

              I hope this makes sense. Please let me know if it doesn't.

               

              Grant

              • trapping text problem from newbie
                Nupur _

                Hi,

                 

                I follow your thoughts so far. I think breaking up the report would probably be a good idea. For example, the most complicated part (at least right now) is the Follow-up. So I can separate that out into a second file with some identifying information such as Accession No, Name and Primary Site (DX Identification: Site).

                 

                So for now, let's assume that the Follow-up part is in a separate file. We can then do 2 separate analyses : 1 for the follow-up part, and 1 for the rest and we can put the 2 analyses together based on the the identifying info.

                 

                So for the non-follow-up part, where do I start?

                 

                Your responses are greatly appreciated.

                 

                Regards,

                Nupur

                • trapping text problem from newbie
                  Bruce _

                  Nupur

                   

                  This looks like an ANSI type of report (I don't work in that feild, so I am not sure)

                   

                  Try a search on this web site under ANSI, som other members might have made models for this already.

                   

                  Other than that, you are are trying to tackle a huge report as a newbie.

                   

                  Good luck

                  • trapping text problem from newbie
                    Grant Perkins

                    Hi Nupur,

                     

                    Thinking about this  - I guess what you really need is a replica of the database(s) the report came from! I assume you don't have direct access to that?

                     

                    So we need to think in terms of lowest level and work upwards from there. The lowest level for your purposes may not be as low as the original database of course.

                     

                    Any section of the report - Treatments for example - which may have multiple occurrences and varied numbers of multiple occurrences suggests the need to treat each occurrence, at the lowest level you need, as a detail record. (If there is the possibility within a treatment of multiple occurrences of the same recorded discrete activity then the detail level would need to be at that point, unless recording those as a single block text entry, or in some other way, was acceptable. That then depends on the analysis you need to be able to do with the data ... and so it goes.)

                     

                    So in database terms we are probably looking at multiple extractions into separate tables for the purposes of the database structure rather than because it is the only way to do it using Monarch!

                     

                    If there are any sections or subsections where the information is required on file but a memo field text form would be acceptable, rather than separation into individual fields, then simply running a multi-line extraction should work fine. However care may be required to ensure that the start and end of the data block can be identified clearly and that the resulting field does not exceed the maximum field capacity of the target database. Or at least the ability of the database's import routine to deal with it.

                     

                    Within that overall concept I am guessing we will need to look at some of the more challenging ways of of using Monarch to get at awkward data.

                     

                    Multi-row data blocks  - I think you have already dealt with those in the patient header section. Name and address blocks are a special case of that type.

                     

                    Fields sometimes being present and sometimes not and/or moving vertically (but not horizontally) within the boundaries of the section of the record. These can often be dealt with successfully providing a preceding string is available to identify the existence or potential existence of the field. All being well the same technique can be used to deal with sections of a report which have variable numbers of lines associated with them.

                     

                    Lines or blocks where the information is located inconsistently within the line. A 'slice and dice' technique can often overcome such problem inputs.

                     

                    Problems with trapping specific data sections. This may also be aided by using slice and dice techniques and sometimes filters as well.

                     

                    There may be other issues that will arise as things develop.

                     

                    I think the key to this is to keep each stage as simple as possible and then consider the best way to group the sections of extraction into single processes if it seem appropriate to do so.

                     

                    If you are happy dealing with the Patient header details (sound like you are) and the Follow-Up section that's great. Let me know your thoughts on the structure and relationships for the other sections (I can guess but you are closer to the specific need so better placed to get it right!) and we can take it from there.

                     

                    The biggest problem I can see right now is deciding which are the best of many ways to approach this. Quite a nice problem really.

                     

                    Have fun,

                     

                     

                    Grant