• Hi All

    Please note that at the Chandoo.org Forums there is Zero Tolerance to Spam

    Post Spam and you Will Be Deleted as a User

    Hui...

  • When starting a new post, to receive a quicker and more targeted answer, Please include a sample file in the initial post.

vba script to extract data(examples like headings,titles ect) from pdf doc to word

aniket21

New Member
Hi,

I have attached my problem statement, please go though once and let me know how to solve it through vba.

Thanks
Aniket


0down vote
favorite
Below is the sample format given in the pdf :
2038 3 Main heading of Requirement (Section)
2039 3.1 Requirement 1 (Sub-Section title)
2040 Requirement description……………………………………………………
2041 3.2 Requirement 2 (Sub-Section)
2042 Requirement description……………………………………………………
2043 4 Function Requirements
2044 4.1 Requirement 3
2045 Requirement description……………………………………………………
2046 4.2 Requirement 4
2047 Requirement description……………………………………………………
2048 Requirement description……………………………………………………
2049 Requirement description……………………………………………………
2050 4.3 Requirement 5
2051 Requirement description……………………………………………………
For the above format I want to extract the information in word doc in the below specific manner:
SRS #: 2039
SRS Description: Requirement 1(Sub-section title)
2040 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
SRS #: 2041
SRS Description: Requirement 2(Sub-section title)
2042 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
Observations:
And the list goes on…………………………
Please let me know if need more info or it’s still unclear.
Thanks

0down vote
favorite
Below is the sample format given in the pdf :
2038 3 Main heading of Requirement (Section)
2039 3.1 Requirement 1 (Sub-Section title)
2040 Requirement description……………………………………………………
2041 3.2 Requirement 2 (Sub-Section)
2042 Requirement description……………………………………………………
2043 4 Function Requirements
2044 4.1 Requirement 3
2045 Requirement description……………………………………………………
2046 4.2 Requirement 4
2047 Requirement description……………………………………………………
2048 Requirement description……………………………………………………
2049 Requirement description……………………………………………………
2050 4.3 Requirement 5
2051 Requirement description……………………………………………………
For the above format I want to extract the information in word doc in the below specific manner:
SRS #: 2039
SRS Description: Requirement 1(Sub-section title)
2040 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
SRS #: 2041
SRS Description: Requirement 2(Sub-section title)
2042 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
Observations:
And the list goes on…………………………
Please let me know if need more info or it’s still unclear.
Thanks

0down vote
favorite
Below is the sample format given in the pdf :
2038 3 Main heading of Requirement (Section)
2039 3.1 Requirement 1 (Sub-Section title)
2040 Requirement description……………………………………………………
2041 3.2 Requirement 2 (Sub-Section)
2042 Requirement description……………………………………………………
2043 4 Function Requirements
2044 4.1 Requirement 3
2045 Requirement description……………………………………………………
2046 4.2 Requirement 4
2047 Requirement description……………………………………………………
2048 Requirement description……………………………………………………
2049 Requirement description……………………………………………………
2050 4.3 Requirement 5
2051 Requirement description……………………………………………………
For the above format I want to extract the information in word doc in the below specific manner:
SRS #: 2039
SRS Description: Requirement 1(Sub-section title)
2040 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
SRS #: 2041
SRS Description: Requirement 2(Sub-section title)
2042 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
Observations:
And the list goes on…………………………
Please let me know if need more info or it’s still unclear.
Thanks

0down vote
favorite
Below is the sample format given in the pdf :
2038 3 Main heading of Requirement (Section)
2039 3.1 Requirement 1 (Sub-Section title)
2040 Requirement description……………………………………………………
2041 3.2 Requirement 2 (Sub-Section)
2042 Requirement description……………………………………………………
2043 4 Function Requirements
2044 4.1 Requirement 3
2045 Requirement description……………………………………………………
2046 4.2 Requirement 4
2047 Requirement description……………………………………………………
2048 Requirement description……………………………………………………
2049 Requirement description……………………………………………………
2050 4.3 Requirement 5
2051 Requirement description……………………………………………………
For the above format I want to extract the information in word doc in the below specific manner:
SRS #: 2039
SRS Description: Requirement 1(Sub-section title)
2040 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
SRS #: 2041
SRS Description: Requirement 2(Sub-section title)
2042 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
Observations:
And the list goes on…………………………
Please let me know if need more info or it’s still unclear.
Thanks

0down vote
favorite
Below is the sample format given in the pdf :
2038 3 Main heading of Requirement (Section)
2039 3.1 Requirement 1 (Sub-Section title)
2040 Requirement description……………………………………………………
2041 3.2 Requirement 2 (Sub-Section)
2042 Requirement description……………………………………………………
2043 4 Function Requirements
2044 4.1 Requirement 3
2045 Requirement description……………………………………………………
2046 4.2 Requirement 4
2047 Requirement description……………………………………………………
2048 Requirement description……………………………………………………
2049 Requirement description……………………………………………………
2050 4.3 Requirement 5
2051 Requirement description……………………………………………………
For the above format I want to extract the information in word doc in the below specific manner:
SRS #: 2039
SRS Description: Requirement 1(Sub-section title)
2040 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
SRS #: 2041
SRS Description: Requirement 2(Sub-section title)
2042 Requirement description……………………………………………………
Input:
Expected Output:
Validation Steps:
Test Results:
Observations:
And the list goes on…………………………
Please let me know if need more info or it’s still unclear.
Thanks
 

Attachments

  • problem_statement.txt
    1.2 KB · Views: 3
Hi !

Welcome to Chandoo Excel website !

So as here it's an Excel site it's better you move to a Word forum website …
 
If you have Acrobat pro version, you can directly export pdf file
to an Excel file !

If not, it would be starting with a manual copy from pdf file
and paste to a brand new workbook that you will join here
with also an exact expected result worksheet
as well with a crystal clear explanation of each result line criteria …

By the way, you could proceed a web search
how to import a pdf file in VBA …
 
I don't have the acrobat pro version, but i will try to import from pdf into excel using vba and then extract the data based on the format and for that i will require help.
 
Back
Top