+ Reply to Thread
Results 1 to 6 of 6
  1. #1
    Member
    Join Date
    May 2001
    Location
    Beautiful British Columbia
    Posts
    127

    How can I access the content within a browser window?

    I need some way to programmatically access and parse the text data that is downloaded into an instance of IE browser. I would prefer to use VB6 to do this.

    I'm betting the answer lies somewhere within the IE dom, which I'm no expert on.

    Even better yet, is there a way to access the entire html code (script code, tags, text and all) and then activate a filter to access just the text that is displayed to the user?

    Any ideas appreciated!

  2. #2
    I'll take two... CPU's BBA's Avatar
    Join Date
    May 1999
    Location
    Jacksonville Fl, USA
    Posts
    3,012
    What OS?

    You can look in temp internet files.

    To find that temp files folder, basically, just identify one item on the webpage...like a SWF for example, then go to a command prompt and type: "cd\" and then type "dir /s *.swf" that will find any swf file, in a hidden temp folder or not. You can then use explorer to open that folder by typing the full folder name in the address bar. Once the folder is accessed, all temp files will be there ( that means the http page and all elements of it will be in that folder )

    You can then save anything from the page to anywhere you want.
    WINDOWS 2000....Need I say more!


  3. #3
    Member
    Join Date
    May 2001
    Location
    Beautiful British Columbia
    Posts
    127
    Thanks BBA, I hadn't really considered accessing the webpage's data by opening it into a file, but there's got to be that IE can share its data with an outside com object. In the mean time, I'll do that - open saved htm file and parse it.

  4. #4
    Member
    Join Date
    Apr 2001
    Posts
    187
    html is basically a text file. You can read anything out of an html file, just like you would a text file.

  5. #5
    Member strangerstill's Avatar
    Join Date
    Sep 2001
    Location
    Oxford
    Posts
    203
    http://www.msdn.microsoft.com/worksh...tml/mshtml.asp
    This applies more to C/C++, but I imagine it'll be useful.

    Aha! found the references you need:
    • Microsoft Internet Controls (shdocvw.dll)
    • Microsoft HTML Internet Library (mshtml.tlb)

    There might be more, just take a look at those you have available.

  6. #6
    Member
    Join Date
    May 2001
    Location
    Beautiful British Columbia
    Posts
    127
    Well I sort of figured out a solution to my own problem. Here's a link to another forum where I explained what I did, just incase anyone cares.

    http://www.vbip.com/forum/topic.asp?id=107#7377

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts







New Security Features Planned for Firefox 4
Another Laptop Theft Exposes 21K Patients' Data
Oracle Hits to Road to Pitch Data Center Plans
Microsoft Preps Array of Windows Patches
Microsoft Nears IE9 Beta With Final Preview
Simplified Analytics Improve CRM, BI Tools
Android Passes RIM as Top Mobile OS in 2Q
VMware Updates Hyperic System Management
File Monitoring Key to Enterprise Security
LinkedIn Snaps Up SaaS Player mSpoke