I need to pull a number of pieces of information from a web page. Some of them may or may not be present, and some are in tables.
This is from a free, publicly accessible chemical database that has no way of getting data back except as the HTML used to display the page... and I need to pull out information for a few thousand compounds, so it needs to be done in code, not manually using a browser and copy and paste.
But the string I get for the page's HTML is such a mess that there is no obvious way to pull out what I need. It would be a different story if the text were line by line, in the same order as on the displayed web page.
Does anyone have a suggestion for how I might approach this?
Is there a free command-line tool I can call from a Xojo app to help with this?
Thanks,
Karen