Can anyone suggest a good method for parsing HTML files?
I started out with a plain linked list in which each node holds a vector of strings: the first string is the HTML tag name and the second is the content inside that tag.
Can anyone recommend a better way of doing it?
Example:
<html><head><title>hello</title></head>
My linked list would then contain something like the following:
NODE1 -> vector[0] = "html"   vector[1] = "<head><title>hello</title></head>"
NODE2 -> vector[0] = "head"   vector[1] = "<title>hello</title>"
NODE3 -> vector[0] = "title"  vector[1] = "hello"
etc.
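
In C++, the structure I have at the moment looks roughly like this. It is only a sketch: the names Node, fields, and build_example are made up for illustration, and the list is filled in by hand for the example above rather than by a real parser.

```cpp
#include <list>
#include <string>
#include <vector>

// One list node (hypothetical name).
// fields[0] is the tag name, fields[1] is everything inside that tag.
struct Node {
    std::vector<std::string> fields;
};

// Builds, by hand, the list described for the example above.
// No actual parsing happens here; this only shows the layout.
std::list<Node> build_example() {
    std::list<Node> nodes;
    nodes.push_back(Node{{"html", "<head><title>hello</title></head>"}});
    nodes.push_back(Node{{"head", "<title>hello</title>"}});
    nodes.push_back(Node{{"title", "hello"}});
    return nodes;
}
```

Calling build_example() gives a list of three nodes, one per tag, in the order the tags open. Note that the inner markup is stored again in every enclosing node, which is part of why I suspect there is a better way.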