Can anyone suggest a good method for parsing HTML files?
I started out with a linked list in which each node holds a vector of strings: the HTML tag name and the data inside that tag.
Can anyone recommend a better way of doing it?
My linked list would have something like the following:
NODE1 -> vector[0] = "html", vector[1] = "<head><title>hello</title></head>"
NODE2 -> vector[0] = "head", vector[1] = "<title>hello</title>"
NODE3 -> vector[0] = "title", vector[1] = "hello"
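
In code, my current structure is roughly the following. This is only a sketch of the layout described above, not a parser: the names Node and buildExample are made up for illustration, I am assuming std::list for the linked list, and the nodes are filled in by hand for the "hello" example.

```cpp
#include <list>
#include <string>
#include <vector>

// Hypothetical name: one node of the linked list.
// Element 0 is the tag name, element 1 is the raw text inside that tag.
using Node = std::vector<std::string>;

// Hypothetical helper: builds the three example nodes by hand.
// A real parser would have to produce these from the input file.
std::list<Node> buildExample() {
    std::list<Node> nodes;
    nodes.push_back({"html", "<head><title>hello</title></head>"});
    nodes.push_back({"head", "<title>hello</title>"});
    nodes.push_back({"title", "hello"});
    return nodes;
}
```

Note that the inner text is stored again at every level (the title markup appears in both the html and head nodes), and nothing in the list records which tag is nested inside which.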