Hi there!
I want to build an application that reads Word files and generates XML files from the data in each .doc file.
e.g.
.doc file:
Metallica (Ride the Lightning) - 1985, Metal
xml:
Code:
<cd>
  <band>Metallica</band>
  <album>Ride the Lightning</album>
  <year>1985</year>
  <genre>Metal</genre>
</cd>
I assume I need COM, but I can't figure out how to do advanced searching like "find all bold text that comes before a bold left parenthesis, ignoring whitespace" (i.e. using the "(" to separate the band from the album). I'm thinking of COM in combination with regex, but I don't know where to start.
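To make the regex half of the idea concrete, here is a minimal sketch in Python. It assumes the line has already been pulled out of the .doc as plain text, so it ignores the bold formatting entirely and relies only on the "(", ")", "-" and "," separators from the example above. The pattern `LINE_RE` and the helper `line_to_cd` are hypothetical names made up for illustration, and the pattern only covers lines shaped exactly like the sample.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical pattern for lines shaped like "Band (Album) - Year, Genre".
# Whitespace around each separator is ignored.
LINE_RE = re.compile(
    r"^\s*(?P<band>.+?)\s*\((?P<album>.+?)\)\s*-\s*"
    r"(?P<year>\d{4})\s*,\s*(?P<genre>.+?)\s*$"
)

def line_to_cd(line):
    """Return a <cd> element for one line, or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    cd = ET.Element("cd")
    for tag in ("band", "album", "year", "genre"):
        ET.SubElement(cd, tag).text = m.group(tag)
    return cd

if __name__ == "__main__":
    cd = line_to_cd("Metallica (Ride the Lightning) - 1985, Metal")
    print(ET.tostring(cd, encoding="unicode"))
```

Getting the text (and the bold runs) out of Word would still have to be done separately, for example through COM; that part is not shown here.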
Any thoughts/tuts?