I have trouble this problem for days.
I need a tool to extact text from ppt ,especially UTF-8 support.
if someone has some ideas,please contact me <<snipped>>
I have trouble this problem for days.
I need a tool to extact text from ppt ,especially UTF-8 support.
if someone has some ideas,please contact me <<snipped>>
Last edited by Salem; 08-14-2007 at 03:46 AM. Reason: snip email address
Perhaps this should have gone in the "freelance" section? Under jobs and recruitment?
Steps to success:
* Read the powerpoint file (research the ppt format), OpenOffice could greatly help here
* Work out a way to extract text
* write it to a text file.
dance.
Does anyone have the code for read ppt format?
I need it the day after tomorrow., but my programming war so poor.
so I was very trouble it...
Think zacs7 mentioned a program you can study that does that. Try google.
Before asking for more "help", consider reading
* http://cboard.cprogramming.com/annou...t.php?f=4&a=39 (homework)
* http://cboard.cprogramming.com/annou...t.php?f=4&a=51 (forum guidelines)
Unfortunately, if you insist on reading microsoft prepared files under Linux, OpenOffice is probably the only solution without huge efforts. Of course, you could try to read the file and sort out what's "printable" and send that to a file - but it's very likely that you will get some pretty messy stuff (and it's no guarantee that the text-content is in order in the ppt file).
Of course you could try something like "emacs somename.ppt" and see what it looks like - you should be able to see most of the text (beware that it's probably in unicode, so "text" will look likeor some such.Code:\0t\0e\0x\0t
--
Mats
How strange . . . search for threads started by czyshoul and you get this thread. Search for posts by czyshoul and you get another thread which czyshoul started, identical to this one, which doesn't appear in his/her "threads started by" list.
[edit] And this post appears in both of them, even though I only posted once. [/edit]
[edit=2] I bet my post count went up 2 when I posted this. [/edit]
dwk
Seek and ye shall find. quaere et invenies.
"Simplicity does not precede complexity, but follows it." -- Alan Perlis
"Testing can only prove the presence of bugs, not their absence." -- Edsger Dijkstra
"The only real mistake is the one from which we learn nothing." -- John Powell
Other boards: DaniWeb, TPS
Unofficial Wiki FAQ: cpwiki.sf.net
My website: http://dwks.theprogrammingsite.com/
Projects: codeform, xuni, atlantis, nort, etc.
it's the same thread, two posts in the same thread.
You're right. Oops.
Oh well, it's still weird.
dwk
Seek and ye shall find. quaere et invenies.
"Simplicity does not precede complexity, but follows it." -- Alan Perlis
"Testing can only prove the presence of bugs, not their absence." -- Edsger Dijkstra
"The only real mistake is the one from which we learn nothing." -- John Powell
Other boards: DaniWeb, TPS
Unofficial Wiki FAQ: cpwiki.sf.net
My website: http://dwks.theprogrammingsite.com/
Projects: codeform, xuni, atlantis, nort, etc.
Since MS Word documents are compressed (zip), I'd presume it'd be the same story with powerpoint. But still, there are serveral versions of the ppt format, it's going to be hard either wayOriginally Posted by matsp