Thread: How to convert PPT to text on the Linux platform?

  1. #1
    Registered User
    Join Date
    Aug 2007
    Posts
    2

    How to convert PPT to text on the Linux platform?

    I have been struggling with this problem for days.
    I need a tool to extract text from ppt files, and it must support UTF-8.
    If anyone has any ideas, please contact me <<snipped>>
    Last edited by Salem; 08-14-2007 at 03:46 AM. Reason: snip email address

  2. #2
    Woof, woof! zacs7's Avatar
    Join Date
    Mar 2007
    Location
    Australia
    Posts
    3,459
    Perhaps this should have gone in the "freelance" section? Under jobs and recruitment?

    Steps to success:
    * Read the PowerPoint file (research the ppt format); OpenOffice could greatly help here
    * Work out a way to extract the text
    * Write it to a text file.

    dance.
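    One way to lean on OpenOffice for the steps above is to let it do the parsing and only collect the text afterwards. The sketch below is a rough outline, not a tested recipe: it assumes a modern OpenOffice/LibreOffice install that provides the "soffice" command with headless conversion, plus poppler's "pdftotext", both on the PATH. The helper names build_commands and convert are made up for illustration.

```python
import subprocess
from pathlib import Path


def build_commands(ppt_path, out_dir):
    """Return the two command lines: ppt -> pdf, then pdf -> UTF-8 text."""
    ppt = Path(ppt_path)
    out = Path(out_dir)
    pdf = out / (ppt.stem + ".pdf")
    txt = out / (ppt.stem + ".txt")
    # Assumption: soffice supports --headless, --convert-to and --outdir.
    to_pdf = ["soffice", "--headless", "--convert-to", "pdf",
              "--outdir", str(out), str(ppt)]
    # pdftotext writes the extracted text in the encoding given by -enc.
    to_txt = ["pdftotext", "-enc", "UTF-8", str(pdf), str(txt)]
    return to_pdf, to_txt


def convert(ppt_path, out_dir):
    """Run both steps; raises CalledProcessError if either tool fails."""
    for cmd in build_commands(ppt_path, out_dir):
        subprocess.run(cmd, check=True)
```

    Going through PDF keeps the slide order that OpenOffice renders, at the cost of two external tools.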

  3. #3
    Registered User
    Join Date
    Aug 2007
    Posts
    2
    Does anyone have code for reading the ppt format?
    I need it by the day after tomorrow, but my programming is very poor,
    so I am really struggling with it...

  4. #4
    Deathray Engineer MacGyver's Avatar
    Join Date
    Mar 2007
    Posts
    3,210
    I think zacs7 mentioned a program you can study that does that. Try Google.

  5. #5
    Woof, woof! zacs7's Avatar
    Join Date
    Mar 2007
    Location
    Australia
    Posts
    3,459
    Before asking for more "help", consider reading
    * http://cboard.cprogramming.com/annou...t.php?f=4&a=39 (homework)
    * http://cboard.cprogramming.com/annou...t.php?f=4&a=51 (forum guidelines)

  6. #6
    Kernel hacker
    Join Date
    Jul 2007
    Location
    Farncombe, Surrey, England
    Posts
    15,677
    Unfortunately, if you insist on reading Microsoft-prepared files under Linux, OpenOffice is probably the only solution that doesn't take a huge effort. Of course, you could try to read the file, sort out what's "printable" and send that to a file - but you will very likely get some pretty messy stuff (and there's no guarantee that the text content is in order in the ppt file).

    Of course you could try something like "emacs somename.ppt" and see what it looks like - you should be able to see most of the text (beware that it's probably in Unicode, so "text" will look like
    Code:
    \0t\0e\0x\0t
    or some such).
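    The "sort out what's printable" idea can be sketched roughly as a strings-style scan. This is only an illustration under loud assumptions: it treats the text as UTF-16 little-endian (each ASCII character followed by a zero byte), keeps only ASCII-range runs, and makes no attempt to put the runs in slide order. Any text stored one byte per character, or outside the ASCII range, would need a separate pass. The names utf16le_runs and dump_text are hypothetical.

```python
import re
import sys


def utf16le_runs(data, min_chars=4):
    """Return runs of at least min_chars ASCII-range UTF-16LE characters."""
    # One code unit = a printable ASCII byte followed by a NUL byte.
    pattern = re.compile(rb"(?:[\x20-\x7e]\x00){%d,}" % min_chars)
    return [m.group().decode("utf-16-le") for m in pattern.finditer(data)]


def dump_text(ppt_path, txt_path):
    """Scan a binary file and write the recovered runs as UTF-8 lines."""
    with open(ppt_path, "rb") as src:
        runs = utf16le_runs(src.read())
    with open(txt_path, "w", encoding="utf-8") as dst:
        dst.write("\n".join(runs) + "\n")


if __name__ == "__main__" and len(sys.argv) == 3:
    dump_text(sys.argv[1], sys.argv[2])
```

    Expect noise in the output: font names, template strings and other metadata are stored the same way as slide text.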

    --
    Mats

  7. #7
    Frequently Quite Prolix dwks's Avatar
    Join Date
    Apr 2005
    Location
    Canada
    Posts
    8,057
    How strange . . . search for threads started by czyshoul and you get this thread. Search for posts by czyshoul and you get another thread which czyshoul started, identical to this one, which doesn't appear in his/her "threads started by" list.

    [edit] And this post appears in both of them, even though I only posted once. [/edit]

    [edit=2] I bet my post count went up 2 when I posted this. [/edit]
    dwk

    Seek and ye shall find. quaere et invenies.

    "Simplicity does not precede complexity, but follows it." -- Alan Perlis
    "Testing can only prove the presence of bugs, not their absence." -- Edsger Dijkstra
    "The only real mistake is the one from which we learn nothing." -- John Powell


    Other boards: DaniWeb, TPS
    Unofficial Wiki FAQ: cpwiki.sf.net

    My website: http://dwks.theprogrammingsite.com/
    Projects: codeform, xuni, atlantis, nort, etc.

  8. #8
    Registered User
    Join Date
    Oct 2001
    Posts
    2,129
    It's the same thread: two posts in the same thread.

  9. #9
    Frequently Quite Prolix dwks's Avatar
    Join Date
    Apr 2005
    Location
    Canada
    Posts
    8,057
    You're right. Oops.

    Oh well, it's still weird.
    dwk

  10. #10
    Woof, woof! zacs7's Avatar
    Join Date
    Mar 2007
    Location
    Australia
    Posts
    3,459
    Quote Originally Posted by matsp
    Of course you could try something like "emacs somename.ppt" and see what it looks like - you should be able to see most of the text (beware that it's probably in unicode, so "text" will look like
    Since the newer MS Word documents (.docx) are zip-compressed, I'd presume it's the same story with the newer PowerPoint files (.pptx); the older binary .ppt files are not zipped. But still, there are several versions of the ppt format, so it's going to be hard either way.
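    Whether a given file is a zip container can be checked from its first bytes, and for the zipped .pptx kind the slide text sits in plain XML. The following is a minimal sketch, assuming the usual .pptx layout (slides under ppt/slides/slideN.xml, text in DrawingML "a:t" elements) and assuming that the file number matches slide order, which is not guaranteed. The function names are invented for this example.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

OLE2_MAGIC = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"  # legacy binary .ppt/.doc
ZIP_MAGIC = b"PK\x03\x04"                          # .pptx/.docx container
A_T = "{http://schemas.openxmlformats.org/drawingml/2006/main}t"


def sniff_format(header):
    """Classify a file by its leading bytes: 'ole2', 'zip' or 'unknown'."""
    if header.startswith(OLE2_MAGIC):
        return "ole2"
    if header.startswith(ZIP_MAGIC):
        return "zip"
    return "unknown"


def slide_number(name):
    """Numeric part of 'ppt/slides/slideN.xml', used as the sort key."""
    return int(re.search(r"(\d+)\.xml$", name).group(1))


def pptx_text(source):
    """Return one string per slide; source is a path or binary file object."""
    slides = []
    with zipfile.ZipFile(source) as zf:
        names = [n for n in zf.namelist()
                 if re.fullmatch(r"ppt/slides/slide\d+\.xml", n)]
        for name in sorted(names, key=slide_number):
            root = ET.fromstring(zf.read(name))
            # Collect every text run on the slide, one run per line.
            slides.append("\n".join(t.text or "" for t in root.iter(A_T)))
    return slides
```

    Python strings are Unicode, so writing the result with open(path, "w", encoding="utf-8") gives UTF-8 output. A file that sniffs as "ole2" still needs OpenOffice or a raw scan.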
