How to recognize an image on the screen

**lala123** · 02-12-2010

Hi guys, could you point me toward the APIs, functions, etc. I should look at in order to recognize an image on the screen and retrieve its coordinates?

I hope that was clear enough hehe.

thanks in advance!

**sean** · 02-12-2010

Are you talking about having a program know what's on the screen, look for a specific image, and know what its location is on the screen? I've never done that before, but you probably want to first figure out how to get an image of the screen, then how to process that image. If a graphics library like SDL doesn't do screen capture (and I don't think it does), it'll depend on your OS/environment. The technical term is "screen capture", so try searching for that. What are you trying to accomplish, though? If you want to know where on the screen part of your own program is, there are much easier ways to pass info between processes.

**jeffcobb** · 02-12-2010

Well, your question was vague enough to warrant such an answer. One way of accomplishing what you want is called the Kohonen self-organizing map:
Self-organizing map - Wikipedia, the free encyclopedia

This will help with the image-recognition part. If you want more specifics, be a little clearer in your message.

**lala123** · 02-12-2010

Ok I will be more specific: I plan on doing an auto-clicker program. So there will be parts of the screen I need to click on. I must find where these images are and then click on their position.

I'll check your link now jeff

**sean** · 02-12-2010

Say, this wouldn't be about cheating in a game or writing a bot, now would it?

**lala123** · 02-12-2010

Well, I read the forum guidelines and I don't believe my intentions break any of the rules, but if you do think so...

It's about preventing me from getting carpal tunnel syndrome due to excessive clicking hehe

**lala123** · 02-13-2010

hmmm... anyone?

**MK27** · 02-13-2010

hmmm... WRT APIs and such, this will be a platform-specific project. You haven't mentioned your OS.

**lala123** · 02-13-2010

OS is Windows

**MK27** · 02-13-2010

Okay, and by "image" you mean exactly what? Give us a very very specific, actual for real example here.

Do you mean like "icon on my desktop", or maybe "picture in my web browser" or maybe "an object in a larger image", such as an enemy vehicle in Far Cry?

**lala123** · 02-13-2010

Well, imagine there's a little car in a Flash game that has many other objects, and I need to click on that car. So I would have to recognize where the car is, get its position, and then click on that position.

**jeffcobb** · 02-13-2010

Well then it would seem like the following would work:
1. Offline, train the net (neural or other) to recognize the car. Then with good training data...
2. Programmatically isolate the window with the image
3. Based on image size, iterate through all possible positions in the window where the image might be.
4. If found, send the click event to the proper X/Y coords.

6. Profit!
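
Steps 3 and 4 above can be sketched roughly as follows. This is a minimal Python sketch, not a definitive implementation: `scan_for_target`, `click_point`, and `looks_like_car` are hypothetical names, the window is assumed to be already captured as a list of rows of grayscale values, and the toy brightness check only stands in for whatever trained recognizer step 1 produces. Capturing the window and sending the click are platform specific and are left out.

```python
def scan_for_target(window, patch_w, patch_h, recognizer):
    """Return window-relative (x, y) of the first patch the recognizer accepts, else None."""
    win_h = len(window)
    win_w = len(window[0]) if win_h else 0
    # Try every top-left corner at which a patch of this size still fits.
    for y in range(win_h - patch_h + 1):
        for x in range(win_w - patch_w + 1):
            patch = [row[x:x + patch_w] for row in window[y:y + patch_h]]
            if recognizer(patch):
                return (x, y)
    return None


def click_point(window_origin, match_pos, patch_w, patch_h):
    """Convert a window-relative match into the screen coordinates of its centre."""
    ox, oy = window_origin
    mx, my = match_pos
    return (ox + mx + patch_w // 2, oy + my + patch_h // 2)


def looks_like_car(patch):
    # Hypothetical stand-in for a trained recognizer: accept all-bright patches.
    return all(p > 200 for row in patch for p in row)


# Toy 5x4 "window" with a bright 2x2 object at x=2, y=1.
window = [
    [0, 0, 0, 0, 0],
    [0, 0, 255, 255, 0],
    [0, 0, 255, 255, 0],
    [0, 0, 0, 0, 0],
]
pos = scan_for_target(window, 2, 2, looks_like_car)
if pos is not None:
    # Assumes the window's top-left corner sits at screen position (100, 50).
    target = click_point((100, 50), pos, 2, 2)
```

The scan visits every position, so its cost grows with window area times patch area; restricting the search to the game window (step 2) is what keeps that manageable.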

**jeffcobb** · 02-13-2010

Now if you are using this to cheat a fast-moving game this approach will NOT be sufficient. There is a better way to beat that but I think it would lie outside the purview of this forum...

**MK27** · 02-13-2010

I think that approach will be sufficient to keep most people busy for a few weeks anyway.

**lala123** · 02-13-2010

Hmm... so maybe take a screenshot, then try comparing the image I'm looking for against every "block" of pixels to see if they match?

Is that what you're saying?
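
The block-by-block comparison described in this post might look something like the sketch below. It is only an illustration under assumptions: `find_exact` is a hypothetical name, and both the screenshot and the image being searched for are assumed to be already loaded as lists of rows of `(r, g, b)` tuples. An exact comparison like this would likely miss the target if the game scales, animates, or anti-aliases the sprite, in which case a tolerance-based score or a trained recognizer would be needed instead.

```python
def find_exact(screen, needle):
    """Return (x, y) of the first position where needle matches screen exactly, else None.

    Both arguments are lists of rows; each row is a list of (r, g, b) tuples.
    """
    screen_h, screen_w = len(screen), len(screen[0])
    needle_h, needle_w = len(needle), len(needle[0])
    for y in range(screen_h - needle_h + 1):
        for x in range(screen_w - needle_w + 1):
            # Compare one row at a time; all() stops at the first mismatching row.
            if all(screen[y + dy][x:x + needle_w] == needle[dy]
                   for dy in range(needle_h)):
                return (x, y)
    return None


# Toy data: a 4x3 "screenshot" containing a 2x2 pattern at x=1, y=1.
R, B, W = (255, 0, 0), (0, 0, 0), (255, 255, 255)
screen = [
    [B, B, B, B],
    [B, R, W, B],
    [B, W, R, B],
]
needle = [
    [R, W],
    [W, R],
]
found = find_exact(screen, needle)
```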
