Software Learns to Tag Photos Charlottesville VA

Thousands of online images from Flickr have already been tagged accurately by a new software program.

Local Companies

MSD
1-434-297-0863
204 Ridge Street
Charlottesville, VA
Richmond Camera Shop Inc
(434) 295-3570
1214 E High St
Charlottesville, VA
Cary's Camera & Digital Imaging
(434) 293-4552
2023 Ivy Rd
Charlottesville, VA
Cvs Pharmacy
(540) 981-1366
Roanoke, VA
Ritz Camera
(540) 786-1833
1674 Carl D Silver Pkwy
Fredericksburg, VA
Penn Camera Exchange Inc
(703) 222-5690
12967 Fair Lakes Pkwy
Fairfax, VA
Canon Virginia
(804) 695-7000
6000 Industrial Dr
Gloucester, VA
Ace Photo Inc
(703) 430-3333
46950 Community Plz # 109
Herndon, VA
Penjacc
(276) 669-5181
1540 Euclid Ave
Bristol, VA
Ritz Camera
(703) 754-4994
7352 Atlas Walk Way
Gainesville, VA

Software Learns to Tag Photos

provided by: 


U.S. researchers have released a new online program for automatically tagging images according to their content. In its first real-world test, the program processed thousands of publicly accessible images available on the photo-sharing site Flickr. At least one accurate tag was generated for 98 percent of all the pictures analysed.

The new software, called ALIPR (Automatic Linguistic Indexing of Pictures), uses a combination of statistical techniques to process an image and assign it a batch of 15 words, arranged in order of perceived relevance. These words may refer to a specific object within the picture, such as a "person" or "car," or to a more general theme, such as "outdoors" or "manmade."

For humans, deciphering an image is deceptively simple. And yet for computers, which can sort through millions of text documents with blistering speed and accuracy, identifying the content of an image remains a devilishly difficult task.

"Recognizing what an image is about semantically is one of the most difficult problems in AI," says Jia Li, a mathematician at Pennsylvania State University, in State College, who created the software with colleague James Wang, a member of the College of Information Sciences and Technology. "Objects in the real world are 3-D," Li explains. "When showing up in an image, they can vary vastly in color, shape, gesture, size, and position, and a computer usually has no prior knowledge about the variations."

Because a complex understanding of the world remains beyond the ability of computers, more-efficient vision-processing algorithms are needed to help them mimic human vision and intelligence.

ALIPR analyses an image pixel by pixel and applies a novel statistical method to calculate the probability that a particular word may describe its content. This involves examining the distribution of color and texture within the image and comparing these features with a stored database of words and images. Li and Wang trained their program using a commercial database containing around 50,000 images that had already been tagged.

Recently, they tested ALIPR on 5,411 previously unseen images available on the popular picture-sharing site Flickr. For 51 percent of these images, the first word generated by ALIPR appeared in users' tags. The program also produced at least one accurate word 98 percent of the time. The researchers employed images made publicly accessible by Flickr users, which were also openly accessible through Flickr's own Application Programming Interface.

By James Lee

Read article at techreview.com

Featured Local Company

MSD

1-434-297-0863
204 Ridge Street
Charlottesville, VA
www.msdevel.com

Rate Article
     
Articles Insider

Rss   Delicious   Digg   Add To My Yahoo   Add To My Google   Bookmark   Search Plugin

Topics:
Advertising Engineering Home Services Retail & Consumer Services
Business Services Entertainment Industrial Goods & Services Software
Career Family Insurance Technology
Cars Financial Services Internet Telecommunications
Computer Hardware Food & Beverage Legal Transportation & Logistics
Construction Health Pets Travel
Education Home Electronics Real Estate Wedding