Can a chip help computers see in 3D?

A Silicon Valley start-up believes it can improve computer vision by combining a custom-designed chip with the way humans see.

Human brains judge how far away objects are by comparing the slightly different view each eye sees. Tyzx hopes to build this stereo vision process into video cameras.

The Palo Alto, California-based start-up has encoded a processing scheme into a custom chip called DeepSea, allowing the processor to determine not only the color of each tiny patch of an image but also how far away that patch is from the camera.

The technology could be a boon for surveillance systems, strengthening the ability to track people in banks, stores or airports. But stereo vision could have wider uses as well, helping focus a computer's attention and cutting down on the amount of data that needs to be crunched.

For instance, a vacuuming robot trying to discern a table leg through pattern recognition could avoid getting caught up in examining the wallpaper in the background. Similarly, vehicles could use the technology to detect obstacles in their path while filtering out visual noise.

"The biggest value is the segmentation. It separates out the portion of the image that interests you," said Takeo Kanade, a stereo vision computing pioneer at Carnegie Mellon University and a member of an independent Tyzx advisory board. "You have not only appearance but also distance to each point. That makes the subsequent processing, such as object detection and recognition, significantly easier."

Tyzx's first customers are mostly research labs, with other potential business partners evaluating the technology, chief executive Ron Buck said in an interview. Those who have bought the systems include MD Robotics, the company that makes the robotic arm for the Space Shuttle and, in the future, for the International Space Station.
And ChevronTexaco is employing the equipment for "augmented reality" work -- supplementing what ordinary people see with computer imagery for tasks such as operating oil platform cranes in bad weather.

Tyzx hopes to win customers in the military and surveillance industries, and, as costs go down, to expand into broader "intelligent environments" where, for example, doors could open automatically or a house could send a medical alert if someone has been sitting still for an unusually long time.

But Tyzx faces a solid challenge translating the idea into a workable product. "I believe it's a great idea," Kanade said. "Conceptually it's easy, but computationally it's not."

Tyzx is backed by Vulcan Ventures, the investment firm of Microsoft co-founder Paul Allen. It has fewer than 20 employees, some of whom have years of experience in the field.

John Woodfill and Gaile Gordon launched the company in early 2001, but much of their work precedes that date. A key formula used in the custom chip dates back to 1990, and Tyzx has had prototype chips for about a year, Buck said. It's only recently, though, that Tyzx's ideas have become economically feasible.

Eyes on the prize
Stereo vision may indeed be a leap ahead for computers, but there's still a long way to go before machines can achieve the sophistication of human sight.

"Because vision comes so naturally to us, we don't appreciate the problem intuitively," said David Touretzky, a computational neuroscientist at Carnegie Mellon. "I don't think we got that appreciation until people started trying to build computer systems to see."

A large fraction of the brains of primates such as monkeys, apes and humans is devoted to processing visual information, Touretzky said. There are more than 20 different specialised areas for tasks such as recognizing motion, color, shapes and spatial relationships between objects.

"These areas are all interconnected in ways not fully understood yet," Touretzky said, but together these parts of the brain can discern the difference between the edge of a shadow and the edge of an object or compensate for color shifts that occur when the sun comes out.

Tyzx isn't the only company trying to capitalize on stereo computer vision. Microsoft Research is working on technology that extracts 3D information from 2D pictures. Point Grey Research already has cameras on the market, though its processing algorithms require a full-fledged computer.

In Japan, a company called ViewPlus is working in collaboration with Point Grey Research. Its products, though, combine as many as 60 cameras into a spherical system that produces 20 simultaneous video information streams.

These other companies are taking an approach that differs fundamentally from Tyzx's in one respect: Their systems compare more than two images. Carnegie Mellon's Kanade said it might seem that comparing three images would be a harder computational task, but having more data to work with can in fact make the process simpler.

DeepSea processing
The key development at Tyzx is its custom chip, which runs an algorithm called census correspondence that quickly finds similarities across two streams of video images, each broken up into a square grid of pixels, or picture elements. The chip can perform this comparison 125 times per second on a video image measuring 512 by 512 pixels, yet the 33MHz DeepSea consumes much less power than full-fledged processors such as Intel's Pentium.

"It allows incredibly compute-intensive searching for matching pixels to happen very fast at a very low price. It allows us to bring stereo vision to computers," chief executive Buck said.

Another important development needed to reach Tyzx's low-price targets is camera sensors built using the comparatively inexpensive complementary metal-oxide semiconductor (CMOS) technology -- the same process used to build most computer chips, Buck said. Digital cameras today use more elaborate -- and more expensive -- "charge-coupled devices", or CCDs.

Kanade has an appreciation for the difficulties involved. About 10 years ago he built an expensive but pioneering stereo vision system with many processors that could determine range information by comparing the images from multiple cameras.

Since then, more powerful computer processing abilities have elevated the potential of the field, which Kanade believes will take off once stereo cameras are as cheap as today's ordinary video cameras.

"I'm very impressed with the various attempts which made real-time stereo possible. I think the Tyzx effort may be one of the eventual successes," Kanade said.
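
For readers curious about what a census-style pixel search involves, the following is a minimal sketch of the general census-transform technique, not Tyzx's chip logic, which the article does not describe. Each pixel is encoded as a string of bits recording which neighbours are darker than it, and a left-image pixel is matched to the right-image pixel on the same row whose bit string differs in the fewest positions. The function names, the 3-by-3 window (`radius=1`), the border clamping and the `max_disp` search range are all illustrative assumptions. Practical systems typically sum the bit differences over a surrounding window rather than matching single pixels as done here.

```python
def census_transform(img, radius=1):
    """Encode each pixel as bits: one bit per neighbour, set when the
    neighbour is darker than the centre pixel (window size is an assumption)."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            bits = 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    if dy == 0 and dx == 0:
                        continue
                    # Clamp neighbour coordinates at the image border.
                    ny = min(max(y + dy, 0), h - 1)
                    nx = min(max(x + dx, 0), w - 1)
                    bits = (bits << 1) | (1 if img[ny][nx] < img[y][x] else 0)
            out[y][x] = bits
    return out


def hamming(a, b):
    """Count the bit positions in which two census codes differ."""
    return bin(a ^ b).count("1")


def disparity_map(left, right, max_disp=4, radius=1):
    """For each left-image pixel, pick the horizontal shift (0..max_disp)
    whose right-image census code differs in the fewest bits.
    Ties go to the smallest shift; no window aggregation is done."""
    cl = census_transform(left, radius)
    cr = census_transform(right, radius)
    h, w = len(left), len(left[0])
    disp = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            best_d, best_cost = 0, None
            # Never look past the left edge of the right image.
            for d in range(0, min(max_disp, x) + 1):
                cost = hamming(cl[y][x], cr[y][x - d])
                if best_cost is None or cost < best_cost:
                    best_d, best_cost = d, cost
            disp[y][x] = best_d
    return disp
```

Calling `disparity_map(left, right)` on two equally sized grayscale images, given as lists of rows, returns a grid of per-pixel shifts; larger shifts correspond to patches closer to the cameras. Because the comparison reduces to bit operations, it is the kind of work that suits a small, low-clock-rate chip.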