HP Labs India

Smashing India's language barriers

ZDNet UK News
May 12, 2006

India is a unique country with unique communication problems – no surprise in a country with more than a thousand languages

There's nowhere remotely like India: a billion people, 1,652 languages, three-quarters of the workforce in agriculture but immense numbers of literate, technical graduates pouring out of universities — yet it lags behind its major competitor, China, due to poor infrastructure and incomprehensibly complex bureaucracy.


According to a team of HP engineers from Bangalore, over to meet UK journalists in Bristol as part of the celebrations around HP Labs' 40th birthday, technology can help, but only if everyone can interact with it. Only around 10 percent of Indians can conduct a transaction in English, which has limited the penetration of PCs to around 60 million. There's no further chance of growth unless that particular problem is solved, the Bangalore team claims.


Voice recognition and continuous handwriting recognition have both been tried, but neither is reliable enough for daily use in the community. Instead, HP India has developed the Gesture Keyboard, or GKB, a digitiser pad and stylus combo that works with Indic writing, one of the major families of local language. This includes Urdu, Gujarati and Hindi — there are 400 million Hindi speakers alone, an enormous group to isolate from IT or, if you prefer, an enormous new market more numerous than the European Union.


The key characteristic of this way of writing is that it has a number of base characters in an alphabet, but each may have one of a large set of vowel modifiers — matras — added. For example, Hindi has 36 consonants that can be modified by any of 12 matras, and almost every consonant can be bound to another. That leads to around 1,500 symbols. Trying to input these on an ordinary QWERTY-style keyboard results in huge numbers of keystrokes. Even expert typists find it difficult to master: someone with no exposure to technology can find it alien indeed.


The GKB reflects the way the script actually works. You take the stylus and tap one of the base characters from the set printed onto the tablet, and that enters the character. Or, draw the matra over the character on the tablet: the underlying gesture recognition software in the product then combines the modifier with the base character to produce the right result. HP says that because this is very similar to the way writing is taught in schools, it takes around twenty minutes to pick up and can soon produce around twenty words a minute.


Although the device is relatively expensive — it costs around 40 euros, in a country where a PC can be bought for 200 — HP says it can easily double the income of a cybercafe or kiosk, one of the burgeoning one-man band computer booths that give non-computer owners access to the Internet, government services and so on. It might also be built into keyboards, either by replacing the numeric keypad or by sliding out from beneath.


The GKB is currently available in Hindi and Kannada, the language of the province of HP Labs India, with other versions such as Persian and Arabic in the labs. It takes about a month to add a new language, providing it follows the same basic rules; the project itself took a year to produce, building on four years of work in general purpose handwriting recognition.


PrintCast is another example of technology being adapted to specific needs. This is a way of embedding documents and other data types in video broadcasts: in principle, nothing too different to the way Teletext and other systems have been sending information in TV signals since the 1970s, although PrintCast also understands digital transmissions. But by focusing on printable documents, PrintCast can massively improve retention of the information in the broadcast — very important in areas where access otherwise is limited. There are already plenty of satellite and terrestrial broadcast systems in India — the inventiveness comes in making them double as carriers of printed matter without modification. The new information is mixed in with the programme before transmission, and a separate decoder monitors the video signal at the receiver and copies the documents into a buffer, from which it can be printed.


The system is designed to be used in public forums: people come in to a hall to watch the broadcast, and at each point where a document can be printed, the person running the equipment can run off as many copies as needed, usually for a nominal charge. The advantages for health, agriculture and other local government information transmissions are huge, not least because in many areas other printed material distributed from central government is held up or not distributed at all — documents that tell people their rights are not always popular. This bypasses huge swathes of bureaucracy.


Neither of these two inventions use breakthrough technologies — in principle, either could have been built ten or fifteen years ago with small teams of engineers working with modest resources. Indeed, at the HP Labs Bristol demonstration, a number of journalists were disparaging about the systems — "So, you've plugged a printer into a television. So what?" But like all inventions, no matter how obvious, it took someone who knew of a particular need to make them happen — and their importance has little to do with the mechanics behind them.


The needs of the invisible world, the three billion outside the rich countries, are only just becoming known — as evidenced by HP Labs, which took 36 years to set up an Indian operation, and still got there ahead of most. With that, the company is betting that by meeting the country's needs the benefits will flow both ways. It's not just a matter of opening up another market, albeit one as big as any we've known before. By connecting these people, their ideas, capabilities and potential can flow back, bringing us new information and perspectives that can only help the innovation on which the industry depends.


Click here to read the article at the ZDNet Website

This page was last updated on March 2, 2010