Google improves real-time visual translation app with neural net

30.07.2015
The app enables users to point their camera at an object containing words, so they can translate things like menus and signs. The search giant also added 20 languages to its app.

"We want to be able to recognise a letter with a small amount of rotation, but not too much. If we overdo the rotation, the neural network will use too much of its information density on unimportant things. So we put effort into making tools that would give us a fast iteration time and good visualisations," Otavio Good, software engineer for Google Translate, wrote in a blog post.

"Inside of a few minutes, we can change the algorithms for generating training data, generate it, retrain, and visualise.

"To achieve real-time, we also heavily optimized and hand-tuned the math operations. That meant using the mobile processor'sSIMDinstructions and tuning things like matrix multiplies to fit processing into all levels of cache memory."

When reading letters in images, the app filters out background objects such as people, trees and cars. By looking at "blobs of pixels" that share a similar colour and sit close to each other, it recognises them as a continuous line of text to read.
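A rough sketch of that blob-grouping idea, assuming a simple intensity threshold, 4-connected flood fill and a fixed horizontal gap; this is an illustration, not Google's implementation.

```python
from collections import deque

def find_blobs(gray, threshold=128):
    """Group 'ink' pixels (darker than threshold) into blobs via
    4-connected flood fill; each blob is a candidate letter."""
    h, w = len(gray), len(gray[0])
    seen = [[False] * w for _ in range(h)]
    blobs = []
    for y in range(h):
        for x in range(w):
            if seen[y][x] or gray[y][x] >= threshold:
                continue
            queue, blob = deque([(y, x)]), []
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                blob.append((cy, cx))
                for ny, nx in ((cy-1, cx), (cy+1, cx), (cy, cx-1), (cy, cx+1)):
                    if 0 <= ny < h and 0 <= nx < w and not seen[ny][nx] \
                            and gray[ny][nx] < threshold:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            blobs.append(blob)
    return blobs

def bounding_box(blob):
    ys = [y for y, _ in blob]
    xs = [x for _, x in blob]
    return (min(xs), min(ys), max(xs), max(ys))

def group_into_lines(blobs, max_gap=10):
    """Chain blobs whose boxes are close horizontally and overlap
    vertically into candidate lines of text."""
    boxes = sorted(bounding_box(b) for b in blobs)   # left to right
    lines = []
    for box in boxes:
        for line in lines:
            prev = line[-1]
            near = box[0] - prev[2] <= max_gap
            overlaps = box[1] <= prev[3] and box[3] >= prev[1]
            if near and overlaps:
                line.append(box)
                break
        else:
            lines.append([box])
    return lines

page = [[255] * 12 for _ in range(5)]
page[2][2] = page[2][3] = page[2][6] = page[2][7] = 0   # two tiny "letters"
print(group_into_lines(find_blobs(page)))   # one line containing two boxes
```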

The app was trained using a convolutional neural network to learn what letters in different languages look like and to differentiate letters from non-letters.
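A minimal sketch of such a classifier in PyTorch, assuming 32x32 grayscale patches and a 26-letters-plus-"junk" class layout; the real network's size and class set aren't given in the post.

```python
import torch
from torch import nn

class LetterNet(nn.Module):
    """Small convolutional classifier: given a 32x32 grayscale patch,
    predict one of N character classes or a 'not a letter' class."""
    def __init__(self, num_classes):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                      # 32x32 -> 16x16
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                      # 16x16 -> 8x8
        )
        self.classifier = nn.Linear(32 * 8 * 8, num_classes)

    def forward(self, x):
        x = self.features(x)
        return self.classifier(x.flatten(1))

# 26 letters plus one "junk" class for non-letter blobs.
model = LetterNet(num_classes=27)
patch = torch.randn(1, 1, 32, 32)      # dummy batch of one patch
logits = model(patch)                  # shape: (1, 27)
```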

A letter generator was also built to add noise, such as smudges and rotation, to the letters or characters used for training, so that the app does not always need clear, well-presented text in order to work.
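The kind of augmentation described can be sketched with Pillow, assuming rendered font glyphs plus random rotation, blur and speckle; the actual generator's noise model is not specified in the post.

```python
import random
from PIL import Image, ImageDraw, ImageFilter, ImageFont

def synth_letter(ch, size=32):
    """Render a character, then dirty it up with a small rotation,
    blur ('smudge') and random speckle noise, so the classifier
    learns to cope with imperfect real-world text."""
    img = Image.new("L", (size, size), color=255)
    draw = ImageDraw.Draw(img)
    font = ImageFont.load_default()          # placeholder font
    draw.text((size // 4, size // 4), ch, fill=0, font=font)

    # Small rotation only: too much rotation wastes model capacity
    # on orientations the camera will rarely produce.
    img = img.rotate(random.uniform(-10, 10), fillcolor=255)

    # Smudge and speckle.
    img = img.filter(ImageFilter.GaussianBlur(random.uniform(0, 1.0)))
    pixels = img.load()
    for _ in range(random.randint(0, 20)):
        x, y = random.randrange(size), random.randrange(size)
        pixels[x, y] = random.randint(0, 255)
    return img

samples = [(synth_letter(c), c) for c in "ABC" for _ in range(10)]
```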

Once the letters are recognised, the app looks them up in dictionaries for the different languages, and it can still recognise a word from a group of letters even if one letter is accidentally read as a number. For example, if the 'S' in 'super' is misread as a '5', the surrounding letters still allow the word to be recognised.
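One way to picture that tolerance is a lookup that falls back to a table of commonly confused characters; this is a toy illustration, not Google's dictionary approach, and the word list and confusion pairs are made up.

```python
# Characters a recogniser commonly confuses, e.g. 'S' vs '5'.
CONFUSABLE = {"5": "s", "0": "o", "1": "l", "8": "b"}

DICTIONARY = {"super", "soup", "market"}   # tiny stand-in word list

def lookup(raw):
    """Try the raw recognised string first, then a version with
    confusable characters swapped, so '5uper' still maps to 'super'."""
    word = raw.lower()
    if word in DICTIONARY:
        return word
    fixed = "".join(CONFUSABLE.get(ch, ch) for ch in word)
    return fixed if fixed in DICTIONARY else None

print(lookup("5uper"))   # -> 'super'
```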

The translation is then rendered on top of the original words.

"We can do this because we've already found and read the letters in the image, so we know exactly where they are. We can look at the colours surrounding the letters and use that to erase the original letters. And then we can draw the translation on top using the original foreground colour."

(www.cio.com.au)

Rebecca Merrett
