Alayon - Tecnologia da Informação

sábado, 2 de dezembro de 2017

Artificial intelligence goes bilingual—without a dictionary

By Matthew HutsonNov. 28, 2017 , 4:30 PM

Automatic language translation has come a long way, thanks to neural networks—computer algorithms that take inspiration from the human brain. But training such networks requires an enormous amount of data: millions of sentence-by-sentence translations to demonstrate how a human would do it. Now, two new papers show that neural networks can learn to translate with no parallel texts—a surprising advance that could make documents in many languages more accessible.

“Imagine that you give one person lots of Chinese books and lots of Arabic books—none of them overlapping—and the person has to learn to translate Chinese to Arabic. That seems impossible, right?” says the first author of one study, Mikel Artetxe, a computer scientist at the University of the Basque Country (UPV) in San Sebastiàn, Spain. “But we show that a computer can do that.”

Most machine learning—in which neural networks and other computer algorithms learn from experience—is “supervised.” A computer makes a guess, receives the right answer, and adjusts its process accordingly. That works well when teaching a computer to translate between, say, English and French, because many documents exist in both languages. It doesn’t work so well for rare languages, or for popular ones without many parallel texts.

The two new papers, both of which have been submitted to next year’s International Conference on Learning Representations but have not been peer reviewed, focus on another method: unsupervised machine learning. To start, each constructs bilingual dictionaries without the aid of a human teacher telling them when their guesses are right. That’s possible because languages have strong similarities in the ways words cluster around one another. The words for table and chair, for example, are frequently used together in all languages. So if a computer maps out these co-occurrences like a giant road atlas with words for cities, the maps for different languages will resemble each other, just with different names. A computer can then figure out the best way to overlay one atlas on another. Voilà! You have a bilingual dictionary.

The new papers, which use remarkably similar methods, can also translate at the sentence level. They both use two training strategies, called back translation and denoising. In back translation, a sentence in one language is roughly translated into the other, then translated back into the original language. If the back-translated sentence is not identical to the original, the neural networks are adjusted so that next time they’ll be closer. Denoising is similar to back translation, but instead of going from one language to another and back, it adds noise to a sentence (by rearranging or removing words) and tries to translate that back into the original. Together, these methods teach the networks the deeper structure of language.

There are slight differences between the techniques. The UPV system back translates more frequently during training. The other system, created by Facebook computer scientist Guillaume Lample, based in Paris, and collaborators, adds an extra step during translation. Both systems encode a sentence from one language into a more abstract representation before decoding it into the other language, but the Facebook system verifies that the intermediate “language” is truly abstract. Artetxe and Lample both say they could improve their results by applying techniques from the other’s paper.

In the only directly comparable results between the two papers—translating between English and French text drawn from the same set of about 30 million sentences—both achieved a bilingual evaluation understudy score (used to measure the accuracy of translations) of about 15 in both directions. That’s not as high as Google Translate, a supervised method that scores about 40, or humans, who can score more than 50, but it’s better than word-for-word translation. The authors say the systems could easily be improved by becoming semisupervised–having a few thousand parallel sentences added to their training.

In addition to translating between languages without many parallel texts, both Artetxe and Lample say their systems could help with common pairings like English and French if the parallel texts are all the same kind, like newspaper reporting, but you want to translate into a new domain, like street slang or medical jargon. But, “This is in infancy,” Artetxe’s co-author Eneko Agirre cautions. “We just opened a new research avenue, so we don’t know where it’s heading.”

“It’s a shock that the computer could learn to translate even without human supervision,” says Di He, a computer scientist at Microsoft in Beijing whose work influenced both papers. Artetxe says the fact that his method and Lample’s—uploaded to arXiv within a day of each other—are so similar is surprising. “But at the same time, it’s great. It means the approach is really in the right direction.”

sexta-feira, 1 de dezembro de 2017

Conheça os novos recordes da computação quântica

Redação do Site Inovação Tecnológica - 01/12/2017

Computador quântico online gratuito e novos recordes da computação quântica

Representação artística de uma simulação quântica. Lasers manipulam uma série de 53 qubits atômicos para estudar a dinâmica do magnetismo.[Imagem: E. Edwards/JQI]

Recorde de simulador quântico

Batendo o recente recorde da equipe do MIT e da Universidade de Harvard, que apresentaram um simulador quântico de 51 qubits em agosto passado, uma equipe da Universidade de Maryland, também nos EUA, desenvolveu um simulador quântico com 53 qubits.

Como em todos os recordes o ganho parece pequeno, mas na verdade as possibilidades de cálculo dobram com cada qubit adicional. Além disso, nenhum supercomputador atual consegue lidar com problemas descritos por mais do que 20 objetos quânticos - ou 20 qubits.

Os qubits nesta nova demonstração são íons do elemento itérbio mantidos fixos por eletrodos de ouro.

"Cada íon qubit é um relógio atômico estável que pode ser perfeitamente replicado. Eles são efetivamente conectados em conjunto por feixes de laser externos. Isto significa que o mesmo dispositivo pode ser reprogramado e reconfigurado, de fora, para se adaptar a qualquer tipo de simulação quântica ou futura aplicação de computação quântica que surgir," disse o professor Christopher Monroe, coordenador da equipe.

Na verdade a coisa não é tão genérica assim. Como cada qubit se comporta como um minúsculo ímã, com sua magnetização alterando-se por meio do laser, este simulador quântico é adequado para estudos de problemas relacionados ao magnetismo em escala atômica e molecular. Outros tipos de cálculos continuarão esperando por um computador quântico mais genérico, com interações arbitrariamente programáveis.

Imagem do circuito quântico mostrando 10 qubits supercondutores (formas de estrela) interligados por um barramento central ressonante (cinza). [Imagem: Chao Song et al. - 10.1103/PhysRevLett.119.180511]

Entrelaçamento de 10 qubits

Físicos chineses conseguiram colocar 10 qubits em entrelaçamento (emaranhamento) simultâneo no interior de um circuito supercondutor, um a mais do que o recorde anterior.

O estado de 10 qubits é o maior estado multiqubits já criado em qualquer sistema de estado sólido e representa um passo importante para a computação quântica na plataforma de qubits supercondutores.

Os circuitos supercondutores são mais simples e menores e sua baixa temperatura lida bem com o fantasma da decoerência, a perda do dado pelo qubit em razão das múltiplas interferências a que os sistemas quânticos estão sujeitos.

Os qubits são feitos de pequenos pedaços de alumínio, conectados por um barramento. O chip permite o entrelaçamento entre quaisquer dois qubits, pode produzir múltiplos entrelaçamentos ou entrelaçar simultaneamente todos os 10 qubits.

A rede neural quântica é formada por um loop de 1km de fibra óptica, com a informação codificada em 2.000 parâmetros de oscilação da luz - cada um é uma superposição de "0" e "1". O resultado são os valores dos parâmetros depois de 1.000 voltas pelo loop. [Imagem: QNNCloud]

Computador quântico online gratuito

Uma equipe multi-institucional do Japão disponibilizou para uso online a qualquer interessado sua Rede Neural Quântica (QNN: Quantum Neural Network).

Trata-se de um sistema óptico capaz de resolver problemas de otimização combinatorial, como o conhecido problema do caixeiro-viajante, em que deve-se determinar a melhor sequência de cidades que um vendedor deve visitar - os algoritmos clássicos, ou não-quânticos, só conseguem resolver esse problema na força bruta.

Diferentemente de um computador quântico universal, que poderia resolver qualquer tipo de problema, a rede neural quântica é projetada para otimizar sistemas que possam ser descritos por modelos Ising - sistemas formados por unidades que podem assumir um de dois valores (0 ou 1) e somente interagem com seus vizinhos.

Esse é um limitador maior do que o computador quântico que a IBM disponibiliza pela internet, por exemplo, mas os modelos Ising podem ser usados para descrever uma ampla variedade de fenômenos.

O cadastramento para acesso pode ser feito no site da QNNCloud.

Bibliografia:

10-Qubit Entanglement and Parallel Logic Operations with a Superconducting Circuit

Chao Song, Kai Xu, Wuxin Liu, Chui-ping Yang, Shi-Biao Zheng, Hui Deng, Qiwei Xie, Keqiang Huang, Qiujiang Guo, Libo Zhang, Pengfei Zhang, Da Xu, Dongning Zheng, Xiaobo Zhu, H. Wang, Y.-A. Chen, C.-Y. Lu, Siyuan Han, Jian-Wei Pan

Physical Review Letters

Vol.: 119, 180511

DOI: 10.1103/PhysRevLett.119.180511

Observation of a many-body dynamical phase transition with a 53-qubit quantum simulator. Nature, 2017;

J. Zhang, G. Pagano, P. W. Hess, A. Kyprianidis, P. Becker, H. Kaplan, A. V. Gorshkov, Z.-X. Gong, C. Monroe

Nature

Vol.: 551 (7682): 601

DOI: 10.1038/nature24654