Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on (2009)
Sept. 15, 2009 to Sept. 18, 2009
We describe the design of an autonomous agent that can teach itself how to translate from a foreign language, by first assembling its own training set, then using it to improve its vocabulary and language model. The key idea is that a Statistical Machine Translation package can be used for the Cross-Language Retrieval Task of assembling a training set from a vast amount of available text (e.g. a large multilingual corpus, or the Web) and then train on that data, repeating that process several times. The stability issues related to such a feedback loop are addressed by a mathematical model, connecting statistical and control-theoretic aspects of the system. We test it on real-world tasks, showing that indeed this agent can improve its translation performance autonomously and in a stable fashion, when seeded with a very small initial training set. The modelling approach we develop for this agent is general, and we believe will be useful for an entire class of self-learning autonomous agents working on the Web.
self-learning, machine translation, stability analysis
Nello Cristianini, Tijl De Bie, Marco Turchi, "An Intelligent Agent That Autonomously Learns How to Translate", Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on, vol. 02, no. , pp. 12-19, 2009, doi:10.1109/WI-IAT.2009.120