This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Online Learning in BitTorrent Systems
Dec. 2012 (vol. 23 no. 12)
pp. 2280-2288
Rafit Izhak-Ratzin, Palo Alto Networks, Santa Clara
Hyunggon Park, Ewha Womans University, Seoul
Mihaela van der Schaar, University of California Los Angeles, Los Angeles
We propose a BitTorrent-like protocol based on an online learning (reinforcement learning) mechanism, which can replace the peer selection mechanisms in the regular BitTorrent protocol. We model the peers' interactions in the BitTorrent-like network as a repeated stochastic game, where the strategic behaviors of the peers are explicitly considered. A peer that applies the reinforcement learning (RL)-based mechanism uses the observations on the associated peers' statistical reciprocal behaviors to determine its best responses and estimate the corresponding impact on its expected utility. The policy determines the peer's resource reciprocations such that the peer can maximize its long-term performance. We have implemented the proposed mechanism and incorporated it into an existing BitTorrent client. Our experiments performed on a controlled Planetlab testbed confirm that the proposed protocol 1) promotes fairness and provides incentives to contributed resources, i.e., high capacity peers improve their download completion time by up to 33 percent, 2) improves the system stability and robustness, i.e., reduces the peer selection fluctuations by 57 percent, and (3) discourages free-riding, i.e., peers reduce their uploads to free-riders by 64 percent as compared to the regular BitTorrent protocol.
Index Terms:
Peer to peer computing,Protocols,Learning systems,Resource management,Games,foresighted resource reciprocation strategy,Peer-to-peer (P2P),BitTorrent,reinforcement learning
Citation:
Rafit Izhak-Ratzin, Hyunggon Park, Mihaela van der Schaar, "Online Learning in BitTorrent Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 23, no. 12, pp. 2280-2288, Dec. 2012, doi:10.1109/TPDS.2012.90
Usage of this product signifies your acceptance of the Terms of Use.