LINEAR REGRESSION WITH R AND HADOOP

Journal Title: Challenges of the Knowledge Society - Year 2015, Vol 5, Issue 0

Abstract

In this paper we present a way to solve the linear regression model with R and Hadoop using the Rhadoop library. We show how the linear regression model can be solved even for very large models that require special technologies. For storing the data we used Hadoop and for computation we used R. The interface between R and Hadoop is the open source library RHadoop. We present the main features of the Hadoop and R software systems and the way of interconnecting them. We then show how the least squares solution for the linear regression problem could be expressed in terms of map-reduce programming paradigm and how could be implemented using the Rhadoop library.

Authors and Affiliations

Bogdan OANCEA

Keywords

Related Articles

“EU IN CRISIS: CURRENT CHALLENGES IN THE AREA OF CFSP”/ A LEGAL PERSPECTIVE

The paper aims to capture - from a politico-institutional and legal point of view - the current challenges that the European project is facing with in the field of Common Foreign and Security Policy (CFSP) / Common Secur...

THE ASSESSMENT METHODOLOGIES PTELR, ADRI AND CAE – THREE METHODOLOGIES FOR COORDINATING THE EFFORTS TO IMPROVE THE ORGANIZATIONAL PROCESSES TO ACHIEVE EXCELLENCE

In the paper “The Assessment Methodologies PTELR, ADRI and CAE – Three Methodologies for Coordinating the Efforts to Improve the Organizational Processes to Achieve Excellence” the authors present the basic features of t...

GENERAL ASPECTS REGARDING THE PRIOR DISCIPLINARY RESEARCH

Disciplinary research is the first phase of the disciplinary action. According to art. 251 paragraph 1 of theLabour Code no disciplinary sanction may be ordered before performing the prior disciplinary research.These reg...

AGENCY THEORY AND OPTIMAL CAPITAL STRUCTURE

In the corporate finance, the agency theory tries to explain the behavior of various agents that intervene in the company’s funding (managers, shareholders and debt holders) and to analyze the impact of these behaviors o...

THE ROLE OF WOMEN IN PRESERVING THE ROMANIAN IDENTITY IN LOCAL ADVERTISING

The following paper intends to reveal the involvement of women into coining efficient communication campaigns for different product categories promoted on the Romanian market. Product consumption explicitly depends on ta...

Download PDF file
  • EP ID EP89875
  • DOI -
  • Views 177
  • Downloads 0

How To Cite

Bogdan OANCEA (2015). LINEAR REGRESSION WITH R AND HADOOP. Challenges of the Knowledge Society, 5(0), 1007-1012. https://europub.co.uk/articles/-A-89875