Big Computing: Performance comparison of subset in R to filter in the deployer package

Friday, July 17, 2015

Performance comparison of subset in R to filter in the deployer package

Recently I have been using RStudio’s dplyr package more and more. I started using the package mostly because of the convience of having all the manipulations I want to use on the data set all in one place. I had also started to use the “pipes” with the ggvis package so I also like the was the code looked as well. Frankly I have gotten a lot addicted to writting pipped code. Anyway I start to notice the dplyr really plowed through the work much faster than using the base R functions. I have been told this is because dplyr leverages data.table and other speed up approaches. So I thought I would test it out and see what the difference really is. I have a new Mac powerbook so all my results are off of that. I also use the hflights data set that used for the examples of the dplyr data set.
First I need to require the dplyr and hflights packages. The hflights data set is reasonably large.

require(dplyr)

## Loading required package: dplyr
## 
## Attaching package: 'dplyr'
## 
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## 
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union

require(hflights)

## Loading required package: hflights

dim(hflights)

## [1] 227496     21

The command subset() is similar to filter() in dplyr. So lets compare their performance

ptm1<-proc.time()
a<-subset(hflights, Distance>1500)
ptm1<-proc.time()-ptm1

ptm2<-proc.time()
b<-hflights %>%
  filter(Distance>1500)
ptm2<-proc.time() -ptm2

print(ptm1)

##    user  system elapsed 
##   0.023   0.007   0.030

print(ptm2)

##    user  system elapsed 
##   0.009   0.001   0.011

That is roughly a three fold decrease in total time!

6 comments:

قمم التميزAugust 17, 2017 at 7:01 PM
شركة عزل اسطح
كشف تسربات المياه
خدمات عزل الاسطح من الخدمات التى فى حاجة الية فى المكان فاذا اراد ان تقوم باعمال العزل لاى اسطح فى المصانع – المنازل – الفلل ... وغيرها وتبحث عن افضل الخدمات التى تساعد فى الوصول الى افضل ما تتمنى ان ترى علية النتائج فعليك ان تتعاون مع شركة قمم التميز التى تحقق اعلى مستوى من العزل بالاعتماد على افضل المواد المخصصة فى القيام باعمال العزل والتى استطاعت ان تتاكد ان اعمال العزل التى تتم بالخيش المقطرن والسيلتون من افضل الخدمات المثالية المميزة التى تحقق اعلى مستوى من العزل دون ان يؤدى الى ظهور اى عيب فى اعمال العزل . كشف تسربات المياه بالرياض
لا تكتفى شركة قمم التميز بالقيام باعمال العزل فى منطقة معينة بل تسعى الى الوصول الى اى مكان فى شرق الرياض – جنوب الرياض – غرب الرياض – وسط الرياض من اجل ان تحقق خدمات العزل لجميع عملاء الشركة الكرام . شركة كشف تسربات بالرياض
تهتم شركة عزل اسطح بالقيام باعمال العزل الحرارى من خلال الاعتماد على افضل الطرق المثالية وافضل الخدمات المميزة فى القيام باعمال العزل بالاضافة الى ان الشركة تهتم باعمال الكشف الدورى من فترة الى اخرى من اجل ان يتم التحقق ان اعمال العزل تمت كما هو مطلوب فى المكان . كشف تسربات المياه شرق الرياض
العزل الحرارى يساعد فى حماية الاسطح من التعرض الى ارتفاع فى درجة الحرارة وتسربها عبر الجدران فى فصل الصيف بالاضافة الى التعرض الى التصدعات وعوامل المناخية المختلفة التى تتعرض الية فى فصل الصيف نتيجة لشدة الحرارة التى تظهر التشوهات فى المبانى ، بالاضافة الى ان لدينا خدمات العزل المائى من خلال الاعتماد على خدمات العزل المائية التى تتم من خلال افضل خدمات متواجدة والتى تسعى الى تحقيق افضل مستوى من الخدمة والعزل بالفوم ايضا من اهم الطرق الحديثة التى استخدامت فى العديد من الدول الاوربية لانه يساعد فى حماية المبنى من التعرض الى الحرارة المرتفعه او التعرض الى التسربات للمياة والحفاظ على المبنى من اى مشاكل وعوامل مناخية ، فاعمال العزل للفوم اختصار لكل طرق العزل والتى تساعد فى الحفاظ على المبنى .
كشف تسربات المياه
كشف تسربات المياه غرب الرياض
ReplyDelete
Replies
DreamNTAugust 25, 2018 at 4:24 AM
Thanks for another great post.it is very modern.Very nice and very good dance.

สูตรบาคาร่า
ReplyDelete
Replies
MNYGMNovember 28, 2018 at 6:28 AM
Foam insulation company in Dammam**Thermal insulation company in Riyadh**Dammam Cleaning Company**Transfer of Afash Medina**Moving furniture in Jeddah**An insect control company in Dammam**Insulation of roofs with the**Detection of water leaks in Makkah
ReplyDelete
Replies
auto slotxoFebruary 27, 2020 at 1:44 AM
Thanks for the great message Useful for reading A good place to learn
สล็อตxoauto
ReplyDelete
Replies
abo-bderFebruary 15, 2021 at 11:32 AM

شركة تنظيف بالرياض
شركة تركيب اثاث ايكيا بالرياض
.
فنى تركيب
فني
شركة تركيب اثاث ايكيا بالرياض
فني ستائر بالرياض
شركة تركيب اثاث ايكيا بالرياض
شركة تركيب ستائر بالرياض
شركة نقل اثاث من الرياض الى الامارات
ReplyDelete
Replies
Expert Market ResearchDecember 10, 2021 at 2:45 AM
Solid Waste Management Market size exceeded USD 1 trillion in 2019 and the annual capacity is anticipated to exceed 28 billion tonnes by 2026. Declining public health & productivity, contamination of water bodies, and airborne emissions are some of the key issues associated with the mismanagement of solid waste. Growing urbanisation along with a steady increase in the global population has ticked up the implementation of solid waste management processes for discarded products while maintaining the social, technical, and economic feasibility of the treatment practises. Ongoing efforts toward resource recovery and the increasing demand for integrated solid waste management practises across urban agglomerates will augment the market growth.
ReplyDelete
Replies

Add comment

Subscribe To My Blog

Friday, July 17, 2015

Performance comparison of subset in R to filter in the deployer package

6 comments: