How Major League Teams Use R to Analyze Baseball Data - video presentation by Keith Woolner, principal data scientist for the Cleveland Guardians, at the Cleveland R User Group https://youtu.be/1SR9C4td3TI?si=Ic7osZnMgb8vQd44
Anyone else having issues with CRAN today? I was trying to submit an update to a package, but I get an error that the SSL certificate has expired. It won't upload via {curl}, and Firefox flags the page as unsafe. #RStats
(1/2) I recently posted a few posts about Rust 🦀 and my intention to leverage it for data science applications. Multiple people asked if Rust is a substitute for R or Python, and the short answer (in my opinion) is no. I see Rust as a complementary or supporting language that could make languages like R and Python faster.
Polars 🐻❄️ is one example of a Python 🐍 application that uses Rust on the backend. 🧵👇🏼
The world of big data, databases, and R is rapidly evolving with an explosion of tools and packages. We're delighted to announce two workshops at posit::conf(2024) tailored for working with large datasets:
• Big Data in R with Arrow, led by Nic Crane and Steph Hazlitt
• Databases with R, led by @kirill
For those of you having trouble running RStudio on Ubuntu 24.04, try running this from a terminal:
sudo chmod 4755 /usr/lib/rstudio/chrome-sandbox
posit::conf(2024) is an open-source data science conference happening in Seattle on August 12 - 14!
After the final keynote of #positconf2024, connect with R-Ladies Seattle and PyLadies Seattle at the community mixer! Open to all, even if you did not attend the conference.
Want to check for duplicate values across the columns of a data.frame? You can do that in a basic way with TidyDensity and the check_duplicate_rows() function, or you can go through today's blog post for some other ideas with #BaseR, #dplyr, and #datatable
A new "ISC Working Group 'Marshaling & Serialization in R'"
This WG aims to develop standard practices for marshaling & unmarshaling of #RStats objects. This will involve identifying current problems, raising awareness of them, & coming up with technical solutions.
tidymodels has long supported parallelizing model fits across CPU cores. A couple of the modeling engines that #rstats #tidymodels supports for gradient boosting, #XGBoost and #LightGBM, have their own tools to parallelize model fits. A new blog post explores whether tidymodels users should use tidymodels' implementation, the engines', or both.