stevensanderson, to programming
@stevensanderson@mstdn.social avatar

🎉 New Post Alert! 🎉

Counting words in a string is a fundamental task in data analysis.

  1. Base R: Use strsplit(), a straightforward method to split strings and count words.

  2. stringr: The str_split() function from the stringr package makes the code more readable.

  3. stringi: For powerful and efficient string manipulation, stri_split_regex() from the stringi package is your go-to.

Happy coding! 🚀

#R

Post: https://www.spsanderson.com/steveondata/posts/2024-05-16/

smach, to rstats
@smach@masto.machlis.com avatar

How Major League Teams Use R to Analyze Baseball Data - video presentation by Keith Woolner, principal data scientist for the Cleveland Guardians, at the Cleveland R User Group
https://youtu.be/1SR9C4td3TI?si=Ic7osZnMgb8vQd44

#RStats #SportsAnalytics @rstats

rOpenSci, to rstats Spanish
@rOpenSci@hachyderm.io avatar

📦 [A package a day - Computing Infrastructure 44]

Today's Computing Infrastructure package is gistr

Work with GitHub Gists
🙏 Maintained by @sckottie
📝 https://docs.ropensci.org/gistr/

Do you use this package? Tell us about it!
🗺️ https://discuss.ropensci.org/c/usecases/10

#APackageADay
#OSS
#RStats
@rstats

adamhsparks, to random
@adamhsparks@rstats.me avatar

Anyone else having issues with CRAN today? I was trying to submit an update to a package but I get an error that the SSL certificate has expired and it won't upload via {curl} and Firefox flags the page as unsafe. #RStats

ramikrispin, to rust
@ramikrispin@mstdn.social avatar

(1/2) I recently posted a few posts about Rust 🦀 and my intention to leverage it for data science applications. Multiple people asked if Rust is a substitute for R or Python, and the short answer (in my opinion) is no. I see Rust as a complementary or supporting language that could make languages like R and Python faster.

Polaris 🐻‍❄️ is one example of a Python 🐍 application that uses Rust on the backend. 🧵👇🏼

stevensanderson, to programming
@stevensanderson@mstdn.social avatar
stevensanderson, to programming
@stevensanderson@mstdn.social avatar

After I update my #R Package TidyDensity there will be 176 functions...that's a lot of code I wrote. This is just a reflection. Right now it is 172.

#R

rOpenSci, to rstats Spanish
@rOpenSci@hachyderm.io avatar

📦 [A package a day - Computing Infrastructure 43]

Today's Computing Infrastructure package is prismjs

Server-Side Syntax Highlighting
🙏 Maintained by @jeroenooms
📝 https://docs.ropensci.org/prismjs/

Do you use this package? Tell us about it!
🗺️ https://discuss.ropensci.org/c/usecases/10




@rstats

Posit, (edited ) to Flooring
@Posit@fosstodon.org avatar

The world of big data, databases, and R is rapidly evolving with an explosion of tools and packages. We're delighted to announce two workshops at posit::conf(2024) tailored for working with large datasets:

• Big Data in R with Arrow, led by Nic Crane and Steph Hazlitt
• Databases with R led by @kirill

More info on the workshops here: https://reg.conf.posit.co/flow/posit/positconf24/publiccatalog/page/publiccatalog?search=&tab.day=20240812&search.sessiontype=1675316728702001wr6r

stevensanderson, to programming
@stevensanderson@mstdn.social avatar

🔎 Selecting Columns Containing a Specific String in R: A Quick Guide 🚀

Hey R users! Need to select columns by a specific string? Here's how in base R, stringr, stringi, dplyr, and with a bonus from data.table.

🆒 R
✅ grepl
📦 stringr
📦 stringi
📦 dplyr

Bonus: 📦 data.table
library(data.table)
df_price <- df[, names(df) %like% "price"]

Happy coding! 🚀

Post: https://www.spsanderson.com/steveondata/posts/2024-05-15/

#R

image/png
image/png
image/png

mianromu, to ubuntu Spanish
@mianromu@mastodon.social avatar

Para los que tengáis problemas ejecutando RStudio desde Ubuntu 24.04 probad a ejecutar esto desde un terminal:
sudo chmod 4755 /usr/lib/rstudio/chrome-sandbox

De nada.

#rstats #rstudio #ubuntu

adityadahiya, to coffee
@adityadahiya@mastodon.social avatar

#TidyTuesday American Coffee Preferences: most people drink black #coffee; Milk and Sugar are most preferred additions.
Data: The Great American Coffee Taste Test by @jimseven and Cometeer
Code🔗https://tinyurl.com/tidy-cfe
Tools: #ggplot2 #ggfittext #rstats @R4DSCommunity #coffeetime

rOpenSci, to rstats Spanish
@rOpenSci@hachyderm.io avatar

📦 [A package a day - Computing Infrastructure 42]

Today's Computing Infrastructure package is tokenizers

Fast, Consistent Tokenization of Natural Language Text
🙏 Maintained by Lincoln Mullen
📝 https://docs.ropensci.org/tokenizers/

Check out this article: Introduction to the {tokenizers} Package
💻 https://docs.ropensci.org/tokenizers/articles/introduction-to-tokenizers.html




@rstats

mdsumner, to random
@mdsumner@rstats.me avatar

@josi this is awesome 👍 excellent details on speeding up some code and then Rewriting It In Rust 🎉

https://youtu.be/-v9qaqaj4Ug?si=202ziYPn8T0YvDGO

zkamvar, to random
@zkamvar@hachyderm.io avatar

TIL about strcapture() which has been in the source code for the last eight years

https://github.com/r-devel/r-svn/commit/7ee16675b06a6368612b173a7d2bb59db3a91d6a

https://fosstodon.org/

ivelasq3, to python
@ivelasq3@fosstodon.org avatar

posit::conf(2024) is an open-source data science conference happening in Seattle on August 12 - 14!

After the final keynote of #positconf2024, connect with R-Ladies Seattle and PyLadies Seattle at the community mixer! Open to all, even if you did not attend the conference.

RSVP here: https://www.meetup.com/rladies-seattle/events/300741695

#RStats #Python

stevensanderson, to Finance
@stevensanderson@mstdn.social avatar

I have started a public team on my DoTadda account (DoTadda is great for portfolio managers specifically) that is for R stuff, you can find it here:

https://app.dotadda.io/teams/bad2fd50-1971-4103-903a-4c6406e3f445/dots

#R #RStats #Finance #PortfolioManagers

If you are looking for a great way to automate knowledge search, share and the like you can use this to book a demo: https://4fi3yk3dxbg.typeform.com/to/qv1Xd3Ae?typeform-source=www.dotadda.com

stevensanderson, to programming
@stevensanderson@mstdn.social avatar

Want to check duplicate values across columns of a data.frame? Well you can do that in a basic way with TidyDensity and the check_duplicate_rows() function, or you can go through todays blog post for some other ideas with and

#R

Post: https://www.spsanderson.com/steveondata/posts/2024-05-14/

image/png

hrbrmstr, to random
@hrbrmstr@mastodon.social avatar

Nice!

A new "ISC Working Group 'Marshaling & Serialization in R'"

This WG aims to develop standard practices for marshalling & unmarshalling of #RStats objects. This will involve identifying current problems, raising awareness of it, & coming up with technical solutions.

https://github.com/RConsortium/marshalling-wg

maelle, to random
@maelle@mastodon.social avatar

New #RStats post: Automate code refactoring with {xmlparsedata} and {brio}

https://masalmon.eu/2024/05/15/refactoring-xml/

Absolutely worth the time, obviously 😇

maelle, to random
@maelle@mastodon.social avatar

My "useR! 2024" fireside chat with James Black 🔥

https://masalmon.eu/talks/2024-05-06-user-fireside-chat/

Proudly representing France? Brittany? with my shirt. 🥖 🥞

#RStats

statnmap, to random French
@statnmap@mastodon.social avatar

- Which functions of my package are exported 👀 or not 🙈 and where are they located ?
💪 Find it out with fusen::draw_package_structure() - Works for regular packages or 'fusen' built packages
👉 https://thinkr-open.github.io/fusen/articles/draw-a-tree-of-your-package-files-and-functions.html
🎉 fusen-v0.6.0 is now on CRAN

DataAngler, to random
@DataAngler@vis.social avatar

League of Legends but for #rstats folks wrangling legends in multi-layer charts and maps

simonpcouch, to random
@simonpcouch@fosstodon.org avatar

tidymodels has long supported parallelizing model fits across CPU cores. A couple of the modeling engines that supports for gradient boosting— and —have their own tools to parallelize model fits. A new blog post explores whether tidymodels users should use tidymodels' implementation, the engines', or both.

https://www.simonpcouch.com/blog/2024-05-13-parallel/

almenal99, to random
@almenal99@fosstodon.org avatar

hey @coolbutuseless @mattdray

what's your take on pipe-at-the-end vs pipe-at-the-beginning?

(others, feel free to chime in)

#rstats

DataAngler,
@DataAngler@vis.social avatar

@almenal99 @coolbutuseless @mattdray I like the reasons outlined in the resource below for putting |> at the end of the line:

  1. easy to scan verbs on the left to understand what a chunk is doing
  2. easy to add steps to the end of a line

https://r4ds.hadley.nz/workflow-style.html

  • All
  • Subscribed
  • Moderated
  • Favorites
  • provamag3
  • GTA5RPClips
  • rosin
  • Youngstown
  • everett
  • khanakhh
  • slotface
  • InstantRegret
  • Durango
  • ngwrru68w68
  • kavyap
  • modclub
  • DreamBathrooms
  • mdbf
  • JUstTest
  • magazineikmin
  • thenastyranch
  • cubers
  • cisconetworking
  • osvaldo12
  • ethstaker
  • normalnudes
  • Leos
  • tester
  • megavids
  • tacticalgear
  • anitta
  • lostlight
  • All magazines