I've given several internal versions of this workshop at Amazon and I daresay it's been very well received. The power of these new data wrangling libraries is honestly staggering. We use them all the time at work. You should too.
20 bucks gets you in the door. All proceeds to Ukraine aid orgs. #rstats#pydata
We can't replace them, but we welcome anyone looking for a friendly, inclusive community to join us at the Data Science Learning Community (@DSLC) https://DSLC.io
While data scientists are often taught about training a machine learning model, building a reliable MLOps strategy to deploy and maintain that model can be daunting.
It doesn’t have to be this way!
Join us with Julia Silge at Posit on Wednesday, April 24th at 11 am ET to learn how Posit Team provides fluent tooling for the whole ML lifecycle.
No registration is required to attend - simply add it to your calendar using this link, https://pos.it/team-demo
Ya está abierto el registro para nuestra reunión de abril: 🐲 LLMOps & ML para Drilling Performance y Python & Mazmorras, este mes en las oficinas de Repsol
Is your analytics turned into reports shared with across your organization?
Learn how Quarto can help streamline, automate, & bedazzle your work.
Quarto is a open-source tool that helps you publish your data science work, beautifully, anywhere.
Sit back and listen to the dulcet tones of Christophe's soothing accent as he gives a 30-minute video deep-dive walking you through this end-to-end data science workflow.
Watch the video: https://youtu.be/Hl9TvhlzfV4?feature=shared
Opportunity Scholars at posit::conf(2024). The application deadline is approaching fast; March 22nd. If you're a strong candidate or know someone who is, please act quickly.
Opportunity Scholarships receive free tickets, a workshop, support for travel and accommodation, plus lots of swag.
Wes McKinney — Principal Architect at Posit PBC — digs into building great open-source data science tools on the Real Python Podcast.
Wes has dedicated over a decade to easing bottlenecks of data processing systems. In this talk, he and Christopher Bailey at Real Python discuss composable data systems and community collaboration.
We want to apply to the Google Season of Docs for #PUDL but have never worked with an outside technical writer before. Does anybody have someone to recommend? It's a #Python project focused on producing open data describing the US energy system.
The R4DS Online Learning Community has thousands of members, hundreds of which are active on our Slack every week. You might be wondering: Why not charge those learners? Why is the Community funded through donations?
🎂It's my birthday!🎂
To celebrate, I'm... Working to build a friendly, diverse #DataScience community at https://r4ds.io, just like I do every day! It'd make my day if you supported our efforts at https://r4ds.io/donate !
Do you want to speak at posit::conf(2024)? Now is the time to start thinking about your talk because the call for proposals has just opened! https://posit.co/blog/speak-at-posit-conf-2024/
We're broadly interested in talks about any aspect of data science, and don't worry if you've never given a talk before as all accepted talks get speaker training from our wonderful partner Articulation.
I have been thinking a bit about how to detect supply chain attacks against popular open source projects such as scikit-learn.
If you have practical experience with https://reproducible-builds.org/ in particular in the #Python / #PyData ecosystem, I would be curious about any feedback to the plan I suggest for scikit-learn in the following issue.
Feel free to reply on mastodon first, if you have questions.
I ran a quick Gradient Boosted Trees vs Neural Nets check using scikit-learn's dev branch which makes it more convenient to work with tabular datasets with mixed numerical and categorical features data (e.g. the Adult Census dataset).
Let's start with the GBRT model. It's now possible to reproduce the SOTA number of this dataset in a few lines of code 2 s (CV included) on my laptop.