#TIL that @JasonPunyon curated and compiled a whopping archive of answers from #StackOverflow and assorted #StackExchange Q&A sites in a minimal sqlite format, where they can be downloaded and analyzed offline:
Amazing effort and great idea. Reminded me of the archives that #Kiwix kept of it (alongside Wikipedia and similar projects), but more streamlined and cross-platform. Nice.
I'm grieving for Stack Exchange. It's been such a good concept. I've learned a lot there. I've enjoyed contributing. "Let's build upon each other's knowledge and find the best answers, collaboratively" is such a great premise.
Well, not anymore. "AI" has to eat everything.
We've been, or are in the process of, losing so many good things lately, for different reasons. Twitter, Reddit, GitHub, Raspberry Pi, search engines, streaming services … capitalism is enshittifying everything.
Just came up with a new analogy I'm rather proud of: LLMs are digital compost heaps. They decompose whatever you hurl in and turn it into artificial excrement.
Also I'm moving from StackExchange to Codidact. If I'm going to do any more unpaid labour it's going to be for a not-for-profit, rather than a for-profit company. Feeding that work into a digital compost heap is the push I needed.
If you post to #StackExchange, you license that content to them under CC-BY-SA. You can't revoke that license, so you can't remove that content.
It shouldn't surprise you that they stop you from trying. Imagine a Wikipedia editor trying to remove their edits.
However, what it does give you is the right to host all of SE elsewhere. To fork it, if SE violates its contract with the community. This is how SE was always designed to work. From an interview with Spolsky:
Lovely is helping daddy learn about setting up #cron jobs from #CommandLine in #Linux. She's making sure he doesn't give himself a bad back by crouching over to read the #stackexchange webpage!
Alright, so many companies are using user or customer data for training #AI without consent that I think I'm going to have to make an ongoing thread to document them all. 🤖
Here we go! 1/x
Starting out with #Reddit who have sold user data to another company, that will use it to train AI:
Sounds like it can replace/augment those with experience levels #lmgt4y#StackOverflow#StackExchange
But actual specialists? Have -1 incentive now to write down their experience. 📉trends ensue.
Today, I had a work meeting with a young but eager fellow. We had a chat, I explained a few things, and he agreed with the technical choices I suggested. However, the issue arose when I mentioned writing a few lines of a "tutorial" on how to do certain things. His response: "Can't you make a video?" Surprised (but not too much), I argued that in my opinion, a concise written tutorial is more practical than a pointless video. His reply: "We only watch videos now, even my colleagues look up solutions on YouTube." This is despite the wealth of excellent tutorials on StackExchange and various blogs. I wonder: why waste half an hour watching a video that could be summarized in a few lines of text (saving time for both the producer and the user)?
Now that #CloudGlare have hijacked #serverFault, unix, #stackExchange, #askUbuntu and the entire StackOverflow family of sites, CloudGlare are more able to learn the types of work people might be doing.
#Introduction hey reaching out to yall that interested in Sci fi and time travel.
I'm trying to write a time travel novel and would be fire to connect with anyone with similar interests.
Hit me up for ideas . Preciate that frfr!
Love Jaden ❤️
Have a look at #Worldbuilding#StackExchange, an online Q&A community where authors bounce, discuss and improve their ideas. Even if you're not posting there, you could get some inspiration, or read advice that applies to your story as well.
I'm now the proud custodian of the gemtext.foo domain, which is to be used for #Gemini mirrors of popular sites, like Wikipedia and Stack Exchange, as well as developer sites, like Go/Pub package search and docs.
Of the below sites (or others, reply!), which should I mirror first? Boosting for more feedback is welcome!
Abiding by the results of the poll, I've started on this! Right now it just shows questions, no answers, and it isn't very smart about what site it's on.
They have a steep rate limit on there that I'd have to authenticate against a user to raise. I think just downloading the data dumps might be cheaper, but that might complicate search...
i know you can theoretically follow #kbin “magazines” from a mastodon account (i think), but it doesn’t really make sense outside of the intended reddit-like formatting. so i have a kbin account specifically to browse and interact with kbin, and when i see something i would want to share, i boost it there.
my #mastodon account here follows my kbin account. so when my kbin boosts show up in my mastodon timeline, i boost them to my mastodon followers from this account.
The Moderator Strike is the best thing to happen to StackExchange. First @codinghorror retires to a lifetime of yahcting. Then all the moderators go on some absurd and silly "strike" over totally immaterial things.
If you told me I'd get everything I wanted 10 years ago, I wouldn't have believed it.
You know what sucks? Spotify is apparently draining my phone's battery, and normally in situations like this I'd turn to #Reddit to see if anyone is experiencing similar issues. But since I've deleted my account and effectively blocked the site, that's a no go.
I'll try to find something on #kbin or #lemmy but I'm not too hopeful.
@zeh@alcinnz
>have been seeing the worse results from duckduckgo
We're not the only ones who have been complaining about this recently. I was very happy with DDG for years and I still really like the fact that I can search the Free Software Directory by adding !fsd to my keywords. But their general search results have gone massively downhill in the last year or two, and I blame Bing.
#Gigablast ← does their own crawling, but what happened?.. they were dissolved last year & seem to now be www.alltheinternet.com
#Ombrelo ← a proxy but more advanced than the others (filters/downranks Cloudflare sites)
#YaCy is notable because it’s a crawler that you can install and operate yourself. YaCy instances can be public-facing and they can also share indexes with each other fedi style apparently. Some Searx instances tap YaCy instances.
I would love to find a searx or 4get instance that rejects the tech giants, but aggregates from YaCy, mojeek, gigablast, metager, maginalia.nu, frogfind.com, & wiby.me.
And I would love it even more if it would make replacements:
Ideas to build a federated StackExchange alternative (ioc.exchange)
Codeberg was asking about this. The linked toot by a commenter points to :...