@gsa Same here: I started with 1 TB per hard drive and am now replacing them with 14 TB ones. I just replaced my two 1 TB SSDs (lifespan: 2 years); they did not take kindly to the crawler on #yacy
“I found it very weird that there essentially is no way to browse the web in an open manner. So that’s what I am trying to build,” the founder of Stract said.
#YaCy is an open source crawler that you can run and feed Searx with. I recall some searx instances that run their own YaCy. YaCy can also share indexes with other YaCy instances.
While presenting https://zoTop.fr, our discussions here led us to want something even better: a free and open source search engine with its own index, totally independent and belonging to no one. For that we need an instance-based tool like Mastodon, PeerTube, etc...
Every now and then, I stumble into something that reminds me how #security in #Windows has completely backfired as a concept from the user's point of view.
After decades of frustration and cat-and-mouse games of advice, antivirus and security software and everything else, you'd have thought that its userbase would've become street-smart.
But it didn't. Given a goal, users simply click OK despite any and all warnings, even when they are just following instructions, as in this benign example from #YaCy.
@tomw@orbiterlab If I remember correctly, the #YaCy result quality should correlate with the number of users (i.e. running instances). Maybe the whole project just needs more users and/or publicity.
today I think I'm going to set up #YaCy and see if I can sync it with my Firefox bookmarks as crawl nodes.
with #search dying, I think social crawling is a really rad possibility - I have a bunch of bookmarks that I know to be good sources of information/good things, so do my friends. Maybe I can also mark pages as LLM generated and signal to others to downrank them. I wonder if we could link fedi accounts to YaCy.
Not sure if that's how YaCy works out of the box, but want to check it out.
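A minimal sketch of the bookmark half of that idea, assuming a copy of Firefox's `places.sqlite` (bookmarks live in the `moz_bookmarks` table and point at `moz_places` rows). The function name `bookmark_seed_urls` is hypothetical, and how the resulting list gets handed to YaCy as crawl start URLs is deliberately left out, since that depends on the instance.

```python
import sqlite3
from urllib.parse import urlsplit


def bookmark_seed_urls(conn):
    """Return de-duplicated http(s) bookmark URLs from a Firefox places database.

    `conn` is an open sqlite3 connection to a *copy* of places.sqlite
    (Firefox keeps the live file locked while it is running).
    """
    rows = conn.execute(
        """
        SELECT p.url
        FROM moz_bookmarks AS b
        JOIN moz_places AS p ON p.id = b.fk
        WHERE b.type = 1          -- 1 = bookmark; folders and separators are skipped
        ORDER BY b.id
        """
    )
    seen = set()
    seeds = []
    for (url,) in rows:
        if url is None:
            continue
        # Keep only web pages; drop place:, about:, javascript: and similar entries.
        if urlsplit(url).scheme not in ("http", "https"):
            continue
        if url not in seen:
            seen.add(url)
            seeds.append(url)
    return seeds
```

Calling `bookmark_seed_urls(sqlite3.connect("places-copy.sqlite"))` (the file name is just an example) would give a plain list of URLs that could be pasted in as crawl start points or shared with friends.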
@zeh@alcinnz
>have been seeing the worse results from duckduckgo
We're not the only ones who have been complaining about this recently. I was very happy with DDG for years and I still really like the fact that I can search the Free Software Directory by adding !fsd to my keywords. But their general search results have gone massively downhill in the last year or two, and I blame Bing.
#Gigablast ← does their own crawling, but what happened?.. they were dissolved last year & seem to now be www.alltheinternet.com
#Ombrelo ← a proxy but more advanced than the others (filters/downranks Cloudflare sites)
#YaCy is notable because it’s a crawler that you can install and operate yourself. YaCy instances can be public-facing and they can also share indexes with each other fedi style apparently. Some Searx instances tap YaCy instances.
I would love to find a searx or 4get instance that rejects the tech giants, but aggregates from YaCy, mojeek, gigablast, metager, marginalia.nu, frogfind.com, & wiby.me.
And I would love it even more if it would make replacements:
👋 Hello #fediverse! I'm going into a meeting at 4ET to talk about our newsroom's social media options.
If you think our station and other #NPR stations should have a Mastodon server or a broader presence in the #fediverse, pls boost this post. If you have thoughts, please reply, I want to hear them!
Each instance should run its own YaCy instance too. It should be configured so that it full-text indexes the whole instance (or all fediverse services of a domain which holds more than one service).
You could then find the full-text index of the whole fediverse via YaCy federation...
Nah, the likes of #DuckDuckGo, #StartPage & #Ecosia deliver data from their crawlers such as #Bing (M$) and/or #Google. Pseudonymized is not anonymous, because they are contractually obliged to do that.
🧵 …in none of the toots above had I looked at YaCy web search. I tried #YaCy years ago, but in my opinion it did not work well enough. Now, though, the developer has explained the state of the project, its history, its functionality and its future.
Thanks @orbiterlab, I will now use it to #search in order to test and understand it. The exciting part is that it is a decentralized #Internet app and has a little to do with @MetaGer, details in the link:
This Guy Has Built an Open Source Search Engine as an Alternative to Google in His Spare Time (www.404media.co)