So with my newfound knowledge of #kagi upsetting me and causing me to leave it... The only other engines that are not just proxying Google or Bing are #brave and #mojeek, and while Brave's is nice... I just cannot bring myself to use something that funds a bigot... Then Mojeek just feels dated and not great...
I am tempted to bring fuzzysearx back... and even try to leverage the built-in ability to add custom engines to tap into other capabilities... because searxng really seems like the only ethical way I can see to use search engines that collect data...
One of the things I like about the #mojeek search engine is that you can filter the search results and ask the engine to never show any links from websites such as slant.co or other nonsense websites.
so @Mojeek seems to have at least deleted their death wishes off of their site, but with no explanation, and no word on how they will make sure that this doesn't happen to other people who may be in a worse place than I am currently and could take serious damage from exactly this kind of thing.
To just let an AI post stuff on your page is so irresponsible. To not make sure things like this don't happen is irresponsible.
If you let an AI post something in your name, YOU are responsible for what it confabulates.
If you use #Mojeek as your search engine, you can tell it whether a result is good or bad by clicking "Soumettre un retour d’information" (submit feedback) at the bottom right.
Apparently that helps the devs quite a bit in improving the results in our region.
Using ChatGPT to Double-Distill Mojeek Results into a Date-Based Topic Overview
My concern about AI-assisted search results has been, from the beginning, the lack of human context. A simple query is rarely going to be sufficient in itself; after all, the user is searching because of some existing information lack. Outside of the most basic queries (When is a movie playing? Where is that restaurant? How many ounces in a pound?)...
While #DuckDuckGo is down, I tried out a few other search engines. I reran a search I made yesterday in each. #Bing gave me an interesting-looking blog first, then some less-related info from high-trust sources, then an interesting paper, then a bunch of garbage. #Mojeek gave me nothing that looked interesting, just pages that used some of the words from my search. #Kagi gave me the less-related high-trust sources first, then the entire rest of the first page looked interesting.
Wow, THIS is an eye-opener. A visual representation of which search API powers which search engine. I'd no idea Bing had become so prevalent. I didn't even realise my own choice of engine, Startpage, had abandoned Google for it!
NB: This actually lists SP as still using Google but Seirdy's list doing the rounds elsewhere correctly states it as having moved.
Hmm I probably have the most ridiculous #robotstxt for a #Misskey instance right now lol. I just want to let #Mojeek and #Marginalia crawl #Makai and make sure to keep out #Google and the AI scrapers... :satrithink:
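For anyone wanting to try the same kind of robots.txt, here is a rough sketch of what such a policy might look like, checked with Python's standard `urllib.robotparser`. The user-agent tokens (`MojeekBot`, `search.marginalia.nu`, `Googlebot`, `GPTBot`) are my assumptions about how those crawlers identify themselves, not something taken from the post, so verify them against each crawler's own documentation. The helper name `crawl_permissions` is made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical policy: allow two independent crawlers, block Google and one
# AI scraper. The user-agent tokens are assumptions; check each crawler's docs.
ROBOTS_TXT = """\
User-agent: MojeekBot
Allow: /

User-agent: search.marginalia.nu
Allow: /

User-agent: Googlebot
Disallow: /

User-agent: GPTBot
Disallow: /
"""


def crawl_permissions(robots_text, agents, path="/"):
    """Return {agent: True/False} for whether each agent may fetch `path`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is needed.
    parser.parse(robots_text.splitlines())
    return {agent: parser.can_fetch(agent, path) for agent in agents}


if __name__ == "__main__":
    agents = ["MojeekBot", "search.marginalia.nu", "Googlebot", "GPTBot"]
    for agent, allowed in crawl_permissions(ROBOTS_TXT, agents).items():
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Keep in mind robots.txt is only a request: well-behaved crawlers honour it, but scrapers that ignore it would need to be blocked at the server level instead.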
Folks, I know… I use Duck Duck Go also but remember they have venture capital. Enjoy it while it lasts (or let’s fund and build alternatives differently that aren’t temporary businesses with profit motives and exits but commons-owned institutions working for the common good).
Nice! However that does not seem to show which ones have their own index, that's a very important piece of information in my opinion. For example #Mojeek has its own independent index, while DuckDuckGo does not (it uses Bing), but that difference is not seen in the list.
All those who doubted me when I said that Bingʼs index was in the 1 to 2 milliards …
This is where Inktomi was over two decades ago, and itʼs a fraction of the size of Mojeekʼs.
update: for me and so far, #mojeek is either very much spot on or clearly bad. i had to fall back to #kagi often, but will continue this experiment for longer.
@farooqkz good point! That post was long ago when I didn't fully understand the nuances between #foss and #opensource, and I guess #privacy also crept in somewhere 😅
I should write a new intro, actually 🖋️
#Mojeek is my default for the moment (not open source I know) but I incidentally stumbled upon @Seirdy's search engines post just yesterday, so I'm exploring those too!