#SearchEngines - kbin.social

ajsadauskas, 3 months ago (edited 3 months ago) to tech

In an age of LLMs, is it time to reconsider human-edited web directories?

Back in the early-to-mid '90s, one of the main ways of finding anything on the web was to browse through a web directory.

These directories generally had a list of categories on their front page. News/Sport/Entertainment/Arts/Technology/Fashion/etc.

Each of those categories had subcategories, and sub-subcategories that you clicked through until you got to a list of websites. These lists were maintained by actual humans.

Typically, these directories also had a limited web search that would crawl through the pages of websites listed in the directory.

Lycos, Excite, and of course Yahoo all offered web directories of this sort.

(EDIT: I initially also mentioned AltaVista. It did offer a web directory by the late '90s, but this was something it tacked on much later.)

By the late '90s, the standard narrative goes, the web got too big to index websites manually.

Google promised the world its algorithms would weed out the spam automatically.

And for a time, it worked.

But then SEO and SEM became a multi-billion-dollar industry. The spambots proliferated. Google itself began promoting its own content and advertisers above search results.

And now with LLMs, the industrial-scale spamming of the web is likely to grow exponentially.

My question is, if a lot of the web is turning to crap, do we even want to search the entire web anymore?

Do we really want to search every single website on the web?

Or just those that aren't filled with LLM-generated SEO spam?

Or just those that don't feature 200 tracking scripts, and passive-aggressive privacy warnings, and paywalls, and popovers, and newsletters, and increasingly obnoxious banner ads, and dark patterns to prevent you cancelling your "free trial" subscription?

At some point, does it become more desirable to go back to search engines that only crawl pages on human-curated lists of trustworthy, quality websites?

And is it time to begin considering what a modern version of those early web directories might look like?

@degoogle #tech #google #web #internet #LLM #LLMs #enshittification #technology #search #SearchEngines #SEO #SEM

reply

expand (76)

collapse (76)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ timrichards, AdeptVeritatis, ubi, oblomov +11 more

lydiaconwell, 9 months ago to Amazon

What are the best UK #Amazon alternatives for everything you want to buy?

So not only #books but household items like cleaning products.

Where else can I shop?

I'm asking the #Fediverse because #SearchEngines aren't going to tell me shit.

#AskFedi #AskFreddy

reply

expand (15)

collapse (15)

report

activity

copy /kbin url

copy original url

open original url

Loading...

michael, 6 months ago to web

So, I've been playing with Kagi search this week, and I do quite like it:

http://kagi.com

I've been seriously fed up with Google's search recently: results full of ads and/or SEO spam, to the point that it's hard to find info.

The results in kagi search are overall good (though the map is really lacking), but it's just very expensive. I'd definitely need their $10 per month plan, and I'm not sure I could justify this, given there is free (though arguably worse) competition. 🤔

I guess I could cycle through the trial by creating a new account every week (the joys of having my own domain name) but that seems a little ridiculous, too 😆

#search #SearchEngines #SearchEngineOptimization
#web #kagi #google

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

strypey, 1 year ago to random

#SearchEngineTest I want to download an .iso of the [GNU/] Linux Mint Debian Edition. Search using keywords "LMDE" and "download"

Monocles.de: correct download page was #5 result, the release notes for the previous release were the #2. The rest were geek press articles about LMDE

Mojeek: correct download page at the top, with .nl download page for the same release as #3. The rest were geek press too

(1/?)

#search #SearchEngines

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

The end of the Googleverse (www.theverge.com)

The last 25 years of Google’s history can be boiled down to a battle against the Google bomb. Is the search engine finally losing to its hijackers?

awoodsnet, 10 months ago to UX

One of the biggest problems on the web is search — especially from a privacy perspective. we need to setup our own search engines, so that get more universal results without manipulation. Also could we federate the m? Remember how the original Napster worked?! Perhaps we could setup our search engines to work like Napster.

#Search #UX #SearchEngines
#Privacy

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

agr, 21 days ago to searxng

While Bing is down, time to (re)visit @Seirdy excellent post:

**A look at search engines with their own indexes

https://seirdy.one/posts/2021/03/10/search-engines-with-own-indexes/

Including Mojeek (own index) and SearxNG a metasearch engine (list of public instances here: https://searx.space/)

#Bing #SearchEngines #Mojeek #SearxNG

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kravietz

jake4480, 11 months ago to news

For some vintage and retro devices (or for a variety of other reasons) you might want a super-lightweight method of searching the web or simple links of news.

I saw @osz mention the SUPER lightweight search site http://frogfind.com (which is powered by DuckDuckGo) and also http://68k.news (which is a simple HTML link version of Google News).

These two sites are SO cool and potentially useful.

Know any more sites/things like this? Comment em!

#search #news #HTML #retro #RetroComputing #RetroGames #RetroGaming #VintageComputers #VintageComputing #vintage #SearchEngines #simple #DuckDuckGo #Google #GoogleNews #LightweightSearch #LightWeightNews

A screenshot of 68k.news, a simple HTML version of Google News

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Taffer, 5 months ago to DuckDuckGo

Sounds like Kagi is partnering with Brave, which is run by homophobes and does all kinds of sketchy crypto-related nonsense by default.

Anyone know any good search engines that aren't evil or jumping on the "AI" hype train? Happy to pay for one.

I used to use DuckDuckGo, but then they went all-in on Bing "AI", so no. Mildly annoyed that I already paid for a year of Kagi…

#searchengines #kagi #duckduckgo

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

amanjeev, 5 months ago to web

What search engines are you using these days? Asking for

non-tech searches

tech/programming related searches

What's working for you?

#SearchEngines #Search #WebSearch #Web #search_engines #AskFedi

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ oblomov, Sh4d0w_H34rt, Mojeek

jackyan, 1 year ago to random

Sad to read from the @Mojeek newsletter that #Gigablast has gone. 🙁 https://blog.mojeek.com/2023/05/farewell-gigablast.html #SearchEngines

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Mojeek

athairbirb, 1 year ago to random

Well damn...

This search service is shutting down. Who's decent again? The :duckduckgo: ?

https://neeva.com/blog/may-announcement

#SearchEngines #neeva

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

tomasekeli, 11 months ago to random

altavista -> yahoo -> kvasir -> ask-jeeves -> google -> duck-duck-go -> qwant -> self-hosted searx

#searchEngines #evolution

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

The First Search Engines, Built By Librarians (hackaday.com)

Before the Internet became the advertisement generator we know and love today, interspersed with interesting information here and there, it was originally a network of computers largely among various universities.

noellemitchell, 29 days ago to ai

All these new AI products really make me want to stop using the internet 😅

https://9to5google.com/2024/05/14/google-search-ai-overview-rollout/

#AI #internet #tech #computer #google #news #searchEngines #media

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

brook, 9 months ago (edited 9 months ago) to random

Wish I could get DuckDuckGo (that is to say, Bing) to index https://hub.sunny.garden/

I don't know why it doesn't, I've submitted URLs and sitemaps. According to Bing's own Webmaster Tools, there are no specific problems, it's allowed by robots.txt, testing individual pages reports no issues, etc. And yet every URL is "Excluded" with no additional information provided. Bing support helpfully suggested I review the webmaster guidelines to increase my chances.

Searching for an exact page title will return other sites which link to it, but not the original post. :blobconfounded:

#DuckDuckGo #Bing #SearchEngines

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

jackyan, 1 year ago to random

In my blog
An update on how #Mojeek, #Google and #Bing are performing on a site: search here

https://jackyan.com/blog/2023/05/an-update-on-how-mojeek-google-and-bing-are-performing-on-a-site-search-here/ #SearchEngines

cc for @Mojeek

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

JeremyMallin, 6 months ago to random

Does anyone have suggestions for alternative #SearchEngines that actually have good, meaningful results?? That's going to mean no deeply flawed AI, no SEO gaming the system, just useful results.

This isn't about privacy or a lack of privacy. I'm just really getting sick of not finding what I'm actually looking for.

#NoAI

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Cassandra

researchbuzz, 6 months ago to random

#SearchEngines #ecommerce #SponsoredListings #PublicTrust

'[Consumers] tend to view sponsored listings with suspicion and often prefer to click on what are called 'organic' listings that appear high in their product search results but are not sponsored, said [Professor] Mingyu 'Max' Joo... In fact, a sponsored listing can be detrimental when it replaces a seller’s organic listing that would have appeared in the top few positions in the search results.'

https://news.ucr.edu/articles/2023/11/28/online-consumers-shy-away-sponsored-product-listings

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

strypey, 10 months ago to random

An overview of search engines (mainly English language ones for now):

https://seirdy.one/posts/2021/03/10/search-engines-with-own-indexes/

#search #WebSearch #SearchEngines

#HatTip to @indieterminacy for the link in a matrix room.

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Mojeek, indieterminacy

remixtures, 3 months ago to journalism Portuguese

#Media #News #Journalism #SEO #Google #Search #SearchEngines: "In our experience, each rollout of the Products Review Update has shaken things up, generally benefitting sites and writers who actually dedicated time, effort, and money to test products before they would recommend them to the world.

That said, most searches for specific product models don’t just magically start with users searching for specific devices off the top of their heads. There is an immediate step before this: the hours of research reading through lists of product recommendations.

If you have been reading HouseFresh for a while, your first encounter with us was likely a list like this one or this one recommending the best devices for a specific issue you were trying to solve. That is how most of our readers find us.

Unfortunately, we’re getting less and less traffic from those pages, and it’s endangering the future of our site.

That’s why we’re writing this article."

https://housefresh.com/david-vs-digital-goliaths/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

remixtures, 3 months ago to ai Portuguese

#AI #GenerativeAI #Web #Search #SearchEngines #Chatbots: "The Browser Company’s new app lets you ask semantic questions to a chatbot, which then summarizes live internet results in a simulation of a conversation. Which is great, in theory, as long as you don’t have any concerns about whether what it’s saying is accurate, don’t care where that information is coming from or who wrote it, and don’t think through the long-term feasibility of a product like this even a little bit. Or, as Dash put it, “It’s the parasite that kills the host.”

The base logic of something like Arc’s AI search doesn’t even really make sense. As Engadget recently asked in their excellent teardown of Arc’s AI search pivot, “Who makes money when AI reads the internet for us?” But let’s take a step even further here. Why even bother making new websites if no one’s going to see them? At least with the Web3 hype cycle, there were vague platitudes about ownership and financial freedom for content creators. To even entertain the idea of building AI-powered search engines means, in some sense, that you are comfortable with eventually being the reason those creators no longer exist. It is an undeniably apocalyptic project, but not just for the web as we know it, but also your own product."

https://www.fastcompany.com/91033052/does-anyone-even-want-an-ai-search-engine?mc_cid=f22a3b4b18

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

itnewsbot, 7 months ago to random

In Antitrust Trial, Google Argues That Smart Employees Explain Its Success - While the Justice Department says Google has squeezed partners and made expensive deals t... - https://www.nytimes.com/2023/10/18/business/google-antitrust-pandu-nayak.html #antitrustlawsandcompetitionissues #computersandtheinternet #justicedepartment #searchengines #innovation #googleinc

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

avail, 2 months ago to random

What is everyone using for a search engine these days? I was bouncing between Duck Duck Go and Kagi - but Kagi is now in cahoots with Brave which I'm not a fan of so that's out. Duck Duck Go is ok, but curious what else is out there.

#SearchEngines / #recommendations / #WebSearch

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ mxtthxw

kzimmermann, 8 months ago to DuckDuckGo

15 years ago, #DuckDuckGo was launched

https://spreadprivacy.com/15-years-of-duckduckgo/

For me, personally, it was with DDG that I learned how #Google was privacy-invasive, and my starting point to learning more about #privacy.

As controversial as they are with conflicts of interest, I think they opened quite the door to what alternative #searchengines could be. Search and then metasearch came up and gave us a lot more privacy-friendly alternatives.

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...