Self-Hosted AI is pretty darn cool

chagall@lemmy.world · 11 months ago

Self-Hosted AI is pretty darn cool

EonNShadow@pawb.social · 11 months ago

“learned some things like Linux, command line, docker, and networking/pfsense” “I don’t consider myself technical”

Don’t sell yourself short, I work in IT and have colleagues on our helpdesk who would struggle endlessly with those concepts.

I hereby dub you a tech person, like it or not, those skills can and do pay the bills.

chagall@lemmy.world · 11 months ago

This made me smile. Thank you. The grass is always greener and I sometimes daydream of working in IT instead of healthcare. Maybe someday.

Biezelbob@programming.dev · 11 months ago

Nah dont.

GissaMittJobb@lemmy.ml · 11 months ago

Healthcare is pretty rough, I’d be willing to bet that the grass actually is greener in this case.

Biezelbob@programming.dev · edit-2 11 months ago

I am actually considering switching to healthcare (been a professional programmer)

I’ve had a burnout: I wish it was due caring for people in need instead of a stupid deadline.

Besides, you can always do IT as a hobby/for free. Harder with healthcare, except maybe volunteering

barsquid@lemmy.world · 11 months ago

You’ll be saving lives, yeah, but between dealing with entitled assholes that won’t follow directions and then yell at you because they didn’t.

It’s maybe easy to burn out in any career. Society has deprioritized individual fulfillment for most of us because it harms the nesting levels of billionaires’ yachts.

GBU_28@lemm.ee · 11 months ago

It is done.

damnthefilibuster@lemmy.world · 11 months ago

Now that you’ve dubbed OP a tech person…

Hey OP, can you help me fix my printer? It’s only printing “RED RUM RED RUM” for some reason.

sugar_in_your_tea@sh.itjust.works · 11 months ago

Have you tried giving it red rum?

Oh, and make sure you hold it out with the insides of your arms exposed, it’ll feel less threatening that way.

webghost0101@sopuli.xyz · 11 months ago

Thank you for this. I consider myself technical and those words felt like a punch in the gut.

chagall@lemmy.world · 11 months ago

I’m sorry if I offended. I can’t code or understand existing code and have always felt that technical people code. I guess I should expand my definition. Again, sorry that my words felt like a punch in the gut… wasn’t my intention at all.

IsoKiero@sopuli.xyz · 11 months ago

It depends heavily on what you do and what you’re comparing yourself against. I’ve been making a living with IT for nearly 20 years and I still don’t consider myself to be an expert on anything, but it’s a really wide field and what I’ve learned that the things I consider ‘easy’ or ‘simple’ (mostly with linux servers) are surprisingly difficult for people who’d (for example) wipe the floor with me if we competed on planning and setting up an server infrastructure or build enterprise networks.

And of course I’ve also met the other end of spectrum. People who claim to be ‘experts’ or ‘senior techs’ at something are so incompetent on their tasks or their field of knowledge is so ridiculously narrow that I wouldn’t trust them with anything above first tier helpdesk if even that. And the sad part is that those ‘experts’ often make way more money than me because they happened to score a job on some big IT company and their hours are billed accordingly.

And then there’s the whole other can of worms on a forums like this where ‘technical people’ range from someone who can install a operating system by following instructions to the guys who write assembly code to some obscure old hardware just for the fun of it.

dan@upvote.au · 11 months ago

It’s a much smaller scale but I use a Coral TPU with CodeProject AI to detect when people or animals are in front of my house. Works well with Blue Iris (NVR software for security cameras). I like it. That’s all the self-hosted AI I’ve got for now.

CallMeButtLove@lemmy.world · 11 months ago

Is there a way to host an LLM in a docker container on my home server but still leverage the GPU on my main PC?

Kairos@lemmy.today · 11 months ago

toynbee@lemmy.world · 11 months ago

With all respect, the first paragraph seems self contradictory.

Appoxo@lemmy.dbzer0.com · 11 months ago

Very technical vs not can be very subjective.
It can be a 50 year old sysadmin vs Adam I pulled from the street or a graybeard linux admin vs a beginner sysadmin only in it for thr career instead of the passion (those can be very non-technical but good problem solver folks)

I know my comparison is flawed

superglue@lemmy.dbzer0.com · 11 months ago

What kinds of specs do you need to run it well? I’ve got a laptop with a 3070.

coffee_with_cream@sh.itjust.works · edit-2 11 months ago

You probably want 48gb of vram or more to run the good stuff. I recommend renting GPU time instead of using your own hardware, via AWS or other vendors - runpod.io is pretty good.

NotMyOldRedditName@lemmy.world · 11 months ago

Kinda defeats the purpose of doing it private and local.

I wouldn’t trust any claims a 3rd party service makes with regards to being private.

Goodtoknow@lemmy.ca · 11 months ago

Have you found much practical use for small models yet? I love the idea that even the 1.1B tinyllama model can run on my phone, but haven’t found much real world use for it yet. Llama3 8b feels better, but not much better for even emails as it’s a bit dumb

chagall@lemmy.world · 11 months ago

I use my phone all the time, but I just use a wireguard VPN to tunnel into my home container of Open WebUI. Then I can interact with my desktop machine using a NVIDIA gpu. I’m currently testing mistral-nemo. It’s pretty great but it gets a bit verbose sometimes.

kureta@lemmy.ml · 11 months ago

I am also using open webui. Most LLMs are too verbose for me, so I created a model in open-webui with system prompt “Do not repeat the questions. Avoid giving lists as answers. Do not summarize the answer at the end. If asked a follow-up question, respond with only new information, do not repeat previously stated information.” and named it No Nonsense.

kate@lemmy.uhhoh.com · 11 months ago

for some reason chatgpt responds well to “no yapping”

Decronym@lemmy.decronym.xyz · edit-2 11 months ago

Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:

Fewer Letters	More Letters
NVR	Network Video Recorder (generally for CCTV)
PSU	Power Supply Unit
VPN	Virtual Private Network

3 acronyms in this thread; the most compressed thread commented on today has 12 acronyms.

[Thread #917 for this sub, first seen 12th Aug 2024, 07:15] [FAQ] [Full list] [Contact] [Source code]

Dataprolet@lemmy.dbzer0.com · 11 months ago

Isn’t this using a lot of computing power?

Swedneck@discuss.tchncs.de · edit-2 11 months ago

you hear that said about AI because companies are desperately throwing more and more resources at it to get 0.3% better results, and people are collectively running an insane amount of prompts all the time.

but on a personal level it’s not really any different from any other computations, people render videos all the time and no one complains about the resource usage from that, because companies aren’t trying to sell bloated video rendering services to gardening businesses.

MangoPenguin@lemmy.blahaj.zone · 11 months ago

Not really, it uses some GPU power when it’s actively generating a response, but otherwise it just sits idle.