Zandebar

Forum Replies Created

Viewing 5 posts - 16 through 20 (of 20 total)
  • Author
    Posts
  • in reply to: Local Install vs GPU / Render Farms (online GPU) #15846
    Zandebar
    Participant

      On the GPU hardware side, I’m having trouble working out what you can do in each VRAM stack of each card.

      What are the limitation of each, what can’t you do with 12gb, 16gb, 24gb and 48gb (currently).

      How much headroom do you need to give other resources in VRAM other than the model, I’m having a guess at 2gb, I don’t know if that’s right but it looks fair.

      So that would potentially mean (am guessing).

      • 12gb = 10gb Max model size
      • 16gb =14gb Max model size
      • 24gb = 22gb Max model size
      • 48gb = 46gb Max model size

       

      How large do these models get, then there’s the workflow to consider how does that affect the VRAM?

       

      If you have a look at Flux.1 the information I found @

      https://medium.com/@researchgraph/the-ultimate-flux-1-hands-on-guide-067fc053fedd

      States:

      The regular version requires at least 32GB of system RAM. Testing shows that a 4090 GPU can fully occupy its memory. The dev-fp8 version is recommended for local use.

      * I assume when talking about system ram it means VRAM

       

      So what’s the difference of dev-fp8  to the regular model (this was covered in the courses)

       

      Would the GeForce RTX 4090 function using this model (GB size from Huggingface), we know that the new GeForce RTX 509o with 32gb (reportedly) will be able to handle this.

      flux1-dev.safetensors = 23.8GB

      flux1-schnell.safetensors = 23.8GB – This is the same size as pro

       

      Also Stable Diffusion 3.5 model

      This model fits inside the VRAM of the GeForce RTX 4090 but not the GeForce RTX 4080 Super Ti @ 16GB

      stable-diffusion-3.5-large = 16.5 GB

      These are the new models coming out and if I’m correct, these are starting to be prohibited for hobbyist / creative enthusiasts who can’t afford high VRAM Flagship GPU’s.  By running these models locally, what I’m trying to get at,  what can you do with the best card you can afford.

       

       

       

       

       

      in reply to: 24gb VRAM vs Architecture series #15816
      Zandebar
      Participant

        Oops: I’ve made a mistake and I can’t edit

        4080 Difference: +9.3% with the 5080

        in reply to: 24gb VRAM vs Architecture series #15815
        Zandebar
        Participant

          Hello

          Great and Thank You! You kind of confirmed what I was thinking.

          Right: I have a bottle neck, I’m based in the UK I only have a £1000 GBP to spend on a GPU, I’m a hobbyist and will not be making any money from this to justify the expense and the outlay. However I’m also not sure where I’ll be going with this so I’m looking for a hybrid solution to GPU needs.

          Let’s get this straight; in 12 months time when you get the RTX 5090 with 32GB VRAM, you’ll be saying Wow at the speed and recommending 32gb VRAM and not 24gb, when asked the very same question.

          Granted if your a pro then you’ll need the FLAGSHIP option. When your not a pro (like me) justifying the expense becomes hard when your on a tight budget and you have household bills to pay, I can only dream of owning the latest and greatest GPU. There is a compromise a cheaper option or rent a GPU from a render farm, I’m actually looking at both at the moment.

          NVidia GeForce RTX 4080 Super Ti 16gb VRAM  (I can afford right now), I’ll be able to learn SD and do a fair bit with 16GB VRAM. When I hit that wall and need extra VRAM I’ll out source the GPU to a render farm, with the render farm option I’ll just pay for what I use. This isn’t a good place to be really with the new RTX5000 series coming out, where you were only 2 thirds of the max VRAM the the 5000 series comes out your half the max size. Where the model size will only get bigger, I was bouncing around and saw a model size (flux) 14GB. Ouch! not much room for everything else that gets stored in VRAM. Chance are this size model would work in 16GB VRAM, and its only going to get bigger. We know that because of the increase of VRAM in the 5000 series, if you make more space people will fill more space. You can’t win being a hobbyist.

          I was also thinking, wait long enough the 3090 may fit in my budget:

          Do you get this craziness where you are?

          EVGA GeForce RTX 3090 Ti FTW3 ULTRA GAMING, 24G-P5-4985-KR, 24GB GDDR6X, iCX3, ARGB LED, Backplate, Free eLeash

          £2,094.22

          MSI GeForce RTX 4090 VENTUS 3X E 24G OC Gaming Graphics Card – 24GB GDDR6X, 2550 MHz, PCI Express Gen 4, 384-bit, 2x DP v 1.4a, HDMI 2.1a (Supports 4K & 8K HDR)

          £1,749.99

          GIGABYTE GeForce RTX 4090 GAMING OC 24GB Graphics Card – 24GB GDDR6X, PCI-E 4.0, Core 2535Mhz, RGB fusion, Anti-sag bracket, Metal back plate, DP 1.4, HDMI 2.1a, NVIDIA DLSS 3, GV-N4090GAMING OC-24GD

          £1,899.00

           

          Where the 4090 is cheaper than the 3090 CRAZY! OK the 3090 is not a toaster like the 4090 with the power socket issue. But still you would of thought there’ll be some rest bite for us hobbyist with an older series of card, Nah!! So where stuck at the next generation down, the 4080 ti super.

          And wait for it, Nvidia are not doing themselves any favours with the next generation of cards now that they have no competition. Look at this…

          RTX 5080
          TDP: 350W
          GPU Name: GB203
          GPCs: 7
          TPCs: 42
          SMs: 84
          Cores: 10752

          Tensor Cores: (likely) 384 (half the number of RTX 5090)
          Memory Configuration: 256-bit GDDR7 (16GB VRAM)

          Boost clock speed around 2.8 GHz

          RTX 4080
          Architecture: Ada Lovelace
          Process node: 4nm TSMC
          CUDA cores: 9,728
          Ray tracing cores: 76
          Tensor cores: 304
          Base clock speed: 2,205 MHz
          Maximum clock speed: 2,505 MHz
          Memory size: 16GB GDDR6X
          Memory speed: 21 Gbps
          Bus width: 256-bit
          Bandwidth: 912 GBps
          TBP: 320W

          4080 Difference: +9.3% with the 5090

          NVidia have got there head some where I can’t say here, but logically with the uplift in performance of the 5090, you would have thought a shift in the other models.

          GeForce RTX 5000 to resemble something like this in vram: 12gb (5060), 16gb (5070), 24gb (5080) and 32gb (5090)

          And the CUDA core count is not much higher, would have thought they’ll match the 4090 cores with the 5080. Core count @10752 you would have thought they’ll match at @16384 CUDA Cores. Given that’s its rumoured that the 5090 is having 21,760 CUDA cores. And the Tensor cores have dropped, maybe a good reason there but out of my scope.

          Logically that makes more sense, it just leaves us users of the products’ frustrated, plus if the 5080 with imaginary 24gb VRAM and 16384 CUDA Cores. This would almost match the 4090 and cause a price drop of remining units of 4090. Everyone wins, but NO…

          That’s why am waiting to see what the market does and see if these rumoured specs are true, and make a decision then. Either way the consumer is going to be at a dis-advantage give Nvidia previous history.

          In the meantime: Checking out GPU farms and what they can offer is looking like a good idea and could in principle be more beneficial. That’s out of scope for this thread, I’ll make one on GPU farms…

           

          Kind Regards

          Zandebar

           

           

           

           

           

           

           

           

           

          in reply to: Getting Colab Automatic1111 Working for the first time #15801
          Zandebar
          Participant

            Yes I can help you sort this

            in reply to: Getting Colab Automatic1111 Working for the first time #15800
            Zandebar
            Participant

              Yes I can help you sort this

            Viewing 5 posts - 16 through 20 (of 20 total)