cross-posted from: https://lemmy.world/post/27013201

Ollama lets you download and run large language models (LLMs) on your device.

Install Ollama on Arch Linux (Windows guide coming soon)

  1. Check whether your device has an AMD GPU, NVIDIA GPU, or no GPU. A GPU is recommended but not required.
  2. Open Console, type only one of the following commands and press return. This may ask for your password but not show you typing it.
sudo pacman -S ollama-rocm    # for AMD GPU
sudo pacman -S ollama-cuda    # for NVIDIA GPU
sudo pacman -S ollama         # for no GPU (for CPU)
  1. Enable the Ollama service [on-device and runs in the background] to start with your device and start it now.
sudo systemctl enable --now ollama

Test Ollama alone (Open WebUI guide coming soon)

  1. Open localhost:11434 in a web browser and you should see Ollama is running. This shows Ollama is installed and its service is running.
  2. Run ollama run deepseek-r1 in a console and ollama ps in another, to download and run the DeepSeek R1 model while seeing whether Ollama is using your slow CPU or fast GPU.

AMD GPU issue Fix

Coming soon

  • mierdabird@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    2
    ·
    48 minutes ago

    Additionally here’s an excellent tutorial to do it via Docker Compose in Cosmos Cloud:

    https://discord.com/channels/1083875833824944188/1283413252356706416/1283785229923909694

    1. Locate the Docker Compose File:

      • Find the Docker Compose file for the desired service. For example, you can use the Open Chat UI and Ollama setup from the following GitHub repository: Open Chat UI Docker Compose File
    2. Docker Compose Configuration:

    3. Import the Docker Compose File:

      • Go to the ServApps section in Cosmos.
      • Click on Import Compose File.
      • Click Next and proceed to add or modify the configuration. Replace any ${} placeholders with the desired values, such as latest or any specific version you prefer. You can also make changes later if needed.
    4. Create the Docker Containers:

      • Click Create. The Docker containers will be automatically created based on the provided configuration.
    5. Access the Service:

      • After the containers are created, you can access the service by adding a new URL.
      • Cosmos typically auto-detects the correct port, so you usually only need to create the URL.
      • You can now access the service as usual.
    6. Customize with Labels:

      • Optionally, you can customize the service further using labels such as:
        • cosmos-icon
        • cosmos-stack
        • cosmos-stack-main
    7. Modify Environment Variables:

      • Refer to the documentation for the Docker service and its environment variables. You can modify these variables directly from the Docker or Compose tab in Cosmos.

    That’s it!

    • Autonomous User@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      2 minutes ago

      Cosmos Cloud fails to include a libre software license text file. We do not control it, anti-libre software. This defeats the purpose of running Ollama, libre software, on our own device.

      Also, although Docker on Arch Linux is fine, Docker Desktop used to install Docker on Windows is also anti-libre software. The upcoming guide will provide a workaround. Of course, Windows is anti-libre software, so this is for harm reduction at best.

      However, thank you for freeing this information from Discord, also anti-libre software.