<html>
<head>
<base href="https://bugzilla.suse.com/" />
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - [doc] Issue in "BASIC OVERRIDE FILE WITH GPU AND TWO MODELS PULLED AT STARTUP""
href="https://bugzilla.suse.com/show_bug.cgi?id=1249576">1249576</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>[doc] Issue in "BASIC OVERRIDE FILE WITH GPU AND TWO MODELS PULLED AT STARTUP"
</td>
</tr>
<tr>
<th>Classification</th>
<td>SUSE AI
</td>
</tr>
<tr>
<th>Product</th>
<td>SUSE AI Application Containers
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Hardware</th>
<td>Other
</td>
</tr>
<tr>
<th>OS</th>
<td>Other
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>Normal
</td>
</tr>
<tr>
<th>Priority</th>
<td>P5 - None
</td>
</tr>
<tr>
<th>Component</th>
<td>documentation
</td>
</tr>
<tr>
<th>Assignee</th>
<td>tbazant@suse.com
</td>
</tr>
<tr>
<th>Reporter</th>
<td>tapas.nandi@suse.com
</td>
</tr>
<tr>
<th>QA Contact</th>
<td>ai-maintainers@lists.suse.com
</td>
</tr>
<tr>
<th>Target Milestone</th>
<td>---
</td>
</tr>
<tr>
<th>Found By</th>
<td>---
</td>
</tr>
<tr>
<th>Blocker</th>
<td>---
</td>
</tr></table>
<p>
<div>
<pre>BASIC OVERRIDE FILE WITH GPU AND TWO MODELS PULLED AT STARTUP:
<a href="https://documentation.suse.com/suse-ai/1.0/html/AI-deployment-intro/index.html#id-1.5.5.7.3">https://documentation.suse.com/suse-ai/1.0/html/AI-deployment-intro/index.html#id-1.5.5.7.3</a>
The provided override file does not utilize the nvidia gou when ollma is
installed.
When tried to run nvidia-smi command inside ollama container is says command
not found as well as when running models it runs without any gpu.
the below modified overdie file works:
global:
imagePullSecrets:
- application-collection
ingress:
enabled: false
defaultModel: "gemma:2b"
runtimeClassName: nvidia
ollama:
models:
pull:
- "gemma:2b"
- "llama3.1"
run:
- "gemma:2b"
- "llama3.1"
gpu:
enabled: true
type: 'nvidia'
number: 1
nvidiaResource: "nvidia.com/gpu"
persistentVolume:
enabled: true
storageClass: longhorn
Request:
replace the override file in the documentation with the above given override
file.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the QA Contact for the bug.</li>
</ul>
</body>
</html>