
What this pattern does:

Deploys a TorchServe inference server with a prepared T5 model, together with a client application. The manifests were tested against a GKE Autopilot Kubernetes cluster.
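As a rough illustration of how a client application might talk to the deployed server, the sketch below builds a request for TorchServe's inference API, which serves predictions at `/predictions/{model_name}` (port 8080 by default). The base URL, the model name `t5-small`, and the JSON payload shape are assumptions for illustration only; the fields actually accepted depend on the model handler packaged with this pattern.

```python
import json
import urllib.request


def build_prediction_request(base_url, model_name, payload):
    """Build a POST request for TorchServe's /predictions/{model_name} endpoint."""
    url = f"{base_url.rstrip('/')}/predictions/{model_name}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def predict(base_url, model_name, payload, timeout=30.0):
    """Send the request and return the raw response body as text."""
    request = build_prediction_request(base_url, model_name, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Hypothetical values: adjust to the Service address, the registered model
# name, and the input format that the deployed handler expects.
example_request = build_prediction_request(
    "http://localhost:8080/",
    "t5-small",
    {"text": "this is a test sentence", "from": "en", "to": "fr"},
)
```

With the inference Service port-forwarded to localhost, calling `predict(...)` with the same arguments would send the request; the response format depends on the handler.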

Caveats and Considerations:

To configure a Horizontal Pod Autoscaler (HPA) based on metrics from TorchServe, you need to: (1) enable Google Managed Prometheus or install OSS Prometheus; (2) install the Custom Metrics Adapter; (3) apply pod-monitoring.yaml and hpa.yaml.
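To make the shape of such an HPA concrete, here is a minimal sketch that generates an `autoscaling/v2` HorizontalPodAutoscaler manifest driven by a per-pod custom metric. It is not the hpa.yaml shipped with this pattern: the Deployment name, metric name, target value, and replica bounds are all hypothetical placeholders, and the real metric name depends on how the Custom Metrics Adapter exposes the TorchServe metrics.

```python
import json


def build_hpa_manifest(deployment, metric_name, target_average,
                       min_replicas=1, max_replicas=5):
    """Return an autoscaling/v2 HPA manifest that scales on a Pods metric."""
    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": deployment},
        "spec": {
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": "Deployment",
                "name": deployment,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            "metrics": [
                {
                    "type": "Pods",
                    "pods": {
                        "metric": {"name": metric_name},
                        # Kubernetes quantities are passed as strings.
                        "target": {
                            "type": "AverageValue",
                            "averageValue": str(target_average),
                        },
                    },
                }
            ],
        },
    }


# Hypothetical names and values, for illustration only.
manifest = build_hpa_manifest(
    deployment="t5-inference",
    metric_name="example_torchserve_queue_latency",
    target_average=30000,
)

if __name__ == "__main__":
    # kubectl accepts JSON as well as YAML, e.g. `kubectl apply -f hpa.json`.
    print(json.dumps(manifest, indent=2))
```

The HPA can only resolve the metric once the monitoring stack and the adapter from the steps above are in place; until then it reports the metric as unavailable and does not scale.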

Compatibility:


Serving T5 Large Language Model with TorchServe
