Control Plane API
Packages:
serving.kserve.io/v1beta1
Package v1beta1 contains API Schema definitions for the serving v1beta1 API group
Resource Types:
AIXExplainerSpec
(Appears on:ExplainerSpec)
AIXExplainerSpec defines the arguments for configuring an AIX Explanation Server
| Field | Description |
|---|---|
typeAIXExplainerType |
The type of AIX explainer |
ExplainerExtensionSpecExplainerExtensionSpec |
(Members of Contains fields shared across all explainers |
AIXExplainerType
(string alias)
(Appears on:AIXExplainerSpec)
| Value | Description |
|---|---|
|
"LimeImages" |
ARTExplainerSpec
(Appears on:ExplainerSpec)
ARTExplainerType defines the arguments for configuring an ART Explanation Server
| Field | Description |
|---|---|
typeARTExplainerType |
The type of ART explainer |
ExplainerExtensionSpecExplainerExtensionSpec |
(Members of Contains fields shared across all explainers |
ARTExplainerType
(string alias)
(Appears on:ARTExplainerSpec)
| Value | Description |
|---|---|
|
"SquareAttack" |
AlibiExplainerSpec
(Appears on:ExplainerSpec)
AlibiExplainerSpec defines the arguments for configuring an Alibi Explanation Server
| Field | Description |
|---|---|
typeAlibiExplainerType |
The type of Alibi explainer |
ExplainerExtensionSpecExplainerExtensionSpec |
(Members of Contains fields shared across all explainers |
AlibiExplainerType
(string alias)
(Appears on:AlibiExplainerSpec)
AlibiExplainerType is the explanation method
| Value | Description |
|---|---|
|
"AnchorImages" |
|
|
"AnchorTabular" |
|
|
"AnchorText" |
|
|
"Contrastive" |
|
|
"Counterfactuals" |
Batcher
(Appears on:ComponentExtensionSpec)
Batcher specifies optional payload batching available for all components
| Field | Description |
|---|---|
maxBatchSizeint |
(Optional)
Specifies the max number of requests to trigger a batch |
maxLatencyint |
(Optional)
Specifies the max latency to trigger a batch |
timeoutint |
(Optional)
Specifies the timeout of a batch |
Component
Component interface is implemented by all specs that contain component implementations, e.g. PredictorSpec, ExplainerSpec, TransformerSpec.
ComponentExtensionSpec
(Appears on:ExplainerSpec, PredictorSpec, TransformerSpec)
ComponentExtensionSpec defines the deployment configuration for a given InferenceService component
| Field | Description |
|---|---|
minReplicasint |
(Optional)
Minimum number of replicas, defaults to 1 but can be set to 0 to enable scale-to-zero. |
maxReplicasint |
(Optional)
Maximum number of replicas for autoscaling. |
containerConcurrencyint64 |
(Optional)
ContainerConcurrency specifies how many requests can be processed concurrently, this sets the hard limit of the container concurrency(https://knative.dev/docs/serving/autoscaling/concurrency). |
timeoutint64 |
(Optional)
TimeoutSeconds specifies the number of seconds to wait before timing out a request to the component. |
canaryTrafficPercentint64 |
(Optional)
CanaryTrafficPercent defines the traffic split percentage between the candidate revision and the last ready revision |
loggerLoggerSpec |
(Optional)
Activate request/response logging and logger configurations |
batcherBatcher |
(Optional)
Activate request batching and batching configurations |
ComponentImplementation
ComponentImplementation interface is implemented by predictor, transformer, and explainer implementations
ComponentStatusSpec
(Appears on:InferenceServiceStatus)
ComponentStatusSpec describes the state of the component
| Field | Description |
|---|---|
latestReadyRevisionstring |
(Optional)
Latest revision name that is in ready state |
latestCreatedRevisionstring |
(Optional)
Latest revision name that is created |
previousRolledoutRevisionstring |
(Optional)
Previous revision name that is rolled out with 100 percent traffic |
latestRolledoutRevisionstring |
(Optional)
Latest revision name that is rolled out with 100 percent traffic |
traffic[]knative.dev/serving/pkg/apis/serving/v1.TrafficTarget |
(Optional)
Traffic holds the configured traffic distribution for latest ready revision and previous rolled out revision. |
urlknative.dev/pkg/apis.URL |
(Optional)
URL holds the url that will distribute traffic over the provided traffic targets. It generally has the form http[s]://{route-name}.{route-namespace}.{cluster-level-suffix} |
addressknative.dev/pkg/apis/duck/v1.Addressable |
(Optional)
Addressable endpoint for the InferenceService |
ComponentType
(string alias)
ComponentType contains the different types of components of the service
| Value | Description |
|---|---|
|
"explainer" |
|
|
"predictor" |
|
|
"transformer" |
CustomExplainer
CustomExplainer defines arguments for configuring a custom explainer.
| Field | Description |
|---|---|
PodSpecKubernetes core/v1.PodSpec |
(Members of |
CustomPredictor
CustomPredictor defines arguments for configuring a custom server.
| Field | Description |
|---|---|
PodSpecKubernetes core/v1.PodSpec |
(Members of |
CustomTransformer
CustomTransformer defines arguments for configuring a custom transformer.
| Field | Description |
|---|---|
PodSpecKubernetes core/v1.PodSpec |
(Members of |
DeployConfig
| Field | Description |
|---|---|
defaultDeploymentModestring |
ExplainerConfig
(Appears on:ExplainersConfig)
| Field | Description |
|---|---|
imagestring |
explainer docker image name |
defaultImageVersionstring |
default explainer docker image version |
ExplainerExtensionSpec
(Appears on:AIXExplainerSpec, ARTExplainerSpec, AlibiExplainerSpec)
ExplainerExtensionSpec defines configuration shared across all explainer frameworks
| Field | Description |
|---|---|
storageUristring |
The location of a trained explanation model |
runtimeVersionstring |
Defaults to latest Explainer Version |
configmap[string]string |
Inline custom parameter settings for explainer |
ContainerKubernetes core/v1.Container |
(Members of Container enables overrides for the predictor. Each framework will have different defaults that are populated in the underlying container spec. |
ExplainerSpec
(Appears on:InferenceServiceSpec)
ExplainerSpec defines the container spec for a model explanation server, The following fields follow a “1-of” semantic. Users must specify exactly one spec.
| Field | Description |
|---|---|
alibiAlibiExplainerSpec |
Spec for alibi explainer |
aixAIXExplainerSpec |
Spec for AIX explainer |
artARTExplainerSpec |
Spec for ART explainer |
PodSpecPodSpec |
(Members of This spec is dual purpose. 1) Users may choose to provide a full PodSpec for their custom explainer. The field PodSpec.Containers is mutually exclusive with other explainers (i.e. Alibi). 2) Users may choose to provide a Explainer (i.e. Alibi) and specify PodSpec overrides in the PodSpec. They must not provide PodSpec.Containers in this case. |
ComponentExtensionSpecComponentExtensionSpec |
(Members of Component extension defines the deployment configurations for explainer |
ExplainersConfig
(Appears on:InferenceServicesConfig)
| Field | Description |
|---|---|
alibiExplainerConfig |
|
aixExplainerConfig |
|
artExplainerConfig |
InferenceService
InferenceService is the Schema for the InferenceServices API
| Field | Description | ||||||
|---|---|---|---|---|---|---|---|
metadataKubernetes meta/v1.ObjectMeta |
Refer to the Kubernetes API documentation for the fields of the
metadata field.
|
||||||
specInferenceServiceSpec |
|
||||||
statusInferenceServiceStatus |
InferenceServiceSpec
(Appears on:InferenceService)
InferenceServiceSpec is the top level type for this resource
| Field | Description |
|---|---|
predictorPredictorSpec |
Predictor defines the model serving spec |
explainerExplainerSpec |
(Optional)
Explainer defines the model explanation service spec, explainer service calls to predictor or transformer if it is specified. |
transformerTransformerSpec |
(Optional)
Transformer defines the pre/post processing before and after the predictor call, transformer service calls to predictor service. |
InferenceServiceStatus
(Appears on:InferenceService)
InferenceServiceStatus defines the observed state of InferenceService
| Field | Description |
|---|---|
Statusknative.dev/pkg/apis/duck/v1.Status |
(Members of Conditions for the InferenceService |
addressknative.dev/pkg/apis/duck/v1.Addressable |
(Optional)
Addressable endpoint for the InferenceService |
urlknative.dev/pkg/apis.URL |
(Optional)
URL holds the url that will distribute traffic over the provided traffic targets. It generally has the form http[s]://{route-name}.{route-namespace}.{cluster-level-suffix} |
componentsmap[kserve.io/v1beta1/pkg/apis/serving/v1beta1.ComponentType]kserve.io/v1beta1/pkg/apis/serving/v1beta1.ComponentStatusSpec |
Statuses for the components of the InferenceService |
InferenceServicesConfig
| Field | Description |
|---|---|
transformersTransformersConfig |
Transformer configurations |
predictorsPredictorsConfig |
Predictor configurations |
explainersExplainersConfig |
Explainer configurations |
IngressConfig
| Field | Description |
|---|---|
ingressGatewaystring |
|
ingressServicestring |
|
localGatewaystring |
|
localGatewayServicestring |
|
ingressDomainstring |
LightGBMSpec
(Appears on:PredictorSpec)
LightGBMSpec defines arguments for configuring LightGBMSpec model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
LoggerSpec
(Appears on:ComponentExtensionSpec)
LoggerSpec specifies optional payload logging available for all components
| Field | Description |
|---|---|
urlstring |
(Optional)
URL to send logging events |
modeLoggerType |
(Optional)
Specifies the scope of the loggers. |
LoggerType
(string alias)
(Appears on:LoggerSpec)
LoggerType controls the scope of log publishing
| Value | Description |
|---|---|
|
"all" |
Logger mode to log both request and response |
|
"request" |
Logger mode to log only request |
|
"response" |
Logger mode to log only response |
ModelFormat
(Appears on:ModelSpec)
| Field | Description |
|---|---|
namestring |
Name of the model format. |
versionstring |
(Optional)
Version of the model format. Used in validating that a predictor is supported by a runtime. Can be “major”, “major.minor” or “major.minor.patch”. |
ModelSpec
(Appears on:PredictorSpec)
| Field | Description |
|---|---|
modelFormatModelFormat |
ModelFormat being served. |
runtimestring |
(Optional)
Specific ClusterServingRuntime/ServingRuntime name to use for deployment. |
PredictorExtensionSpecPredictorExtensionSpec |
(Members of |
ONNXRuntimeSpec
(Appears on:PredictorSpec)
ONNXRuntimeSpec defines arguments for configuring ONNX model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
PMMLSpec
(Appears on:PredictorSpec)
PMMLSpec defines arguments for configuring PMML model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
PaddleServerSpec
(Appears on:PredictorSpec)
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of |
PodSpec
(Appears on:ExplainerSpec, PredictorSpec, TransformerSpec)
PodSpec is a description of a pod.
| Field | Description |
|---|---|
volumes[]Kubernetes core/v1.Volume |
(Optional)
List of volumes that can be mounted by containers belonging to the pod. More info: https://kubernetes.io/docs/concepts/storage/volumes |
initContainers[]Kubernetes core/v1.Container |
List of initialization containers belonging to the pod. Init containers are executed in order prior to containers being started. If any init container fails, the pod is considered to have failed and is handled according to its restartPolicy. The name for an init container or normal container must be unique among all containers. Init containers may not have Lifecycle actions, Readiness probes, Liveness probes, or Startup probes. The resourceRequirements of an init container are taken into account during scheduling by finding the highest request/limit for each resource type, and then using the max of of that value or the sum of the normal containers. Limits are applied to init containers in a similar fashion. Init containers cannot currently be added or removed. Cannot be updated. More info: https://kubernetes.io/docs/concepts/workloads/pods/init-containers/ |
containers[]Kubernetes core/v1.Container |
List of containers belonging to the pod. Containers cannot currently be added or removed. There must be at least one container in a Pod. Cannot be updated. |
ephemeralContainers[]Kubernetes core/v1.EphemeralContainer |
(Optional)
List of ephemeral containers run in this pod. Ephemeral containers may be run in an existing pod to perform user-initiated actions such as debugging. This list cannot be specified when creating a pod, and it cannot be modified by updating the pod spec. In order to add an ephemeral container to an existing pod, use the pod’s ephemeralcontainers subresource. This field is alpha-level and is only honored by servers that enable the EphemeralContainers feature. |
restartPolicyKubernetes core/v1.RestartPolicy |
(Optional)
Restart policy for all containers within the pod. One of Always, OnFailure, Never. Default to Always. More info: https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#restart-policy |
terminationGracePeriodSecondsint64 |
(Optional)
Optional duration in seconds the pod needs to terminate gracefully. May be decreased in delete request. Value must be non-negative integer. The value zero indicates delete immediately. If this value is nil, the default grace period will be used instead. The grace period is the duration in seconds after the processes running in the pod are sent a termination signal and the time when the processes are forcibly halted with a kill signal. Set this value longer than the expected cleanup time for your process. Defaults to 30 seconds. |
activeDeadlineSecondsint64 |
(Optional)
Optional duration in seconds the pod may be active on the node relative to StartTime before the system will actively try to mark it failed and kill associated containers. Value must be a positive integer. |
dnsPolicyKubernetes core/v1.DNSPolicy |
(Optional)
Set DNS policy for the pod. Defaults to “ClusterFirst”. Valid values are ‘ClusterFirstWithHostNet’, ‘ClusterFirst’, ‘Default’ or ‘None’. DNS parameters given in DNSConfig will be merged with the policy selected with DNSPolicy. To have DNS options set along with hostNetwork, you have to specify DNS policy explicitly to ‘ClusterFirstWithHostNet’. |
nodeSelectormap[string]string |
(Optional)
NodeSelector is a selector which must be true for the pod to fit on a node. Selector which must match a node’s labels for the pod to be scheduled on that node. More info: https://kubernetes.io/docs/concepts/configuration/assign-pod-node/ |
serviceAccountNamestring |
(Optional)
ServiceAccountName is the name of the ServiceAccount to use to run this pod. More info: https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/ |
serviceAccountstring |
(Optional)
DeprecatedServiceAccount is a depreciated alias for ServiceAccountName. Deprecated: Use serviceAccountName instead. |
automountServiceAccountTokenbool |
(Optional)
AutomountServiceAccountToken indicates whether a service account token should be automatically mounted. |
nodeNamestring |
(Optional)
NodeName is a request to schedule this pod onto a specific node. If it is non-empty, the scheduler simply schedules this pod onto that node, assuming that it fits resource requirements. |
hostNetworkbool |
(Optional)
Host networking requested for this pod. Use the host’s network namespace. If this option is set, the ports that will be used must be specified. Default to false. |
hostPIDbool |
(Optional)
Use the host’s pid namespace. Optional: Default to false. |
hostIPCbool |
(Optional)
Use the host’s ipc namespace. Optional: Default to false. |
shareProcessNamespacebool |
(Optional)
Share a single process namespace between all of the containers in a pod. When this is set containers will be able to view and signal processes from other containers in the same pod, and the first process in each container will not be assigned PID 1. HostPID and ShareProcessNamespace cannot both be set. Optional: Default to false. |
securityContextKubernetes core/v1.PodSecurityContext |
(Optional)
SecurityContext holds pod-level security attributes and common container settings. Optional: Defaults to empty. See type description for default values of each field. |
imagePullSecrets[]Kubernetes core/v1.LocalObjectReference |
(Optional)
ImagePullSecrets is an optional list of references to secrets in the same namespace to use for pulling any of the images used by this PodSpec. If specified, these secrets will be passed to individual puller implementations for them to use. For example, in the case of docker, only DockerConfig type secrets are honored. More info: https://kubernetes.io/docs/concepts/containers/images#specifying-imagepullsecrets-on-a-pod |
hostnamestring |
(Optional)
Specifies the hostname of the Pod If not specified, the pod’s hostname will be set to a system-defined value. |
subdomainstring |
(Optional)
If specified, the fully qualified Pod hostname will be “ |
affinityKubernetes core/v1.Affinity |
(Optional)
If specified, the pod’s scheduling constraints |
schedulerNamestring |
(Optional)
If specified, the pod will be dispatched by specified scheduler. If not specified, the pod will be dispatched by default scheduler. |
tolerations[]Kubernetes core/v1.Toleration |
(Optional)
If specified, the pod’s tolerations. |
hostAliases[]Kubernetes core/v1.HostAlias |
(Optional)
HostAliases is an optional list of hosts and IPs that will be injected into the pod’s hosts file if specified. This is only valid for non-hostNetwork pods. |
priorityClassNamestring |
(Optional)
If specified, indicates the pod’s priority. “system-node-critical” and “system-cluster-critical” are two special keywords which indicate the highest priorities with the former being the highest priority. Any other name must be defined by creating a PriorityClass object with that name. If not specified, the pod priority will be default or zero if there is no default. |
priorityint32 |
(Optional)
The priority value. Various system components use this field to find the priority of the pod. When Priority Admission Controller is enabled, it prevents users from setting this field. The admission controller populates this field from PriorityClassName. The higher the value, the higher the priority. |
dnsConfigKubernetes core/v1.PodDNSConfig |
(Optional)
Specifies the DNS parameters of a pod. Parameters specified here will be merged to the generated DNS configuration based on DNSPolicy. |
readinessGates[]Kubernetes core/v1.PodReadinessGate |
(Optional)
If specified, all readiness gates will be evaluated for pod readiness. A pod is ready when all its containers are ready AND all conditions specified in the readiness gates have status equal to “True” More info: https://git.k8s.io/enhancements/keps/sig-network/0007-pod-ready%2B%2B.md |
runtimeClassNamestring |
(Optional)
RuntimeClassName refers to a RuntimeClass object in the node.k8s.io group, which should be used to run this pod. If no RuntimeClass resource matches the named class, the pod will not be run. If unset or empty, the “legacy” RuntimeClass will be used, which is an implicit class with an empty definition that uses the default runtime handler. More info: https://git.k8s.io/enhancements/keps/sig-node/runtime-class.md This is a beta feature as of Kubernetes v1.14. |
enableServiceLinksbool |
(Optional)
EnableServiceLinks indicates whether information about services should be injected into pod’s environment variables, matching the syntax of Docker links. Optional: Defaults to true. |
preemptionPolicyKubernetes core/v1.PreemptionPolicy |
(Optional)
PreemptionPolicy is the Policy for preempting pods with lower priority. One of Never, PreemptLowerPriority. Defaults to PreemptLowerPriority if unset. This field is beta-level, gated by the NonPreemptingPriority feature-gate. |
overheadKubernetes core/v1.ResourceList |
(Optional)
Overhead represents the resource overhead associated with running a pod for a given RuntimeClass. This field will be autopopulated at admission time by the RuntimeClass admission controller. If the RuntimeClass admission controller is enabled, overhead must not be set in Pod create requests. The RuntimeClass admission controller will reject Pod create requests which have the overhead already set. If RuntimeClass is configured and selected in the PodSpec, Overhead will be set to the value defined in the corresponding RuntimeClass, otherwise it will remain unset and treated as zero. More info: https://git.k8s.io/enhancements/keps/sig-node/20190226-pod-overhead.md This field is alpha-level as of Kubernetes v1.16, and is only honored by servers that enable the PodOverhead feature. |
topologySpreadConstraints[]Kubernetes core/v1.TopologySpreadConstraint |
(Optional)
TopologySpreadConstraints describes how a group of pods ought to spread across topology domains. Scheduler will schedule pods in a way which abides by the constraints. All topologySpreadConstraints are ANDed. |
setHostnameAsFQDNbool |
(Optional)
If true the pod’s hostname will be configured as the pod’s FQDN, rather than the leaf name (the default). In Linux containers, this means setting the FQDN in the hostname field of the kernel (the nodename field of struct utsname). In Windows containers, this means setting the registry value of hostname for the registry key HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters to FQDN. If a pod does not have FQDN, this has no effect. Default to false. |
PredictorConfig
(Appears on:PredictorProtocols, PredictorsConfig)
| Field | Description |
|---|---|
imagestring |
predictor docker image name |
defaultImageVersionstring |
default predictor docker image version on cpu |
defaultGpuImageVersionstring |
default predictor docker image version on gpu |
defaultTimeout,stringint64 |
Default timeout of predictor for serving a request, in seconds |
multiModelServer,booleanbool |
Flag to determine if multi-model serving is supported |
supportedFrameworks[]string |
frameworks the model agent is able to run |
PredictorExtensionSpec
(Appears on:LightGBMSpec, ModelSpec, ONNXRuntimeSpec, PMMLSpec, PaddleServerSpec, SKLearnSpec, TFServingSpec, TorchServeSpec, TritonSpec, XGBoostSpec)
PredictorExtensionSpec defines configuration shared across all predictor frameworks
| Field | Description |
|---|---|
storageUristring |
(Optional)
This field points to the location of the trained model which is mounted onto the pod. |
runtimeVersionstring |
(Optional)
Runtime version of the predictor docker image |
protocolVersiongithub.com/kserve/kserve/pkg/constants.InferenceServiceProtocol |
(Optional)
Protocol version to use by the predictor (i.e. v1 or v2) |
ContainerKubernetes core/v1.Container |
(Members of Container enables overrides for the predictor. Each framework will have different defaults that are populated in the underlying container spec. |
PredictorImplementation
PredictorImplementation defines common functions for all predictors e.g Tensorflow, Triton, etc
PredictorProtocols
(Appears on:PredictorsConfig)
| Field | Description |
|---|---|
v1PredictorConfig |
|
v2PredictorConfig |
PredictorSpec
(Appears on:InferenceServiceSpec)
PredictorSpec defines the configuration for a predictor, The following fields follow a “1-of” semantic. Users must specify exactly one spec.
| Field | Description |
|---|---|
sklearnSKLearnSpec |
Spec for SKLearn model server |
xgboostXGBoostSpec |
Spec for XGBoost model server |
tensorflowTFServingSpec |
Spec for TFServing (https://github.com/tensorflow/serving) |
pytorchTorchServeSpec |
Spec for TorchServe (https://pytorch.org/serve) |
tritonTritonSpec |
Spec for Triton Inference Server (https://github.com/triton-inference-server/server) |
onnxONNXRuntimeSpec |
Spec for ONNX runtime (https://github.com/microsoft/onnxruntime) |
pmmlPMMLSpec |
Spec for PMML (http://dmg.org/pmml/v4-1/GeneralStructure.html) |
lightgbmLightGBMSpec |
Spec for LightGBM model server |
paddlePaddleServerSpec |
Spec for Paddle model server (https://github.com/PaddlePaddle/Serving) |
modelModelSpec |
Model spec for any arbitrary framework. |
PodSpecPodSpec |
(Members of This spec is dual purpose. |
ComponentExtensionSpecComponentExtensionSpec |
(Members of Component extension defines the deployment configurations for a predictor |
PredictorsConfig
(Appears on:InferenceServicesConfig)
| Field | Description |
|---|---|
tensorflowPredictorConfig |
|
tritonPredictorConfig |
|
xgboostPredictorProtocols |
|
sklearnPredictorProtocols |
|
pytorchPredictorConfig |
|
onnxPredictorConfig |
|
pmmlPredictorConfig |
|
lightgbmPredictorConfig |
|
paddlePredictorConfig |
SKLearnSpec
(Appears on:PredictorSpec)
SKLearnSpec defines arguments for configuring SKLearn model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
TFServingSpec
(Appears on:PredictorSpec)
TFServingSpec defines arguments for configuring Tensorflow model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
TorchServeSpec
(Appears on:PredictorSpec)
TorchServeSpec defines arguments for configuring PyTorch model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
TransformerConfig
(Appears on:TransformersConfig)
| Field | Description |
|---|---|
imagestring |
transformer docker image name |
defaultImageVersionstring |
default transformer docker image version |
TransformerSpec
(Appears on:InferenceServiceSpec)
TransformerSpec defines transformer service for pre/post processing
| Field | Description |
|---|---|
PodSpecPodSpec |
(Members of This spec is dual purpose. |
ComponentExtensionSpecComponentExtensionSpec |
(Members of Component extension defines the deployment configurations for a transformer |
TransformersConfig
(Appears on:InferenceServicesConfig)
| Field | Description |
|---|---|
feastTransformerConfig |
TritonSpec
(Appears on:PredictorSpec)
TritonSpec defines arguments for configuring Triton model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
XGBoostSpec
(Appears on:PredictorSpec)
XGBoostSpec defines arguments for configuring XGBoost model serving.
| Field | Description |
|---|---|
PredictorExtensionSpecPredictorExtensionSpec |
(Members of Contains fields shared across all predictors |
Generated with gen-crd-api-reference-docs
on git commit ab2896d2.