MLCompilerBridge
Tools for streamlining communication with ML models for compiler optimizations.
|
gRPCModelRunner class supporting communication via gRPC. More...
#include "MLModelRunner/MLModelRunner.h"
#include <future>
#include <google/protobuf/text_format.h>
#include <grpcpp/grpcpp.h>
#include <grpcpp/health_check_service_interface.h>
#include <memory>
#include <thread>
Go to the source code of this file.
Classes | |
class | MLBridge::gRPCModelRunner< Client, Stub, Request, Response > |
This class is used to create the grpc model runner object. More... | |
Namespaces | |
namespace | MLBridge |
gRPCModelRunner class supporting communication via gRPC.
This class is used to communicate with the gRPC server and send/receive data to/from the model. Supports interleaved communication with the model.
There are two ways to use this class:
In Inference mode, the compiler pass using this class acts as the client and sends the request to the server implemented by using GrpcCompilerInterface.py.
Interfacing with the model using protobuf (.proto) files: Users should define the service, stub, request and response by writing a .proto file. The service and stub objects are generated using the protoc compiler. The request and response objects are generated using the protoc compiler or created by the user. The build process will automatically compile .proto files and generates the relevant stubs and request/response objects in both CPP and Python which will be used by gRPCModelRunner and GrpcCompilerInterface respectively..
In Inference mode, users should override getAdvice()
RPC method in the Python model code to process the request and send the response back to the client. This method is called by the gRPC server in the evaluate_untyped() method of gRPCModelRunner class to get the result from the model after populating the features in the request object.
In Training mode, users should override the RPC function/service that they declare in the .proto file in the Compiler pass which is using the gRPCModelRunner. This function is called by the gRPC client (Python model) that is using the GrpcCompilerInterface class to send the request to the server.
Usage:
Definition in file gRPCModelRunner.h.