Callbacks
After performing post(), you may want to further process the obtained results.
For this purpose, Jina implements a promise-like interface, letting you specify three kinds of callback functions:
on_done is executed while streaming, after successful completion of each request
on_error is executed while streaming, whenever an error occurs in a request
on_always is always executed while streaming, regardless of the success or failure of each request
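The relationship between the three callbacks can be pictured with a toy dispatcher. This is a plain-Python sketch, not Jina's implementation: the `dispatch` function, the `ok` flag on each response, and the order in which `on_always` runs relative to the other two are all assumptions made for illustration.

```python
from typing import Any, Callable, Iterable, Optional

Callback = Optional[Callable[[Any], None]]


def dispatch(
    responses: Iterable[dict],
    on_done: Callback = None,
    on_error: Callback = None,
    on_always: Callback = None,
) -> None:
    """Toy model: call the matching callbacks for each streamed response."""
    for resp in responses:
        if resp['ok']:
            # successful request: only on_done applies
            if on_done:
                on_done(resp)
        elif on_error:
            # failed request: only on_error applies
            on_error(resp)
        # on_always runs for every response (ordering here is an assumption)
        if on_always:
            on_always(resp)


calls = []
dispatch(
    [{'id': 1, 'ok': True}, {'id': 2, 'ok': False}],
    on_done=lambda r: calls.append(('done', r['id'])),
    on_error=lambda r: calls.append(('error', r['id'])),
    on_always=lambda r: calls.append(('always', r['id'])),
)
print(calls)
```

Each response triggers either `on_done` or `on_error`, plus `on_always`.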
Note that these callbacks only fire for requests (and failures) that occur inside the stream, for example inside an Executor.
If a failure is caused by an error outside of streaming, these callbacks are not triggered.
For example, a SIGKILL from the client OS during the handling of the request, or a networking issue,
will not trigger the callback.
Callback functions in Jina expect a Response of the type DataRequest, which contains resulting Documents,
parameters, and other information.
Handle DataRequest in callbacks
DataRequests are objects that are sent by Jina internally. Callback functions process DataRequests, and client.post()
can return DataRequests.
A DataRequest object is a container for the data relevant to a given request. It contains the following fields:
header: The request header.
from pprint import pprint
from jina import Client
Client().post(on='/', on_done=lambda x: pprint(x.header))
request_id: "ea504823e9de415d890a85d1d00ccbe9"
exec_endpoint: "/"
target_executor: ""
parameters: The input parameters of the associated request. In particular, DataRequest.parameters['__results__'] is a
reserved field that is populated when Executors return a Python dict.
The information in those returned dicts is collected here, keyed by Executor ID.
from pprint import pprint
from jina import Client
Client().post(on='/', on_done=lambda x: pprint(x.parameters))
{'__results__': {}}
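As a rough sketch of how a callback might read the reserved field, the helper below pulls the per-Executor dicts out of a parameters mapping. The helper name `results_by_executor`, the `top_k` parameter, and the key `executor0` are hypothetical; the exact form of the Executor ID keys in a real Flow is not shown in this section.

```python
def results_by_executor(parameters: dict) -> dict:
    """Return a copy of the dicts stored under the reserved '__results__' key."""
    return dict(parameters.get('__results__', {}))


# Hypothetical parameters, shaped like the description above
params = {'top_k': 5, '__results__': {'executor0': {'indexed': 3}}}
collected = results_by_executor(params)
print(collected)
```

In a callback this could be used as `on_done=lambda resp: print(results_by_executor(resp.parameters))`.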
routes: The routing information of the data request. It records which Executors have been called and the order in which they were called. The start and end time of each Executor call is also recorded.
from pprint import pprint
from jina import Client
Client().post(on='/', on_done=lambda x: pprint(x.routes))
[executor: "gateway"
start_time {
seconds: 1662637747
nanos: 790248000
}
end_time {
seconds: 1662637747
nanos: 794104000
}
, executor: "executor0"
start_time {
seconds: 1662637747
nanos: 790466000
}
end_time {
seconds: 1662637747
nanos: 793982000
}
]
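The timestamps in each route can be turned into a latency figure. The sketch below assumes each route exposes `executor`, `start_time`, and `end_time` with `seconds` and `nanos` attributes, as in the printed output above; `SimpleNamespace` objects stand in for the real route objects, and `route_latency_ns` is a hypothetical helper.

```python
from types import SimpleNamespace as NS


def route_latency_ns(route) -> int:
    """Elapsed time of one route in nanoseconds, using integer arithmetic."""
    start, end = route.start_time, route.end_time
    return (end.seconds - start.seconds) * 1_000_000_000 + (end.nanos - start.nanos)


# Stand-ins carrying the values from the output above
routes = [
    NS(
        executor='gateway',
        start_time=NS(seconds=1662637747, nanos=790248000),
        end_time=NS(seconds=1662637747, nanos=794104000),
    ),
    NS(
        executor='executor0',
        start_time=NS(seconds=1662637747, nanos=790466000),
        end_time=NS(seconds=1662637747, nanos=793982000),
    ),
]

latencies = {r.executor: route_latency_ns(r) for r in routes}
print(latencies)  # {'gateway': 3856000, 'executor0': 3516000}
```

Here the gateway's span (about 3.9 ms) encloses the Executor's span (about 3.5 ms).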
docs: The DocList passed between and returned by the Executors. These are the Documents usually processed in a callback function, and are often the main payload.
from pprint import pprint
from jina import Client
Client().post(on='/', on_done=lambda x: pprint(x.docs))
<DocList (length=0)>
Accordingly, a callback that processes Documents can be defined as:
from jina.types.request.data import DataRequest


def my_callback(resp: DataRequest):
    # foo is a placeholder for your own processing function
    foo(resp.docs)
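Because a callback is invoked once per response, one way to gather results across a whole stream is a callable object that accumulates them. The sketch below is duck-typed: it only assumes each response has an iterable `docs` attribute. `DocCollector` is a hypothetical name, and strings stand in for real Documents.

```python
from types import SimpleNamespace


class DocCollector:
    """Callable that gathers the docs of every response it receives."""

    def __init__(self):
        self.docs = []

    def __call__(self, resp):
        # resp.docs is assumed to be iterable, like the DocList shown above
        self.docs.extend(resp.docs)


collector = DocCollector()
# Simulate two streamed responses with stand-in objects
for batch in (['doc-a', 'doc-b'], ['doc-c']):
    collector(SimpleNamespace(docs=batch))

print(collector.docs)  # ['doc-a', 'doc-b', 'doc-c']
```

An instance could then be passed as `on_done=collector`, and `collector.docs` inspected after the call returns.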
Handle exceptions in callbacks
Server errors can be caught by the Client's on_error callback function. You can get the error message and traceback from header.status:
from pprint import pprint
from jina import Flow, Client, Executor, requests
class MyExec1(Executor):
    @requests
    def foo(self, **kwargs):
        raise NotImplementedError


with Flow(port=12345).add(uses=MyExec1) as f:
    c = Client(port=f.port)
    c.post(on='/', on_error=lambda x: pprint(x.header.status))
code: ERROR
description: "NotImplementedError()"
exception {
name: "NotImplementedError"
stacks: "Traceback (most recent call last):\n"
stacks: " File \"/Users/hanxiao/Documents/jina/jina/serve/runtimes/worker/__init__.py\", line 181, in process_data\n result = await self._data_request_handler.handle(requests=requests)\n"
stacks: " File \"/Users/hanxiao/Documents/jina/jina/serve/runtimes/request_handlers/data_request_handler.py\", line 152, in handle\n return_data = await self._executor.__acall__(\n"
stacks: " File \"/Users/hanxiao/Documents/jina/jina/serve/executors/__init__.py\", line 301, in __acall__\n return await self.__acall_endpoint__(__default_endpoint__, **kwargs)\n"
stacks: " File \"/Users/hanxiao/Documents/jina/jina/serve/executors/__init__.py\", line 322, in __acall_endpoint__\n return func(self, **kwargs)\n"
stacks: " File \"/Users/hanxiao/Documents/jina/jina/serve/executors/decorators.py\", line 213, in arg_wrapper\n return fn(executor_instance, *args, **kwargs)\n"
stacks: " File \"/Users/hanxiao/Documents/jina/toy44.py\", line 10, in foo\n raise NotImplementedError\n"
stacks: "NotImplementedError\n"
executor: "MyExec1"
}
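The status fields printed above can be condensed into a single log-friendly string. The sketch below assumes the attribute names match the printed output (`description`, `exception.name`, `exception.executor`, `exception.stacks`); `summarize_error` is a hypothetical helper, and a `SimpleNamespace` with a shortened traceback stands in for a real response.

```python
from types import SimpleNamespace as NS


def summarize_error(resp) -> str:
    """Build a one-line summary followed by the joined traceback."""
    status = resp.header.status
    exc = status.exception
    # stacks is a sequence of traceback lines, each already ending in a newline
    traceback_text = ''.join(exc.stacks)
    return f'{exc.name} in {exc.executor}: {status.description}\n{traceback_text}'


# Stand-in response mirroring the structure of the output above
fake_resp = NS(
    header=NS(
        status=NS(
            description='NotImplementedError()',
            exception=NS(
                name='NotImplementedError',
                executor='MyExec1',
                stacks=[
                    'Traceback (most recent call last):\n',
                    'NotImplementedError\n',
                ],
            ),
        )
    )
)

summary = summarize_error(fake_resp)
print(summary)
```

Such a helper could be wired in as `on_error=lambda resp: print(summarize_error(resp))`.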
In the example below, the Flow passes the message through and prints the result on success. If something goes wrong, it beeps. In either case, the result is written to output.txt.
from jina import Flow, Client
from docarray import BaseDoc
def beep(*args):
    # make a beep sound
    import sys

    sys.stdout.write('\a')


with Flow().add() as f, open('output.txt', 'w') as fp:
    client = Client(port=f.port)
    client.post(
        '/',
        BaseDoc(),
        on_done=print,
        on_error=beep,
        on_always=lambda x: x.docs.save(fp),
    )
What errors can be handled by the callback?
Callbacks can handle errors that are caused by Executors raising an Exception.
A callback will not receive exceptions that arise:
from connectivity errors between the Gateway and the Executors.
from errors between the Client and the Gateway.
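Since failures outside the stream bypass the callbacks, they have to be handled around the call itself. The sketch below assumes such failures surface as ordinary Python exceptions raised by `post()`; the exact exception type is an assumption, and `post_safely` and `UnreachableClient` are hypothetical names, with the latter standing in for a Client whose Gateway cannot be reached.

```python
def post_safely(client, *args, **kwargs):
    """Call client.post and return (result, error) instead of raising."""
    try:
        return client.post(*args, **kwargs), None
    except Exception as exc:  # assumption: out-of-stream failures raise here
        return None, exc


class UnreachableClient:
    """Hypothetical stand-in for a Client whose Gateway cannot be reached."""

    def post(self, *args, **kwargs):
        raise ConnectionError('gateway unreachable')


errors_seen = []
result, failure = post_safely(
    UnreachableClient(), '/', on_error=errors_seen.append
)
print(result, repr(failure), errors_seen)
```

The `on_error` callback is never invoked in this case, so the failure has to be picked up from the wrapper.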