What is Intel® Edge Insights System?
  • Introduction and overview for Intel® Edge Insights System Architecture
  • Overview
  • High Level Architecture
  • Getting Started
    • Minimum System Requirements
    • Install Intel® Edge Insights System
      • Step 1: Prepare the Target System
      • Step 2: Download and Copy the Intel® Edge Insights System.zip File to the Target System
      • Step 3: Extract the Intel® Edge Insights System Software
      • Step 4: Configure your Installation
      • Step 5: Install the Intel® Edge Insights System Software
      • Step 6: Verifying the Intel® Edge Insights System by checking the visualization
        • How to use Visualizer’s Grafana
        • Uninstall the Packages
        • Troubleshooting
          • Docker Image Pull Issue
          • Advanced Guide
            • Adding New Services to Intel® Edge Insights System Stack
            • Running With Multiple Use Cases
            • Video Analytics Pipeline
            • Timeseries Analytics Pipeline
            • List of Intel® Edge Insights System Services
              • Config Manager Agent
                • ETCD UI
                • Steps to Add Self Signed Certificate as Trusted in local Browser
                • Edge Video Analytics Microservice
                  • Configuration
                  • Interfaces
                  • MQTT publishing
                  • Object Tracking
                  • Camera Configurations
                  • GenICam GigE or USB3 Cameras
                  • Xiris Cameras
                  • RTSP Cameras
                  • USB v4l2 Cameras
                  • Anomalib
                  • Person Detection
                  • Add Label
                  • Compatibility with other microservices
                  • Data Store Microservice
                    • DataStore Configuration
                    • JSON Datatype (InfluxDB)
                    • Blob Datatype (MinIO Object Storage)
                    • Intel® Edge Insights System Msgbus Interface
                    • Intel® Edge Insights System Msgbus/REST Request-Response Interface
                    • DB Server Supported Version
                    • Intel® Edge Data Collection
                      • Features and Modes
                      • Configuration Changes and Deployment behavior
                      • Human Annotation Mode
                      • Auto Annotation Mode
                      • Data Filter
                      • Remote Storage
                      • Configuring other options
                      • Mandatory Metadata Fields & Compatible UDFs
                      • Viewing logs
                      • Annotation Filtering
                      • Status Notification
                      • Behavior With and Without other Edge Insights System services
                      • Useful statistics to consider before running Edge Data Collection
                      • Unit tests
                      • Note for developers
                      • Additional Info
                      • License
                      • Troubleshooting
                      • Visualization Microservice Overview
                        • Visualizer Streaming Service
                        • Grafana Configuration
                        • Run Grafana in the PROD Mode
                        • Run Grafana for a Video Use Case
                        • Metadata Configuration
                        • Modifying EII Mode with Various Configs
                        • Available Configuration Files
                        • Enabling New Configuration File
                        • Example Metadata for Object Detection
                        • Metadata Schema
                        • Object Detection (GeTI) Metadata
                        • Classification
                        • None (Passthrough)
                        • Telegraf Microservice
                          • Telegraf’s Default Configuration
                          • Volume Mounting of Telegraf Configuration Files
                          • MQTT Sample Configuration and the Testing Tool
                          • Kapacitor Microservice
                            • Introduction to the Point-Data Analytics (Time Series Data)
                            • Purpose of the Kapacitor
                            • Custom UDFs available in the UDF( [WORK_DIR]/IEdgeInsights/Kapacitor/udfs ) Directory
                            • Time Series Python UDFs Development
                            • Adding New Services to Intel® Edge Insights System Stack

                              This section provides information about adding a service, subscribing to the EdgeVideoAnalyticsMicroservice( [WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice ), and publishing it on a new port. Add a service to the Intel® Edge Insights System stack as a new directory in the IEdgeInsights( [WORK_DIR]/IEdgeInsights/ ) directory. The Builder registers and runs any service present in its own directory in the IEdgeInsights( [WORK_DIR]/IEdgeInsights/ ) directory. The directory should contain the following:

• A docker-compose.yml file to deploy the service as a docker container. The AppName is present in the environment section of the docker-compose.yml file. The AppName is appended to config and interfaces as /AppName/config and /AppName/interfaces before they are added to the main build/eii_config.json .

                            • A config.json file that contains the required config for the service to run after it is deployed. The config.json consists of the following:

                            • A config section, which includes the configuration-related parameters that are required to run the application.

                            • An interfaces section, which includes the configuration of how the service interacts with other services of the Intel® Edge Insights System stack.

• For more information on adding new Intel® Edge Insights System services, refer to the Intel® Edge Insights System sample apps: Samples written in C++, Python, and Golang using the Intel® Edge Insights System core libraries.

                              The following example shows:

                            • How to write the config.json for any new service

• How to subscribe to EdgeVideoAnalyticsMicroservice

• How to publish on a new port

                            • "config" : { "paramOne" : "Value" , "paramTwo" : [ 1 , 2 , 3 ], "paramThree" : 4000 , "paramFour" : true "interfaces" : { "Subscribers" : [ "Name" : "default" , "Type" : "zmq_tcp" , "EndPoint" : "127.0.0.1:65114" , "PublisherAppName" : "EdgeVideoAnalyticsMicroservice" , "Topics" : [ "edge_video_analytics_results" "Publishers" : [ "Name" : "default" , "Type" : "zmq_tcp" , "EndPoint" : "127.0.0.1:65113" , "Topics" : [ "publish_stream" "AllowedClients" : [ "ClientOne" , "ClientTwo" , "ClientThree"

                              The config.json file consists of the following key and values:

                            • Value of the config key is the config required by the service to run.

                            • Value of the interfaces key is the config required by the service to interact with other services of Intel® Edge Insights System stack over the Message Bus.

• The Subscribers value in the interfaces section denotes that this service acts as a subscriber to the stream published by the service named in PublisherAppName, on the endpoint given in EndPoint, for the topics listed in Topics.

• The Publishers value in the interfaces section denotes that this service publishes a stream of data after obtaining and processing it from EdgeVideoAnalyticsMicroservice. The stream is published on the endpoint given in EndPoint, on the topics listed in Topics.

                            • The services mentioned in the value of AllowedClients are the only clients that can subscribe to the published stream, if it is published securely over the Message Bus.

• Like the Publishers and Subscribers interface keys, Intel® Edge Insights System services can also have Servers and Clients interface keys.

                            • For more information on the interfaces key responsible for the Message Bus endpoint configuration, refer to README .
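To make the expected layout concrete, the following stand-alone Python sketch checks that a new service's config.json carries the config and interfaces sections and the Subscriber/Publisher keys described above. It is an illustrative helper only (not part of the Intel® Edge Insights System tooling), and the required key sets are taken from this example rather than from an official schema.

    import json
    import sys

    REQUIRED_SUB_KEYS = {"Name", "Type", "EndPoint", "PublisherAppName", "Topics"}
    REQUIRED_PUB_KEYS = {"Name", "Type", "EndPoint", "Topics"}

    def check_config(path):
        """Sanity-check the config.json layout described in this section."""
        with open(path, encoding="utf-8") as f:
            cfg = json.load(f)

        # Both top-level sections are expected for a new service.
        for section in ("config", "interfaces"):
            if section not in cfg:
                sys.exit(f"missing top-level '{section}' section")

        interfaces = cfg["interfaces"]
        for sub in interfaces.get("Subscribers", []):
            missing = REQUIRED_SUB_KEYS - sub.keys()
            if missing:
                sys.exit(f"Subscriber entry missing keys: {sorted(missing)}")
        for pub in interfaces.get("Publishers", []):
            missing = REQUIRED_PUB_KEYS - pub.keys()
            if missing:
                sys.exit(f"Publisher entry missing keys: {sorted(missing)}")
        print("config.json layout looks consistent")

    if __name__ == "__main__":
        check_config(sys.argv[1] if len(sys.argv) > 1 else "config.json")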

                            • For the etcd secrets configuration, in the new Intel® Edge Insights System service or app docker-compose.yml file, add the following volume mounts with the right AppName env value:

volumes:
  - ./Certificates/[AppName]:/run/secrets/[AppName]:ro
  - ./Certificates/rootca/cacert.pem:/run/secrets/rootca/cacert.pem:ro

                              Running With Multiple Use Cases

Builder uses a yml file for configuration. The config yml file consists of a list of services to include. You can specify the service name as a path relative to IEdgeInsights or as the full path to the service in the config yml file. To include only a subset of services in the Intel® Edge Insights System stack, use the -f or yml_file flag of builder.py. You can find examples of yml files for different use cases as follows:

                            • Azure( [WORK_DIR]/IEdgeInsights/build/usecases/video-streaming-azure.yml )

                              The following example shows running Builder with the -f flag:

                              python3 builder.py -f usecases/video-streaming.yml
                              
• Main Use Cases (use case and the corresponding yaml file):

  • Video + Time Series: build/usecases/video-timeseries.yml ([WORK_DIR]/IEdgeInsights/build/usecases/video-timeseries.yml)

  • Video: build/usecases/video.yml ([WORK_DIR]/IEdgeInsights/build/usecases/video.yml)

  • Time Series: build/usecases/time-series.yml ([WORK_DIR]/IEdgeInsights/build/usecases/time-series.yml)

  • Video streaming with EVAM: build/usecases/video-streaming-evam.yml ([WORK_DIR]/IEdgeInsights/build/usecases/video-streaming-evam.yml)

  • Video streaming and historical: build/usecases/video-streaming-evam-datastore.yml ([WORK_DIR]/IEdgeInsights/build/usecases/video-streaming-evam-datastore.yml)

  • Video streaming with DataCollection: build/usecases/video-streaming-dcaas-evam-datastore.yml ([WORK_DIR]/IEdgeInsights/build/usecases/video-streaming-dcaas-evam-datastore.yml)

  • Video streaming with ModelRegistry: build/usecases/evam-datastore-model-registry.yml ([WORK_DIR]/IEdgeInsights/build/usecases/evam-datastore-model-registry.yml)

                              When you run the multi-instance config, a build/multi_instance directory is created in the build directory. Based on the number of video_pipeline_instances specified, that many directories of EdgeVideoAnalyticsMicroservice are created in the build/multi_instance directory.

If required, you can generate the multi-instance docker-compose.yml and config.json files using the Builder. Use the -v or video_pipeline_instances flag of the Builder to generate the boilerplate config for multi-stream use cases; it creates the multi-stream boilerplate config for the docker-compose.yml and eii_config.json files.

The following example shows running the Builder to generate the multi-instance boilerplate config for 3 streams of the video-streaming use case:

                              python3 builder.py -v 3 -f usecases/video-streaming-evam.yml
                              

                              Using the previous command for 3 instances, the build/multi_instance directory consists of the following directories

                            • EdgeVideoAnalyticsMicroservice1

                            • EdgeVideoAnalyticsMicroservice2

                            • EdgeVideoAnalyticsMicroservice3

                            • Initially each directory will have the default config.json and the docker-compose.yml files that are present within the EdgeVideoAnalyticsMicroservice/eii directory.

                                    ./build/multi_instance/
                                    |-- EdgeVideoAnalyticsMicroservice1
                                    |   |-- config.json
                                    |   `-- docker-compose.yml
                                    |-- EdgeVideoAnalyticsMicroservice2
                                    |   |-- config.json
                                    |   `-- docker-compose.yml
                                    |-- EdgeVideoAnalyticsMicroservice3
                                    |   |-- config.json
                                    |   `-- docker-compose.yml
You can edit the config of each of these streams within the build/multi_instance directory. To generate the consolidated docker-compose.yml and eii_config.json files, rerun the builder.py command.
                              
• The multi-instance feature of the Builder works only for the video pipeline use cases, that is, usecases/video-streaming.yml and video-streaming-evam.yml, and not with other use case yml files such as usecases/video-streaming-storage.yml. It also does not work without the -f switch. The previous example works with any positive number for -v.

                            • If you are running the multi-instance config for the first time, it is recommended not to change the default config.json file and the docker-compose.yml file in the EdgeVideoAnalyticsMicroservice/eii directory.

                            • If you are not running the multi-instance config for the first time, the existing config.json and docker-compose.yml files in the build/multi_instance directory will be used to generate the consolidated eii-config.json and docker-compose files. If you want to use config.json and docker-compose.yml files from EdgeVideoAnalyticsMicroservice/eii directory then delete the build/multi_instance directory.

                            • The docker-compose.yml files present within the build/multi_instance directory will have the following:

                            • the updated service_name, container_name, hostname, AppName, ports and secrets for that respective instance.

                            • The config.json file in the build/multi_instance directory will have the following:

                            • the updated Name, Type, Topics, Endpoint, PublisherAppname, ServerAppName, and AllowedClients for the interfaces section.

                            • the incremented RTSP port number for the config section of that respective instance.

• Ensure that all containers are down before running the multi-instance configuration. Run the docker compose down -v command before running the builder.py script for the multi-instance configuration.
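As a rough mental model of what the -v flag produces, the following Python sketch copies the default config.json and docker-compose.yml into per-instance directories and suffixes the instance number. It is a simplified illustration only; the paths follow this section, the rtsp_port key is a hypothetical example, and the real builder.py additionally rewrites service names, AppName, ports, interfaces, and secrets as described above.

    import json
    import shutil
    from pathlib import Path

    BASE = Path("EdgeVideoAnalyticsMicroservice/eii")   # default config/compose location
    OUT = Path("build/multi_instance")

    def generate_instances(count):
        for i in range(1, count + 1):
            inst_dir = OUT / f"EdgeVideoAnalyticsMicroservice{i}"
            inst_dir.mkdir(parents=True, exist_ok=True)

            # Every instance starts from the default files.
            shutil.copy(BASE / "docker-compose.yml", inst_dir / "docker-compose.yml")
            cfg = json.loads((BASE / "config.json").read_text())

            # Hypothetical per-instance tweak: increment an RTSP port if one exists.
            rtsp_port = cfg.get("config", {}).get("rtsp_port")
            if isinstance(rtsp_port, int):
                cfg["config"]["rtsp_port"] = rtsp_port + (i - 1)

            (inst_dir / "config.json").write_text(json.dumps(cfg, indent=4))

    if __name__ == "__main__":
        generate_instances(3)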

                            • List of Intel® Edge Insights System Services

Based on your requirements, you can include or exclude the following Intel® Edge Insights System services in the [WORKDIR]/IEdgeInsights/build/docker-compose.yml file:

                              Config Manager Agent

Config Manager Agent is an Intel® Edge Insights System service that is responsible for the following:

• Puts the EII services' configs into the Intel® Edge Insights System config manager data store.

• Additionally, in PROD mode, generates the following:

• The config manager data store keys/certificates required to interact with the EII config manager data store (such as etcd), and puts them in volume mounts to be shared with other Intel® Edge Insights System services.

• The messagebus keys required for Intel® Edge Insights System services communication.

• Creates the required provisioning folders with the right permissions needed by other Intel® Edge Insights System services via volume mounts.

Note: The Intel® Edge Insights System certificates are generated when the ConfigMgrAgent service is brought up for the first time. By default, the certificates, configs, and secrets (etcd certs, ZMQ keys, PEM/DER certs) will not be regenerated upon container restarts; to force this recreation, delete the docker volume by running either docker compose down -v or docker volume rm edgeinsightssoftware_vol_cma_status_file.

                              docker exec -it ia_configmgr_agent ./scripts/etcd_capture.sh
                              

                              ETCD UI

The etcd web UI is exposed via the ConfigMgrAgent service. It can be accessed with the following steps, and configuration changes can be made for the respective EII container services through it.

• Open the browser and enter the address: https://$(HOST_IP):7071/etcdkeeper/ (when EII is running in secure mode). In this case, the CA cert has to be imported into the browser. For insecure mode, that is, DEV mode, it can be accessed at http://$(HOST_IP):7071/etcdkeeper/.

• Click the version in the title to select the etcd version. By default, the version is V3. The choice is remembered when the page is reopened.

                            • Right-click on the tree node to add or delete.

• For secure mode, authentication is required. The username and password need to be entered in the dialog box.

• The username is ‘root’ and the default password is located in the ETCDROOT_PASSWORD key under the environment section of docker-compose.yml.

                            • This service is accessed from a remote system at address: https://$(HOST_IP):7071 (when EII is running in secure mode). In this case, CA cert has to be imported in the browser. For insecure mode i.e. DEV mode, it is accessed at http://$(HOST_IP):7071

• If ETCDROOT_PASSWORD is changed, a consolidated docker-compose.yml must be generated using the builder script and EII must be provisioned again. Run the following commands:

cd [WORKDIR]/IEdgeInsights/build
python3 builder.py -f usecases/<usecase>.yml
docker compose up -d ia_configmgr_agent
                              
• The ETCD watch capability is enabled for the video and timeseries services. It auto-restarts microservices when their config/interface changes are made via the EtcdUI interface. Any changes made to these keys are reflected at runtime in EII.

• For changes made to any other keys, the EII stack needs to be restarted for them to take effect. Run the following commands in the working directory to build or restart EII:

                              cd [WORKDIR]/IEdgeInsights/build
                              docker compose down -v
                              docker compose up -d
                              
• Refer to the prerequisites for video accelerators and the prerequisites for cameras before changing the configuration dynamically through EtcdUI.

                            • Steps to Add Self Signed Certificate as Trusted in local Browser

• Go to the Build directory of the repo:

                            • cd <workdir>/IEdgeInsights/build/Certificates/ConfigMgrAgent/
                              
• Download the Root CA cert from the EtcdUI.

• Import the Root CA certificate as a Trusted Root Certificate in the browser. For the Chrome browser, follow these steps:

  i. Open Chrome settings, scroll to the bottom, and click Privacy and security.

  ii. Click Manage device certificates.

  iii. Click the Trusted Root Certification Authorities tab, then click the Import… button. This opens the Certificate Import Wizard. Click Next to get to the File to Import screen.

  iv. Click Browse…, under File Type select All Files, select the certificate file you saved earlier, and then click Next.

  v. Select Place all certificates in the following store. The selected store should be Trusted Root Certification Authorities. If it isn’t, click Browse… and select it. Click Next and Finish.

  vi. Click Yes on the security warning.

  vii. Restart Chrome.

                            • Edge Video Analytics Microservice

                              The Edge Video Analytics Microservice (EVAM) combines video ingestion and analytics capabilities provided by Intel® Edge Insights System visual ingestion and analytics modules. This directory provides the Intel® Deep Learning Streamer (Intel® DL Streamer) pipelines to perform object detection on an input URI source and send the ingested frames and inference results using the MsgBus Publisher. It also provides a Docker compose and config file to use EVAM with the Edge insights software stack.

                              Configuration

Refer to the edge-video-analytics-microservice/eii/config.json ([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file for the configuration of EVAM. The default configuration starts the object_detection demo for Intel® Edge Insights System.

The config file is divided into two sections: config and interfaces. The following parameters are part of the config section:

• cert_type: Type of Intel® Edge Insights System certs to be created. This should be "zmq" or "pem".

• source: Source of the frames. This should be "gstreamer" or "msgbus".

• pipeline: The name of the DL Streamer pipeline to use. This should correspond to a directory in the pipelines directory.

• pipeline_version: The version of the pipeline to use. This is typically a subdirectory of a pipeline in the pipelines directory.

• Additional information to store with frame metadata, for example, camera location/orientation of the video input.

• publish_raw_frame: Boolean flag for whether to publish the raw frame.

• encoding: Encodes the image in jpeg or png format.

• mqtt_publisher: Publishes frame/metadata to the MQTT broker.

• convert_metadata_to_dcaas_format: Converts inference results to the Intel Edge Data Collection standardized format.

                            • For jpeg encoding type, level is the quality from 0 to 100. A higher value means better quality.

• For png encoding type, level is the compression level from 0 to 9. A higher value means a smaller size and longer compression time. (A short encoding sketch after this list illustrates both level ranges.)

• Encoding elements can be used in the pipeline as an alternative to specifying the encoding parameters. Refer to the following pipeline for using jpegenc in config.json:

  "pipeline": "multifilesrc loop=FALSE stop-index=0 location=/home/pipeline-server/resources/pcb_d2000.avi name=source ! h264parse ! decodebin ! videoconvert ! video/x-raw,format=BGR ! udfloader name=udfloader ! jpegenc ! appsink name=destination",

• Refer to the following pipeline for using pngenc in config.json:

  "pipeline": "multifilesrc loop=FALSE stop-index=0 location=/home/pipeline-server/resources/pcb_d2000.avi name=source ! h264parse ! decodebin ! videoconvert ! video/x-raw,format=BGR ! udfloader name=udfloader ! videoconvert ! pngenc ! appsink name=destination",

• convert_metadata_to_dcaas_format, when set to true in config.json, converts the metadata to the Intel Edge Data Collection compatible format. Currently, this has been tested for the gvadetect element used in the pipeline. Refer to the following pipeline for an example:

  "pipeline": "multifilesrc loop=TRUE stop-index=0 location=/home/pipeline-server/resources/classroom.avi name=source ! h264parse ! decodebin ! queue max-size-buffers=10 ! videoconvert ! video/x-raw,format=BGR ! gvadetect model=/home/pipeline-server/models/object_detection/person/FP32/person-detection-retail-0013.xml model-proc=/home/pipeline-server/models/object_detection/person/person-detection-retail-0013.json ! jpegenc ! appsink name=destination",
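The jpeg/png level semantics described earlier in this list can be tried out independently of the GStreamer pipeline. The short Python sketch below uses OpenCV's imencode on a synthetic frame; it assumes the opencv-python and numpy packages and is only meant to illustrate how the two level ranges behave.

    import cv2
    import numpy as np

    frame = np.random.randint(0, 255, (480, 640, 3), dtype=np.uint8)

    # JPEG: level is a quality value from 0 to 100 (higher = better quality).
    ok_jpg, jpg = cv2.imencode(".jpg", frame, [cv2.IMWRITE_JPEG_QUALITY, 90])

    # PNG: level is a compression level from 0 to 9 (higher = smaller, slower).
    ok_png, png = cv2.imencode(".png", frame, [cv2.IMWRITE_PNG_COMPRESSION, 9])

    print(f"jpeg bytes: {len(jpg)}, png bytes: {len(png)}")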

                              Interfaces

Currently, in the Intel® Edge Insights System mode, EVAM supports launching a single pipeline and publishing on a single topic. This implies that in the configuration file (config.json), the single JSON object in the Publishers list is where the configuration resides for the published data. For more details on the structure, refer to the documentation.

                              EVAM also supports subscribing and publishing messages or frames using the Message Bus. The endpoint details for the Intel® Edge Insights System service you need to subscribe from are to be provided in the Subscribers section in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file and the endpoints where you need to publish to are to be provided in Publishers section in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file.

                              To enable injection of frames into the GStreamer pipeline obtained from Message Bus, ensure to make the following changes:

                            • The source parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file is set to msgbus. Refer to the following code snippet:

                              "config": {
                                  "source": "msgbus"
                              
                            • The pipeline is set to appsrc as source instead of uridecodebin. Refer to the following code snippet:

                              "pipeline": "appsrc name=source ! rawvideoparse ! appsink name=destination"

                              MQTT publishing

Prior to publishing, the MQTT broker/subscriber needs to be configured and started. The broker receives messages from the EVAM publisher and forwards them to subscribers. A subscriber receives messages from the broker by subscribing to a topic. Refer to [mosquitto](https://mosquitto.org/man/mosquitto-8.html) and [mosquitto_sub](https://mosquitto.org/man/mosquitto_sub-1.html) for starting the broker/subscriber.
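As a quick sanity check of the broker setup, a minimal Python subscriber can be used in place of mosquitto_sub. The sketch below assumes the paho-mqtt package (1.x callback style); the broker address and the default topic edge_video_analytics_results must match the mqtt_publisher configuration described below.

    import paho.mqtt.client as mqtt

    BROKER_HOST = "localhost"               # <mqtt broker address>
    BROKER_PORT = 1883
    TOPIC = "edge_video_analytics_results"  # default topic

    def on_connect(client, userdata, flags, rc):
        print(f"connected to broker with result code {rc}")
        client.subscribe(TOPIC)

    def on_message(client, userdata, msg):
        # Metadata arrives as JSON; frames (if enabled) arrive as payload bytes.
        print(f"{msg.topic}: {len(msg.payload)} bytes")

    client = mqtt.Client()
    client.on_connect = on_connect
    client.on_message = on_message
    client.connect(BROKER_HOST, BROKER_PORT)
    client.loop_forever()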

• Add the below configuration in IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json to enable publishing to the MQTT broker:

  "mqtt_publisher": {
      "host": "<mqtt broker address>",
      "port": 1883
  }
                              
• host: MQTT broker hostname or IP address.

• port: port to connect to the broker.

• topic: topic to which the message will be published. Defaults to edge_video_analytics_results (optional).

• publish_frame: whether to publish only metadata or both frames and metadata to the MQTT broker. Defaults to false (optional).
  • When publish_frame is false, only metadata will be published.

  • When publish_frame is true, both metadata and frame will be published.

• qos: quality of service level to use; defaults to 0. Values can be 0, 1, or 2 (optional).

  • More details on the QoS levels can be found [here](https://www.hivemq.com/blog/mqtt-essentials-part-6-mqtt-quality-of-service-levels)

• protocol: protocol version to use; defaults to 4, that is, MQTTv311. Values can be 3, 4, or 5, corresponding to MQTTv3, MQTTv311, and MQTTv5 respectively (optional).

                              • MQTT publishing to broker could be over a secure communication channel providing encryption and authentication over SSL/TLS. MQTT broker should be configured for SSL/TLS connection. More details on the broker configuration options can be found [here](https://mosquitto.org/man/mosquitto-conf-5.html) and the files required for SSL/TLS support are specified [here](https://mosquitto.org/man/mosquitto-tls-7.html).

• Add the below configuration in IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json to enable secure publishing to the MQTT broker:

  "mqtt_publisher": {
      "host": "<mqtt broker address>",
      "port": 8883,
      "tls": {
          "ca_cert": "/MqttCerts/ca.crt",
          "client_key": "/MqttCerts/client/client.key",
          "client_cert": "/MqttCerts/client/client.crt"
      }
  }
                                
• Refer to the MQTT docs for details on prerequisites, configuration, filtering, and error handling.

                              • MQTT publishing can be enabled along with Intel® Edge Insights System Message Bus publishing.

                              • Object Tracking

For enabling object tracking in the pipeline, refer to object_tracking.

                                Camera Configurations

                                You need to make changes to the config.json([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) and the templates section of the pipeline.json([WORK_DIR]/IEdgeInsights/pipeline.json) files while configuring cameras. By default the pipeline.json([WORK_DIR]/IEdgeInsights/pipeline.json) file has the RTSP camera configurations. The camera configurations for the Edge Video Analytics Microservice module are as follows:

                                GenICam GigE or USB3 Cameras

                                Prerequisites for Working with the GenICam Compliant Cameras

The following are the prerequisites for working with the GenICam compliant cameras.

                              • For GenICam compliant GigE cameras:

                              • Update the ETCD_HOST key with the current system’s IP in the .env([WORK_DIR]/IEdgeInsights/build/.env) file.

                                ETCD_HOST=<HOST_IP>
                                

                                Add network_mode: host in the docker-compose.yml([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/docker-compose.yml) file and comment/remove networks and ports sections.

                                Make the following changes in the docker-compose.yml([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/docker-compose.yml) file.

ia_edge_video_analytics_microservice:
  # Add network mode host
  network_mode: host
  environment:
    # Make sure only environment variables are added under the environment section in the docker compose file.
    # Add HOST_IP to no_proxy and ETCD_HOST
    no_proxy: "<eii_no_proxy>,${RTSP_CAMERA_IP},<HOST_IP>"
    ETCD_HOST: ${ETCD_HOST}
  # Comment out the networks section; it will throw an error when network_mode host is used.
  # networks:
    # - eii
  # Comment out the ports section as follows:
  # ports:
  #   - '65114:65114'
                                
                              • For GenICam compliant USB3.0 cameras:

                              • Increase the USBFS buffer size to at least 256 MB or even more depending on your application.

                                By default, USB-FS on Linux system allows only 16MB buffer limit which will not be sufficient to work with high framerate/resolution cameras and multi-camera setup. In such scenarios increase USB-FS buffer size. Check USB-FS buffer size by using command cat /sys/module/usbcore/parameters/usbfs_memory_mb. If it is less than 256MB, then follow these [steps to increase the USBFS value](https://assets.balluff.com/documents/DRF_957345_AA_000/Troubleshooting_section_Checklist_USB3_Settings.html#Troubleshooting_Checklist_USB3_IncreasingTheKernelMemory).
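The same check can be scripted. The small Python sketch below reads the sysfs file mentioned above and flags a value below the 256 MB recommendation; it is a convenience helper only and does not change the limit.

    from pathlib import Path

    USBFS_PARAM = Path("/sys/module/usbcore/parameters/usbfs_memory_mb")
    MIN_MB = 256

    current = int(USBFS_PARAM.read_text().strip())
    print(f"usbfs_memory_mb = {current}")
    if current < MIN_MB:
        print("USB-FS buffer is below 256 MB; follow the linked steps to increase it.")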

Refer to the following pipeline for configuring the config.json ([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file for GenICam GigE/USB3.0 cameras.

                                "pipeline": "gencamsrc serial=<DEVICE_SERIAL_NUMBER> pixel-format=<PIXEL_FORMAT> name=source ! videoconvert ! video/x-raw,format=BGR ! appsink name=destination"
                                

                                After making the changes, run the command python3 builder.py -f <usecase> from [WORKDIR]/IEdgeInsights/build directory to generate the consolidated files.

                                For more information or configuration details for the GenICam GigE or the USB3 camera support, refer to the Generic Plugin docs.

For other cameras such as RTSP and USB (v4l2 driver compliant), revert the changes mentioned in this section.

                                Refer to the Basler docs for more information/configuration on Basler camera.

                              • Generic Plugin can work only with GenICam compliant cameras and only with gstreamer ingestor.

                              • The above gstreamer pipeline was tested with Basler and IDS GigE cameras.

                              • If serial is not provided, then the first connected camera in the device list will be used.

                              • If pixel-format is not provided then the default mono8 pixel format will be used.

                              • If width and height properties are not set then gencamsrc plugin will set the maximum resolution supported by the camera.

                              • Camera field of view getting cropped is an expected behavior when a lower resolution is set using height or width parameter. Setting these parameters would create an Image ROI which will originate from the top left corner of the sensor. Refer https://docs.baslerweb.com/image-roi for more details.

                              • Using a higher resolution might have other side effects like “lag issue in the pipeline” when the model is compute intensive.

• By default, the exposure-auto property is set to on. If the camera is not placed under sufficient light, then with auto exposure the exposure-time can be set to a very large value, which increases the time taken to grab a frame. This can lead to a No frame received error. Hence, it is recommended to set the exposure manually, as in the following sample pipeline, when the camera is not placed under good lighting conditions.

                              • throughput-limit is the bandwidth limit for streaming out data from the camera(in bytes per second). Setting this property to a higher value might result in better FPS but make sure that the system and the application can handle the data load otherwise it might lead to memory bloat.

• Refer to the below example pipeline to use the above-mentioned properties:

  "pipeline": "gencamsrc serial=<DEVICE_SERIAL_NUMBER> pixel-format=ycbcr422_8 width=1920 height=1080 exposure-time=5000 exposure-mode=timed exposure-auto=off throughput-limit=300000000 name=source ! videoconvert ! video/x-raw,format=BGR ! appsink name=destination"

                              • By default, USB-FS on Linux system allows only 16MB buffer limit which might not be sufficient to work with high framerate, high resolution cameras and multiple camera setup. In such scenarios configure USB-FS to increase the buffer memory limit for USB3 vision camera. While using the basler USB3.0 camera, ensure that the USBFS limit is set to 1000MB. You can verify this value by using command cat /sys/module/usbcore/parameters/usbfs_memory_mb. If it is less than 256MB, then follow these [steps to increase the USBFS value](https://assets.balluff.com/documents/DRF_957345_AA_000/Troubleshooting_section_Checklist_USB3_Settings.html#Troubleshooting_Checklist_USB3_IncreasingTheKernelMemory).

                              • If the GenICam cameras do not get initialized during the runtime, then on the host system, run the docker system prune command. After that, remove the GenICam specific semaphore files from the /dev/shm/ path of the host system. The docker system prune command will remove all the stopped containers, networks that are not used (by at least one container), any dangling images, and build cache which could prevent the plugin from accessing the device.

                              • Known Limitation:

                              • If one observes Feature not writable message while working with the GenICam cameras, then reset the device using the camera software or using the reset property of the Generic Plugin. For more information, refer Gencamsrc docs.

                              • Prerequisites for Working with Xiris Camera

                                The following are the prerequisites for working with Xiris cameras.

                                Update the ETCD_HOST key with the current system’s IP in the .env([WORK_DIR]/IEdgeInsights/build/.env) file.

                                ETCD_HOST=<HOST_IP>
                                

                                Add network_mode: host in the docker-compose.yml([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/docker-compose.yml) file and comment/remove networks and ports sections. Make the following changes in the docker-compose.yml([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/docker-compose.yml) file.

ia_edge_video_analytics_microservice:
  # Add network mode host
  network_mode: host
  # Make sure that the lines above are not added under the environment section, and take care of the indentation in the compose file.
  environment:
    # Add HOST_IP to no_proxy and ETCD_HOST
    no_proxy: "<eii_no_proxy>,${RTSP_CAMERA_IP},<HOST_IP>"
    ETCD_HOST: ${ETCD_HOST}
  # Comment out the networks section; it will throw an error when network_mode host is used.
  # networks:
    # - eii
  # Comment out the ports section as follows:
  # ports:
  #   - '65114:65114'
                                
                              • The ip_address parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file must be set to IP address of the camera.

                              • The frame_rate parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file must be set to the desired ingestion frame rate from the camera.

                              • The pixel_depth parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file must be set to the required pixel depth (in bits). It can take one of the following four values: 8, 12, 14 or 16. Note that the pixel_depth parameter has no bearing on the monochrome camera XVC-1000 that has been tested.

                              • The flip_mode parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file must be set accordingly. It describes how an image is flipped. It can take one of the following four values: “None”, “FlipVertical”, “FlipHorizontal” or “FlipBoth”.

                              • None: No flip performed.

                              • FlipVertical: Flip the image vertically.

                              • FlipHorizontal: Flip the image horizontally.

                              • FlipBoth: Flip both horizontally and vertically. Equivalent to 180 degree rotation.

                              • The set_sharpen parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) sets a value which controls if a sharpening filter is applied to the image or not. This parameter can take either “true” or “false” values.

                              • The focus parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file sets a value indicating the commanded position of the focus adjustment.

                              • The tone_map_curve_type parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file sets the shape of the tonemapping curve. Tone mapping is the process of building a displayable image from an HDR image. The type of curve specified controls the distribution of the contrast in the displayable image, and in some cases can be used to adjust the color balance of the image. The different values this parameter can take are:

                              • linear: means that the contrast is mapped evenly.

                              • gamma: accentuates the contrast in the darker areas using a y = pow(x, gamma) type of relationship.

• scurve (unsupported at the moment): uses a curve in the general form of y = 1 / (1 + pow((alpha * x) / (1 - x), -beta)).

                              • The tone_map_curve_value parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file sets a value to the above mentioned curve type.

                              • For linear curve, set this parameter value to 1

• For the gamma curve, it can take values in the range [-5, 5]. This reflects WeldStudio’s gamma slider range. Note: WeldStudio is the camera GUI software from Xiris.

                              • For scurve: unsupported at the moment

                              • Refer to the following code snippet:

                                "config": {
                                    "xiris": {
                                       "ip_address": "<set-xiris-camera-IP-here>",
                                       "frame_rate": 10,
                                       "pixel_depth": 8.
                                       "flip_mode": "FlipHorizontal",
                                       "set_sharpen": "false",
                                       "focus": 0,
                                       "tone_map_curve_type": "gamma",
                                       "tone_map_curve_value": "0"
                                
                              • The source parameter in the config([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file must be set to ingestor. Refer to the following code snippet:

                                "config": {
                                    "source": "ingestor"
                                
                              • The pipeline is set to appsrc as source and rawvideoparse element should be updated with the height, width and format of the Xiris frame. Refer to the following code snippet:

                                "pipeline": "appsrc name=source ! rawvideoparse height=1024 width=1280 format=gray8 ! videoconvert ! video/x-raw,format=BGR ! appsink name=destination"

• Xiris camera model tested is XVC-1000 (monochrome).

                              • Only PixelDepth=8 (camera outputs 8 bits per pixel) is supported. In case of any frame rendering issues please check PixelDepth value from the logs and make sure it is set to 8.

• If a wrong or invalid IP is provided for connecting to the Xiris camera using the XirisCameraIP env variable, ingestion will not work and no error logs will be printed. Make sure the correct IP is provided for ingestion to work.

• To find the IP address of the camera, use the GUI tool provided by Xiris (currently available on Windows) or run the LinuxSample app under the weldsdk installation directory on the host system to find the available cameras.

                              • RTSP Cameras

Update the RTSP camera IP or the simulated source IP to the RTSP_CAMERA_IP variable in the .env ([WORK_DIR]/IEdgeInsights/build/.env) file. Refer to the RTSP docs for information/configuration on the RTSP camera.

• Refer to the following configuration for configuring the config.json ([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file for the RTSP camera.

                              • "pipeline": "rtspsrc location=\"rtsp://<USERNAME>:<PASSWORD>@<RTSP_CAMERA_IP>:<PORT>/<FEED>\" latency=100 name=source ! rtph264depay ! h264parse ! vaapih264dec ! vaapipostproc format=bgrx ! videoconvert ! video/x-raw,format=BGR ! appsink name=destination"
                                
                              • The RTSP URI of the physical camera depends on how it is configured using the camera software. You can use VLC Network Stream to verify the RTSP URI to confirm the RTSP source.
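In addition to VLC, the RTSP URI can be checked programmatically. The following Python sketch (assuming the opencv-python package) simply tries to grab one frame from the URI built with the placeholders above.

    import cv2

    rtsp_uri = "rtsp://<USERNAME>:<PASSWORD>@<RTSP_CAMERA_IP>:<PORT>/<FEED>"

    cap = cv2.VideoCapture(rtsp_uri)
    ok, frame = cap.read()
    cap.release()

    if ok:
        print(f"RTSP source reachable, frame shape: {frame.shape}")
    else:
        print("Could not read a frame; re-check the RTSP URI and camera configuration.")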

                              • USB v4l2 Cameras

                                For information or configurations details on the USB cameras, refer to USB v4l2 docs.

• Refer to the following configuration for configuring the config.json ([WORK_DIR]/IEdgeInsights/EdgeVideoAnalyticsMicroservice/eii/config.json) file for the USB v4l2 camera.

                              • "pipeline": "v4l2src device=/dev/<DEVICE_VIDEO_NODE> name=source ! video/x-raw,format=YUY2 ! videoconvert ! video/x-raw,format=BGR ! appsink name=destination"
                                

                                Anomalib

                                Anomalib is a deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets. Anomalib provides several ready-to-use implementations of anomaly detection algorithms described in the recent literature, as well as a set of tools that facilitate the development and implementation of custom models.

                                For more information on anomalib, refer to Anomalib docs .

                                Supported version in EVAM
                              • v0.7.0

                              • Training

Anomalib includes ready-to-use anomaly detection models. A model can be trained based on the default config provided for the model, or it can be customized for a particular dataset and category.

More details on training and exporting a trained model can be found in the training section of this [document](https://github.com/openvinotoolkit/anomalib/tree/v0.7.0).

                                Supported models in EVAM

Currently, a subset of the anomaly detection models available in anomalib is supported in EVAM:

• [STFPM](https://github.com/openvinotoolkit/anomalib/blob/v0.7.0/src/anomalib/models/stfpm)

• [PADIM](https://github.com/openvinotoolkit/anomalib/blob/v0.7.0/src/anomalib/models/padim)

• [DFM](https://github.com/openvinotoolkit/anomalib/blob/v0.7.0/src/anomalib/models/dfm)

                                Inference using Anomalib UDF

When a trained/exported model is available, the UDF IEdgeInsights/EdgeVideoAnalyticsMicroservice/user_scripts/udfs/python/anomalib_udf/ can be used for running inference. As an example, a custom STFPM model trained on [Amazon’s Visual Anomaly (Visa) dataset](https://registry.opendata.aws/visa/) is included to detect anomalies in PCB.

Inferencing can be configured to be based on one of the following:

• openvino: the default openvino inferencer provided by Anomalib

• openvino_nomask: a custom openvino "no mask" inferencer derived from the openvino inferencer, which allows the image resolution to not have an impact on inference time


Refer to the below sample configuration to run the Anomalib UDF:

                                "udfs": [
                                              "device": "CPU",
                                              "task": "classification",
                                              "inferencer": "openvino_nomask",
                                              "model_metadata": "/home/pipeline-server/udfs/python/anomalib_udf/stfpm/metadata.json",
                                              "name": "python.anomalib_udf.inference",
                                              "type": "python",
                                              "weights": "/home/pipeline-server/udfs/python/anomalib_udf/stfpm/model.onnx"
                                
• The inferencer config parameter can be used to switch between the default openvino inferencer provided by anomalib and the openvino_nomask inferencer, which inherits from the openvino inferencer.

                              • Person Detection

This Geti UDF supports deploying a project for local inference with OpenVINO using the Intel® Geti™ SDK Python package. It uses a Geti-based person detection model.

Refer to the below config for the default config used for this Geti UDF:

                                "udfs": [
                                               "name": "python.geti_udf.geti_udf",
                                               "type": "python",
                                               "device": "CPU",
                                               "deployment": "./resources/geti/person_detection/deployment",
                                               "visualize": "false",
                                               "metadata_converter": "geti_to_dcaas"
                                

                                Add Label

This UDF is developed to facilitate anomalib training data collection, where one points a camera at an anomalous or non-anomalous product. While you point the camera at a given category of scene, the default label value, which is a configurable field (“anomalous”: “true” or “anomalous”: “false”) in this UDF, is applied to every frame. This process needs to be repeated for every class (“anomalous”: “true” or “anomalous”: “false”) that you want to capture from the scene for training.

                                The following example shows the configuration for add label UDF.

                                "udfs": [
                                        "type": "python",
                                        "name": "python.add_label",
                                        "anomalous": "true"
                                

There is no AI model involved. It is a simple UDF script that labels the data with the default label values and saves it in the DataStore, where it is further expected to be consumed by the Visualizer microservice and the Intel® Edge Data Collection microservice.
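For illustration, a UDF of this kind could look roughly like the sketch below. The class/method shape (a constructor receiving the configured fields and a process(frame, metadata) method returning a drop flag, an optional modified frame, and the metadata) is an assumption based on the common EII Python UDF convention; the actual python.add_label UDF in the repository is the authoritative reference.

    class Udf:
        """Minimal sketch of an add-label style UDF (no AI model involved)."""

        def __init__(self, anomalous):
            # "anomalous" comes from the UDF config shown above ("true"/"false").
            self.anomalous = anomalous

        def process(self, frame, metadata):
            # Stamp the configured label onto every frame's metadata.
            metadata["anomalous"] = self.anomalous
            # Keep the frame (do not drop it) and return the updated metadata.
            return False, None, metadata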

                                Compatibility with other microservices

                                Visualizer

                                In case you want to add your custom UDF and make it compatible with Visualizer, ensure that the following fields are present in the metadata published by the UDF. Any missing key here will result in an error.

                                Keys with sample values.

                                "task": "classification"
                                "detected_class": "anomalous"
                                "anomalous": predictions.pred_score
                                
                              • task: Refers to classification/object_detection model

                              • detected_class: Refers to anomalous/non-anomalous or the applicable class name

                              • anomalous: Refers to the confidence score for anomalous class or the applicable class

                              • Edge Data Collection

                                In case you want to add your custom UDF and make it compatible with Intel® Edge Data Collection microservice, ensure that the following fields are present in the metadata published by the UDF. Any missing key here will result in an error.

                                Keys with sample values.

                                "last_modified": time_ns()
                                "export_code": 0
                                "annotation_type": "auto"
                                "annotations": {"objects":[]}
                                
                              • last_modified: Mandatory. Refers to the last time the metadata was modified. The value should be in nanoseconds.

                              • export_code: Mandatory. Refers to the status of the data export. The following export codes are supported:

                              • 0: Data has not been exported

                              • 1: Data has been exported

                              • annotation_type: Mandatory. Refers to the type of annotation. The following annotation types are supported:

                              • auto: for auto annotation

                              • human: for human annotation

                              • annotations: Mandatory. Refers to the annotations for objects detection in the image.

                              • objects: A list of objects detected in the image. If present, each object should have the following keys:

                              • label: The label of the object

                              • score: The confidence score of the object

                              • bbox: The bounding box of the object. It is a list of bbox coordinates in top-left and bottom-right format. For example, [x1, y1, x2, y2]

                              • It is important that the objects within the annotations dictionary have bounding box dimensions in top-left and bottom-right format. Some models predict bounding boxes as top-left coordinates plus width and height; in that case, the UDF must convert them to the expected top-left and bottom-right format to avoid incorrect bounding box dimensions during data export or visualization. A sketch of this conversion is shown below.
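                                As a concrete illustration, a UDF whose model emits boxes as (x, y, width, height) could build the mandatory fields as sketched below; to_corner_format and model_boxes are hypothetical names used only for this example.

                                from time import time_ns

                                def to_corner_format(objects_xywh):
                                    """Convert (x, y, w, h) boxes to the expected top-left/bottom-right [x1, y1, x2, y2] format."""
                                    converted = []
                                    for obj in objects_xywh:
                                        x, y, w, h = obj["bbox"]
                                        converted.append({"label": obj["label"], "score": obj["score"],
                                                          "bbox": [x, y, x + w, y + h]})
                                    return converted

                                def add_mandatory_fields(metadata, model_boxes):
                                    """Populate the mandatory Edge Data Collection fields (illustrative only)."""
                                    metadata["last_modified"] = time_ns()
                                    metadata["export_code"] = 0
                                    metadata["annotation_type"] = "auto"
                                    metadata["annotations"] = {"objects": to_corner_format(model_boxes)}
                                    return metadata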

                                Data Store Microservice

                                The Data Store microservice supports Video and Time Series use cases. It supports two types of data:

                              • JSON Datatype (InfluxDB)

                              • Blob Datatype (MinIO Object Storage)

                              • DataStore Configuration

                                The configuration for the Data Store service is added in etcd. The configuration details are available in the docker-compose file, under AppName in the environment section of the app’s service definition.

                                For example, when the AppName is DataStore, the app’s config for the /DataStore/config key in etcd looks as follows:

                                "datatypes": {
                                  "json": {
                                       "host" : "ia_influxdb",
                                       "port": 8086,
                                       "dbname": "datain",
                                       "verifySsl": false,
                                       "ignoreKeys": [
                                         "defects"
                                       "tagKeys": [],
                                       "retention": "1h",
                                       "topics": [
                                       "retentionPollInterval": "60s"
                                  "blob": {
                                       "host" : "ia_miniodb",
                                       "port": 9000,
                                       "dbname": "image-store-bucket",
                                       "retention": "1h",
                                       "topics": [
                                         "edge_video_analytics_results"
                                       "retentionPollInterval": "60s",
                                       "ignoreKeys": [
                                         "model-registry"
                                

                                The following are the details of the keys in the above config:

                              • datatype (required)

                              • The host is an optional parameter used to connect to the respective database server (local/remote). If it is not provided, the JSON datatype defaults to ia_influxdb and the Blob datatype defaults to ia_miniodb.

                              • The port is an optional parameter used to connect to the respective database server's port (local/remote). If it is not provided, the JSON datatype defaults to 8086 for InfluxDB and the Blob datatype defaults to 9000 for MinIO object storage.

                              • The topics key determines which messages are to be processed by the corresponding DB microservice. Only the messages with a topic listed in the topics key are processed by the individual module. If topics contains *, then all messages are processed.

                              • The retention parameter is required. It specifies the retention policy to apply to the images stored in MinIO object storage. For infinite retention time, set it to "". Use a suitable duration string value as described at https://golang.org/pkg/time/#ParseDuration.

                              • The retentionPollInterval parameter is required. It sets the time interval for checking images for expiration. Expired images become candidates for deletion and are no longer retained. For infinite retention time, this attribute is ignored. Use a suitable duration string value as described at https://golang.org/pkg/time/#ParseDuration.

                              • The ignoreKeys parameter is a list of strings. For JSON, the parser uses this list to decide whether to parse a nested JSON key or save it as a string to the database. For BLOB, this list is used to exclude the respective buckets from the retention policy.

                              • restserver

                              • The Port key is used to configure the port exposed to access the REST APIs.

                              • Note: The same should be updated under the Port section in docker-compose.yml.

                              • The cacertname key is the filename of the PEM certificate.

                              • The caprivatekey key is the filename of the PEM key.

                              • To use different certificates for restserver, change the LHS of the volume mount path in docker-compose.yml for the line item ./Certificates/DataStore_Server/:/run/secrets/RestAPI_Server:ro, and update cacertname and caprivatekey with the respective file names in config.json.

                              • Refer to Tutorial 2 in REST API Endpoints. Swagger (API documentation) is available at /docs. To access Swagger after starting Data Store, open a browser and enter the URL https://<ip>:8889. If DEV_MODE is true, use http://<ip>:8889.

                              • Note: Due to a certificate issue, the Data Store Swagger documentation at https://localhost:8889 in PROD mode or http://localhost:8889 in DEV mode will not be accessible.

                              • By default, both DBs are enabled. If you want to disable either of them, remove the corresponding key and its value from the config.

                                For example, if you are not using MinIO object storage, you can disable it and modify the config as shown below:

                                "datatypes": {
                                  "json": {
                                       "host" : "ia_influxdb",
                                       "port": 8086,
                                       "dbname": "datain",
                                       "verifySsl": false,
                                       "ignoreKeys": [
                                         "defects"
                                       "tagKeys": [],
                                       "retention": "1h",
                                       "topics": [
                                       "retentionPollInterval": "60s"
                                

                                For backing up the JSON and Blob data, the following configuration is provided in config.json:

                                "storage": {
                                  "enabled": false,
                                  "type": "local",
                                  "autoSyncIntervalInSec": 3600,
                                  "blobIdentifier": "img_handle"
                                

                                NOTE: Enabling the storage feature with type local as above backs up both JSON and blob data, which leads to high disk space consumption on the host system; the consumption is higher in case of multiple streams. For example, 1 hour of data for a single stream with an image resolution of 1920 x 1200 consumes 3.1 GB of disk space.

                                The following are the details of the keys in the above config:

                              • The enabled flag is false by default; if a backup of the data is needed, set enabled to true.

                              • The type denotes the type of storage needed, i.e., either local or cloud.

                              • The autoSyncIntervalInSec denotes the time interval in seconds indicating how often the database has to be queried to back up the data.

                              • The blobIdentifier denotes the identifier used for BLOB data. For example, in the following Edge Video Analytics Microservice data snippet, the blobIdentifier is img_handle:

                              • {'height': 1200, 'width': 1920, 'channels': 3, 'img_handle': 'cd3a30e5c3', 'defects': [], 'encoding_type': 'jpeg', 'encoding_level': 95}

                              • The backed-up data is stored in the /opt/intel/eii/data/ds_backup/ folder, which consists of two subfolders: one for JSON data and one for blob data.

                              • The JSON data is stored in JSON format, with the filename composed of the topic name and a timestamp, e.g., edge_video_analytics_results_2024-01-23-14_44_58.json.

                              • For blob data, the file extension is taken from the 'encoding_type' field of the Edge Video Analytics Microservice's data and the filename is composed of the topic name and the img_handle; blob data is stored in the blob/ folder, e.g., blob/2024-02-01-16:25:32/edge_video_analytics_results_ff875f1df.jpeg. The naming pattern is sketched below.
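                                The naming pattern above can be summarised with the following rough sketch; it only mirrors the example file names shown above (including the assumed json/ and blob/ subfolder names) and is not the DataStore backup code itself.

                                import os
                                from datetime import datetime

                                BACKUP_ROOT = "/opt/intel/eii/data/ds_backup"

                                def backup_paths(topic, img_handle, encoding_type):
                                    """Compose backup file paths following the naming pattern described above (illustrative)."""
                                    now = datetime.now()
                                    json_path = os.path.join(BACKUP_ROOT, "json",
                                                             f"{topic}_{now.strftime('%Y-%m-%d-%H_%M_%S')}.json")
                                    blob_path = os.path.join(BACKUP_ROOT, "blob", now.strftime('%Y-%m-%d-%H:%M:%S'),
                                                             f"{topic}_{img_handle}.{encoding_type}")
                                    return json_path, blob_path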

                              • JSON Datatype (InfluxDB)

                                For nested JSON data, by default, Data Store flattens the nested JSON and pushes the flat data to InfluxDB. To avoid flattening a particular nested key, mention that key in the config.json file. Currently, the defects key is excluded from flattening. Every key to be ignored has to be on a new line.

                                For example,

                                ignore_keys = [ "Key1", "Key2", "Key3" ]
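                                A simplified sketch of this flattening behaviour is shown below; it assumes dot-separated flattened key names and an ignore list like ignoreKeys above, and the actual parser may differ in details.

                                import json

                                def flatten(data, ignore_keys, parent=""):
                                    """Flatten nested JSON; keys listed in ignore_keys are stored as JSON strings instead."""
                                    flat = {}
                                    for key, value in data.items():
                                        name = f"{parent}.{key}" if parent else key
                                        if key in ignore_keys:
                                            flat[name] = json.dumps(value)   # keep the nested value as a string
                                        elif isinstance(value, dict):
                                            flat.update(flatten(value, ignore_keys, name))
                                        else:
                                            flat[name] = value
                                    return flat

                                # e.g. flatten({"defects": [{"x": 1}], "roi": {"w": 10}}, ["defects"])
                                # -> {"defects": '[{"x": 1}]', "roi.w": 10}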
                                

                                By default, all the keys in the data schema are pushed to InfluxDB as fields. If tags are present in the data schema, they can be mentioned in the config.json file; the data pushed to InfluxDB will then have both fields and tags. At present, no tags are visible in the data schema and tag_keys is kept blank in the config file.

                                For Example,

                                tag_keys = [ "Tag1", "Tag2" ]
                                

                                Blob Datatype (MinIO Object Storage)

                                The MinIO object storage primarily subscribes to the stream that comes out of the EdgeVideoAnalyticsMicroservice app via the Intel® Edge Insights System messagebus and stores the frames into MinIO for historical analysis.

                                The high-level logical flow of MinIO object storage is as follows:

                              • The Intel® Edge Insights System messagebus subscriber in MinIO object storage subscribes to the classified results (metadata, frame) published by EdgeVideoAnalyticsMicroservice on the Intel® Edge Insights System messagebus. The img_handle is extracted from the metadata and used as the key, and the frame is stored as the value for that key in MinIO persistent storage.
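                                Conceptually, that write path resembles the following sketch using the MinIO Python client; the endpoint, credentials and bucket name are placeholders, and this is not the service's own code.

                                import io
                                from minio import Minio  # pip install minio

                                # Placeholder endpoint/credentials for illustration only
                                client = Minio("ia_miniodb:9000", access_key="<MINIO_ACCESS_KEY>",
                                               secret_key="<MINIO_SECRET_KEY>", secure=False)

                                def store_frame(img_handle: str, frame_bytes: bytes, bucket: str = "image-store-bucket"):
                                    """Store the frame bytes under the img_handle key, mirroring the flow described above."""
                                    client.put_object(bucket, img_handle, io.BytesIO(frame_bytes), length=len(frame_bytes))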

                              • Intel® Edge Insights System Msgbus Interface

                              • Data Store starts the Intel® Edge Insights System messagebus Publisher thread, Subscriber thread, and request-reply thread based on the PubTopics, SubTopics and Server configuration.

                              • The Intel® Edge Insights System messagebus Subscriber thread connects to the PUB socket of the Intel® Edge Insights System messagebus on which the data is published by EdgeVideoAnalyticsMicroservice and pushes it to InfluxDB (metadata).

                              • The Intel® Edge Insights System messagebus Publisher thread publishes the point data ingested by Telegraf and the classifier results coming out of the point data analytics.

                              • Intel® Edge Insights System Msgbus/REST Request-Response Interface

                                For a historical analysis of the stored classified images or metadata, Data Store starts an Intel® Edge Insights System Messagebus/gRPC/REST Request-Response Interface server which provides the read, write, update, list, clear, and delete interfaces.

                                The payload format for Intel® Edge Insights System Msgbus/gRPC Request-Response Endpoints is available as part of Tutorial 1 in Intel® Edge Insights System Msgbus/gRPC Request-Response Endpoints.

                                The payload format for Intel® Edge Insights System REST API Endpoints is available as part of Tutorial 2 in REST API Endpoints.

                                Note: The gRPC request-response interface server currently supports DEV mode only.

                                DB Server Supported Version

                                Currently, DB Handlers are supported and tested with the following versions of the respective DB servers:

                                DB Server    Supported Version
                                ---------    ----------------------------
                                Influx       1.8.7
                                Minio        RELEASE 2020-12-12T08:39:07Z

                                Intel® Edge Data Collection

                                Intel® Edge Data Collection is a microservice that helps in collection and annotation of input video data. The collected/annotated data can then be used for AI model training/fine-tuning and statistical analysis.

                                Note: We’ll be using the terms Intel® Edge Data Collection, EdgeDataCollection and data collection microservice interchangeably in this document. They refer to the same microservice.

                                Features and Modes

                                The microservice offers the following features based on the configuration it is run with:

                              • Human Annotation: Allows the user to review and/or annotate images stored in DataStore as tasks in CVAT.

                              • Auto Annotation: Allows the microservice to annotate images live as they are ingested from the source (i.e., EVAM) and store them in DataStore. The annotating algorithm is a user-defined function (UDF) configurable in the microservice's config.json.

                              • Remote Storage: Allows the user to export object detection data or classification data from DataStore in MS-COCO and ImageNet data formats respectively to a persistent file storage.

                              • Data Filter: Drops frames live as they are ingested from the source (i.e., EVAM) to avoid storing them in DataStore. The filtering algorithm is a user-defined function (UDF) configurable in the microservice's config.json.

                                Configuration Changes and Deployment behavior

                                Being deployed as part of the Edge Insights System microservices stack, the behavior of the Edge Data Collection microservice depends on the configuration currently set for it in the consolidated eii_config.json. This configuration file can be found in [WORK_DIR]/IEdgeInsights/build/eii_config.json. The microservice's configuration is under the DataCollectionMicroservice/config section. By default, human annotation is enabled, whereas features such as remote storage, auto annotation and data filter are disabled. Users can enable these features by updating the configuration in one of the following two ways.

                              • If the user wants to persist the change, the consolidated eii_config.json has to be updated before a redeployment. Upon re-deployment, the microservice will pick up the new configuration and run accordingly.

                              • If the user wants to make a temporary change without re-deploying the entire Edge Insights System stack, the microservice's configuration can be updated in the ETCD UI. The change restarts the microservice, which then runs as per the new configuration.

                                The description of each feature, along with its sub-configuration and the steps to enable it, is provided below.

                                Human Annotation Mode

                                The data collection microservice allows the user to select a set of images already stored in DataStore and add/update annotations for them. The annotation tool used is CVAT, offered as a separate microservice but provided along with the data collection microservice. Users can choose which set of images to annotate, along with other options provided as part of the data collection microservice's configuration.

                                In order to enable human annotation, the following steps need to be followed.

                                Prerequisites

                              • Edit the [WORK_DIR]/IEdgeInsights/build/.env file and add the entries that are related to CVAT as shown below.

                                # CVAT related config
                                CVAT_SUPERUSER_NAME=cvat_user
                                # For CVAT_SUPERUSER_PASSWORD, the length must be a minimum of 9 characters (alphanumerics). Do not use commonly used passwords. Do not use any special characters in your password. Ensure that the password is not entirely numeric. Example, somerandom123. More rules to be followed for successful password creation can be found here: https://docs.djangoproject.com/en/2.0/topics/auth/passwords/#password-validation
                                CVAT_SUPERUSER_PASSWORD=
                                # For CVAT_HOST, provide the host machine IP if the system is connected to an external network - In this case, CVAT can be accessed from any machine on the same network. If the system is not connected to an external network then provide the docker system host IP - In this case, CVAT can only be accessed from the same machine where it is running. The docker system host IP can be obtained by running the following command:
                                # docker network inspect bridge -f '{{range .IPAM.Config}}{{.Gateway}}{{end}}'
                                # Generally, the IP is found to be 172.17.0.1, but can vary and hence it is better to always obtain it from the above command
                                CVAT_HOST=172.17.0.1
                                CVAT_PORT=8443
                                CVAT_HTTPS=true
                                

                                Note: In order to ensure successful CVAT_SUPERUSER_PASSWORD creation, length must be a minimum of 9 characters (alphanumerics). Do not use commonly used passwords or any special characters in it. Ensure that the password is not entirely numeric. More rules to be followed for successful password creation can be found here. Examples of commonly used passwords to avoid can be found here

                                Configuration

                                The configuration for the human annotation sub-section can be found in the microservice’s config.json file, under the annotation section. The default configuration is as follows, but users can modify the values according to their needs.

                                "annotation": {
                                    "categories": [
                                        "anomalous"
                                    "human_annotation": {
                                        "enabled": true,
                                        "image_selection_for_annotation_policies": {
                                            "img_handles": [],
                                            "last_x_seconds_of_images": 0,
                                            "start_date": "2024-01-01T00:00:00Z",
                                            "end_date": "2024-01-01T13:00:00Z",
                                            "image_type": "annotated",
                                            "db_topics": [
                                                "edge_video_analytics_results"
                                        "task_sync_policies": {
                                            "is_auto_sync": true,
                                            "auto_sync_interval_in_sec": 30
                                        "cvat": {
                                            "task_creation_policies": {
                                                "max_images_per_task": 100,
                                                "max_images_per_job": 5,
                                                "max_images_per_live_task": 5
                                            "login_retry_limit": 10
                                    "auto_annotation":{
                                        "enabled": false,
                                
                              • Options to configure “human_annotation” in config.json([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json)

                              • annotation.categories

                              • Mandatory. Specify the categories which need to be supported in CVAT for human annotation.

                              • Example: "categories":["vehicle", "Person"]

                              • enabled

                              • Mandatory. Ensure that you have set "enabled": true for “human_annotation”

                              • Also ensure that you have set "enabled": false for “auto_annotation” since human annotation and auto annotation should not be enabled at the same time.

                              • image_selection_for_annotation_policies

                              • start_date and end_date

                              • Update the above policies in YYYY-MM-DDTHH:MM:SSZ format.

                              • Example: "start_date": "2022-09-02T00:00:00Z" and "end_date": "2022-09-10T00:00:00Z" if you would like to see CVAT tasks for images in image store ranging from 2022-09-02 (inclusive) to 2022-09-10 (inclusive).

                              • Note: Ensure that both start_date and end_date are within the retention policy of DataStore. Refer DataStore (readme) for more details.

                              • last_x_seconds_of_images

                              • Specify a value in seconds in order to retrieve the last “x” seconds worth of images for creating human annotation tasks in CVAT

                              • Example: A value of 60 would create human annotation task(s) in CVAT for the images that were stored in Data Store in the last 60 seconds.

                              • Note:

                              • Provide a value of 0 if you do not want this configuration setting to override the start_date and end_date configuration settings.

                              • img_handles

                              • Specify image handles for creating human annotation tasks in CVAT

                              • Example: "img_handles": ["1b387bad73", "5b387bad73"] would create CVAT task(s) for the mentioned image handles

                              • Note:

                              • Provide a value of "img_handles": [] if you do not want this configuration setting to override the start_date, end_date and last_x_seconds_of_images configuration settings.

                              • This configuration setting works regardless of whether image handles are annotated or not.

                              • image_type

                              • Update this policy to “annotated”, “unannotated” or “all” to create annotation tasks in CVAT for the respective images.

                              • Example 1: "image_type": "annotated" would create tasks in CVAT only for images that already have annotation associated with them in influxdb. Such tasks can be termed as “review tasks” that are meant for reviewing already existing annotation (that are present due to auto annotation module or human annotation module)

                              • Example 2: "image_type": "unannotated" would create tasks in CVAT only for images that do not have any annotation associated with them in influxdb. Such tasks are meant to perform annotation for the first time.

                              • Example 3: "image_type": "all" would create tasks in CVAT for images regardless of whether they have annotation associated with them in influxdb. These are hybrid tasks that are meant to perform review of existing annotation for certain images and to perform fresh annotation on the rest of the images.

                              • db_topics

                              • The topics/tables in the data store that need human annotation tasks. In multi-stream configuration, the topics are auto-populated with the topic names of all the streams.

                              • is_auto_sync

                              • Update this policy to either true or false.

                              • Example 1: "is_auto_sync": true. In this case, auto sync will happen in regular intervals as specified by the auto_sync_interval_in_sec policy in the below point.

                              • Example 2: "is_auto_sync": false. In this case, no annotations from CVAT will be synced with influxdb.

                              • auto_sync_interval_in_sec

                              • Update this policy to a value (in seconds) that defines how often you would like to poll the CVAT tasks list. This polling is necessary to update the influxdb with annotations from cvat. Note that the update to influxdb happens only when the task has been updated since the last time auto sync ran.

                              • Example: "auto_sync_interval_in_sec": 60 polls the CVAT tasks list every 60 seconds and if any particular task happens to be updated since the last time auto sync ran (60 seconds ago), then the task data (annotation) is synced with influxdb.

                              • Note: If there is a CVAT task that contains one or more images without annotation, then the respective CVAT job has to be marked as “completed” by clicking on Menu->Finish the job (and just clicking on “save” in the job is not enough) in order to update the annotation_type in influxdb as human for those images with empty annotation. This is necessary to have accurate annotation available for deep learning model training to avoid false positives/negatives.

                              • Explanation with the help of an example: Let us say that there are 10 images in a certain CVAT task. Two of them have annotation and the remaining 8 do not. Upon clicking “save” within the CVAT job, the next time the auto sync mechanism in Edge Data Collection runs, the annotations for the 2 images and their annotation type as human will be synced with influxdb. However, for the other 8 images that do not have any annotation, the empty annotation and the annotation type as human will be updated in influxdb by the auto sync mechanism only upon completion (and not just “save”) of the CVAT job.

                              • max_images_per_task

                              • Update this policy to set a limit on maximum images that a CVAT task can hold

                              • Example: "max_images_per_task": 100. This would mean that every task that gets created in CVAT can hold a maximum of 100 images. The actual amount of images that goes into a particular task is decided by the intersection with other policies: start_date, end_date and image_type

                              • Note: Set this policy to 0 if you do not want any tasks to be created.

                              • max_images_per_job

                              • Update this policy to set a limit on maximum images that a CVAT job in a given task can hold

                              • Example: "max_images_per_job": 1. This would mean that every job that gets created in CVAT can hold a maximum of 1 image.

                              • Note: Setting this policy to a value greater than 1 would mean that, when a given job is saved, all images in the job get synced back to DataStore during auto sync as ‘human’ annotated, even though all those images might not have had their annotation modified by the user.

                              • max_images_per_live_task

                              • Update this policy to set a limit on maximum images that a live task in CVAT can hold. The images in this case come from live feed. Consider giving this policy a lower number if you face frame drop issues.

                              • Example: "max_images_per_live_task": 50

                              • Note: Set this policy to 0 if you do not want any live tasks to be created.

                              • login_retry_limit

                              • Update this policy to a reasonable number which decides how many times login to CVAT will be attempted by the microservice before giving up.

                              • Example: "login_retry_limit": 5

                              • Note: After running eii_start.sh, open the CVAT web application by navigating to https://<host_machine_IP>:8443 in Google Chrome and log in using the superuser credentials you provided in the .env file. Perform annotation as necessary for the created tasks. Annotators within the team can create accounts for themselves using the CVAT web app and perform annotations as well for the tasks that the superuser assigned to them.

                              • IMPORTANT

                              • Ensure not to logout of CVAT in browser as it causes issues in interaction with Edge Data Collection. In case you do logout, after you log back in, restart the Edge Data Collection container.

                              • Google Chrome is the only browser that is supported by CVAT. For more details, see link

                              • Auto Annotation Mode

                                Edge Data Collection also allows users to annotate images automatically as they are being ingested from the source, i.e., EVAM. The annotating algorithm is a user-defined function (UDF). This is helpful in a scenario where the user would like to run an AI model to generate a first set of annotations for the images and store them in DataStore. These annotations can later be reviewed by a subject matter expert using the microservice in human annotation mode.

                                Since DataStore will now subscribe to the data collection microservice instead of EVAM, the following interface changes have to be done.

                                Prerequisites

                              • Remove "DataCollectionMicroservice": "" from the "subscriber_list" in [WORKDIR]/IEdgeInsights/build/builder_config.json if it already exists

                              • Add the following subscriber and publisher to DataCollectionMicroservice interface in config.json([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json)

                                  "Publishers": [
                                        "AllowedClients": [
                                        "EndPoint": "/EII/sockets",
                                        "Name": "default",
                                        "Topics": [
                                            "dcaas_data"
                                        "Type": "zmq_ipc"
                                "Subscribers": [
                                        "EndPoint": "/EII/sockets",
                                        "Name": "default",
                                        "PublisherAppName": "EdgeVideoAnalyticsMicroservice",
                                        "Topics": [
                                            "edge_video_analytics_results"
                                        "Type": "zmq_ipc",
                                        "zmq_recv_hwm": 50
                                
                              • Add the following subscriber to DataStore’s config.json interface at [WORKDIR]/IEdgeInsights/DataStore/config.json

                                "Subscribers": [
                                        "EndPoint": "/EII/sockets",
                                        "Name": "default",
                                        "PublisherAppName": "DataCollectionMicroservice",
                                        "Topics": [
                                            "dcaas_data"
                                        "Type": "zmq_ipc",
                                        "zmq_recv_hwm": 50
                                

                                With the above changes, Edge Data Collection will now subscribe to EVAM for the frames and metadata, auto-annotate them when enabled, and publish them for DataStore to subscribe to and store.

                                Configuration

                                The configuration for the auto annotation sub-section can be found in the microservice’s config.json file, under the annotation section. Disabled by default.

                                "simple_storage": {
                                    "enabled": true
                                "annotation": {
                                    "categories": [
                                        "anomalous"
                                    "auto_annotation": {
                                        "enabled": true,
                                        "udfs": [
                                                "deployment": "/udfs/person_detection/deployment",
                                                "device": "CPU",
                                                "name": "person_detection.geti_udf",
                                                "type": "python",
                                                "visualize": "false",
                                                "metadata_converter": "geti_to_dcaas",
                                                "is_data_filter": false,
                                                "filter_threshold":-1
                                    "human_annotation":{
                                        "enabled": false,
                                
                              • Options to configure “auto_annotation” in config.json([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json)

                              • enabled

                              • Ensure that you have set "enabled": true for “auto_annotation”

                              • Ensure that you have set "enabled": true for “simple_storage”

                              • Also ensure that you have set "enabled": false for “human_annotation” since human annotation and auto annotation should not be enabled at the same time.

                              • categories

                              • Specify the categories which need to be supported in auto annotation

                              • Example: "categories":["No object","Person"]. Ensure that all the possible labels that the udf would output is present in this categories list.

                              • For auto annotation in Edge Data Collection, your model of choice is run as a User Defined Function (UDF). Obtain the required model files and the UDF script and place it under DataCollectionMicroservice/udfs directory.

                              • Currently only python UDFs are supported. So, ensure that "type": "python" is set and "name" is provided mandatorily.

                              • Data Filter

                                The Edge Data Collection microservice allows the user to filter incoming frame data so that it is not processed and stored in DataStore. This is helpful in scenarios where the user would like to drop frames that are not of interest. The filtering algorithm is a user-defined function (UDF).

                                The subscriber and publisher interface setup is similar to the auto-annotation mode, where ingested frames are subscribed from EVAM and published to DataStore.

                                Configuration

                                The configuration for the data filter sub-section can be found in the microservice’s config.json file, under the data_filters section. Disabled by default.

                                "data_filters": {
                                    "enabled": true,
                                    "udfs": [
                                            "deployment": "/udfs/person_detection/deployment",
                                            "device": "CPU",
                                            "name": "person_detection.geti_udf",
                                            "type": "python",
                                            "visualize": "false",
                                            "metadata_converter": "geti_to_dcaas",
                                            "is_data_filter": true,
                                            "filter_threshold":0.7
                                

                                Options to configure “data_filters” in config.json([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json)

                              • enabled

                              • Set "enabled": true for “data_filters” to run or set "enabled": false to disable it.

                              • The data filter in Edge Data Collection uses a person detection model to filter frames containing people. This model is run as a User Defined Function (UDF). Obtain the required model files and the UDF script and place them under the DataCollectionMicroservice/udfs directory.

                              • Currently, it only supports python UDFs. So, ensure that "type": "python" is set and "name" is provided mandatorily.

                              • Note: A Geti based person detection model has already been provided as a data filter in the udfs/([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/udfs/person_detection) directory with configuration set in config.json (disabled by default). It uses a threshold value of 0.7 to drop image frames that are detected to have at least one person with a confidence beyond 0.7. If you wish to add your own custom filter, please add the appropriate data filter in the udfs/ directory and update the data filter section in config.json.
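                                For reference, a custom data filter UDF could look roughly like the sketch below. It assumes the (drop, frame, metadata) return convention used by EII Python UDFs and that detection results are already present in the metadata; the shipped Geti person-detection filter runs the model itself and may differ.

                                class Udf:
                                    """Illustrative data filter: drop frames containing a confident person detection."""

                                    def __init__(self, filter_threshold=0.7, **kwargs):
                                        self.threshold = float(filter_threshold)

                                    def process(self, frame, metadata):
                                        objects = metadata.get("annotations", {}).get("objects", [])
                                        person_found = any(obj.get("label") == "Person" and obj.get("score", 0) >= self.threshold
                                                           for obj in objects)
                                        # True => drop the frame so it never reaches DataStore
                                        return person_found, None, metadata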

                                Note

                                At present, Edge Data Collection cannot publish or subscribe to more than 1 topic. Please ensure that we do not add more than 1 topic for publisher and subscriber each in the microservice’s interface section in config.json and/or consolidated build/eii_config.json.

                                Remote Storage

                                Edge Data Collection provides an optional remote storage feature that allows the user to export object detection data or classification data in MS-COCO and ImageNet data formats respectively to a persistent file storage. The file storage can be a local disk or a network-mounted volume, as set in the environment variable DCAAS_STORAGE_DIR in the IEdgeInsights/build/.env file. If you have valid Azure storage credentials, you can also upload the same data to the Azure Blob Storage service. The data export feature can be run periodically (autosync) or as a one-time (snapshot) activity. Read further below for more.

                              • Configuration

                                Remote storage provides the following options to configure in config.json. A sample config is shown below.

                                "annotation": {
                                    "categories"
                                
                                
                                
                                
                                    
                                : [
                                        "anomalous"
                                "storage":{
                                    "mode": 0,
                                    "device":"disk",
                                    "auto_sync_interval_in_sec": 30,
                                    "ingestion_fps": 9,
                                    "db_topics": ["edge_video_analytics_results"]
                                    "img_recognition_configs":[
                                        "type": "classification",
                                        "export_format": "imagenet",
                                        "classification_threshold": {"anomalous": 0.5},
                                        "filter": {
                                          "annotation_type": ["auto", "human"],
                                          "start_date": "2023-06-06T00:00:00Z",
                                          "end_date": "2024-12-30T13:00:00Z"
                                

                              • mode

                                Mandatory. Set it to 0 if you wish to turn off the remote storage feature; that makes the other options in the storage config irrelevant. The following modes are supported:

                              • 0: turn off data export functionality

                              • 1: snapshot mode: a one-time data export based on the img recognition config options provided.

                              • 2: autosync mode: a periodic data export based on the img recognition config options provided.

                              • device

                                Mandatory. Refers to the type of storage to be used. The following device types are supported:

                              • disk: when set, the target directory is a disk path (can be host or network mounted). The path is set by DCAAS_LOCAL_STORAGE_DIR in the [WORK_DIR]/IEdgeInsights/build/.env file. Ensure that the path has the necessary write permissions for the microservice. Read further on how to do that.

                              • azure: when set, the target directory is an Azure storage container (https://learn.microsoft.com/en-us/azure/storage/blobs/storage-blobs-introduction#containers). The user must have a valid Azure storage subscription before being able to run remote storage in azure mode. The following additional account keys and Azure storage container path need to be added as environment variables to the build .env file besides DCAAS_LOCAL_STORAGE_DIR. See the Additional Info section on how to retrieve these values from Azure.

                                AZURE_STORAGE_CONNECTION_STRING = # <storage account connection string>
                                AZURE_STORAGE_CONTAINER = # <container or directory in azure cloud where data will be uploaded>
                                

                                NOTE: Regardless of storage device used - disk or azure, the DCAAS_LOCAL_STORAGE_DIR must be set. When storage type is disk, this is the target directory whereas, when storage type is azure, a subdirectory called tmp is created inside this path and is used as a temporary directory for caching local changes before being uploaded to azure. The contents stored in the temporary directory are not guaranteed to persist unlike in disk storage type.
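                                For reference, uploading one exported file to such a container with the azure-storage-blob Python SDK looks roughly like the sketch below; it only illustrates the SDK usage and is not the microservice's own upload code.

                                import os
                                from azure.storage.blob import BlobServiceClient  # pip install azure-storage-blob

                                def upload_export(local_path: str, blob_name: str):
                                    """Upload one exported file to the configured Azure container (illustrative sketch)."""
                                    service = BlobServiceClient.from_connection_string(os.environ["AZURE_STORAGE_CONNECTION_STRING"])
                                    container = service.get_container_client(os.environ["AZURE_STORAGE_CONTAINER"])
                                    with open(local_path, "rb") as data:
                                        container.upload_blob(name=blob_name, data=data, overwrite=True)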

                              • auto_sync_interval_in_sec: Mandatory. Refers to the time interval (in sec) at which auto-sync runs

                              • ingestion_fps: Mandatory. An estimated number of frames being published per second; defaults to 9. This helps the remote storage algorithm adjust/limit the number of rows to fetch from the data store during auto sync. A higher value increases the maximum number of rows fetched per query at a possibly larger response time, and vice versa. It cannot be 0 or negative.

                              • db_topics: Mandatory. List of topics/tables in the data store to be queried by the remote storage. Needs at least 1 topic if storage mode is set to 1 or 2. In case of multi-stream configuration, the topics are auto-populated with the topic names of all the streams.

                              • img_recognition_configs: Mandatory. Refers to sub config related to specific image recognition type.

                                sample classification type config

                                "img_recognition_configs": [
                                      "type": "classification",
                                      "export_format": "imagenet",
                                      "classification_threshold": {"anomalous":0.5},
                                      "filter": {
                                          "annotation_type": ["auto", "human"],
                                          "start_date": "2023-06-06T00:00:00Z",
                                          "end_date": "2024-12-30T13:00:00Z"
                                

                                sample object_detection type config

                                "img_recognition_configs": [
                                      "type": "object_detection",
                                      "export_format": "COCO",
                                      "filter": {
                                          "label_score": {"Person": 0.45, "vehicle": 0.6},
                                          "annotation_type": ["auto", "human"],
                                          "start_date": "2023-06-06T00:00:00Z",
                                          "end_date": "2024-12-30T13:00:00Z"
                                
                              • type Mandatory. Refers to the type of image recognition. The following types are supported:

                              • classification: for classification type of image recognition

                              • object_detection: for object detection type of image recognition

                              • Note: Both types of configs together are not supported at the moment due to database schema challenges.

                              • export_format Mandatory. Refers to the format in which the data is exported. The following formats are supported:

                              • imagenet: for classification type of image recognition

                              • coco: for object detection type of image recognition

                              • classification_threshold Optional. Specific to the classification type. Defaults to 0.5 for any class not provided. Refers to a key-value pair mapping a class label to the threshold at or above which data is exported. For example, if a metadata entry has the label anomalous with a score of 0.45 and classification_threshold is set to 0.5 in the config, the export for that particular metadata entry is skipped.

                                "classification_threshold": {"anomalous": 0.5}
                                
                              • filter Mandatory in snapshot mode (mode=1). Refers to the filter criteria for the data to be exported. The following filter criteria are supported. Only data in DataStore satisfying this filter combination (AND of all the options provided) will be exported.

                              • annotation_type: Optional. Refers to a list of annotation types to be exported. The following annotation types are supported:

                              • auto: for auto annotation

                              • human: for human annotation

                              • start_date: Mandatory when in snapshot mode(mode=1). Irrelevant in autosync mode(mode=2). The start date for the data to be exported in UTC. The date should be in the format YYYY-MM-DDTHH:MM:SSZ.

                              • end_date: Mandatory if start_date is provided and when in snapshot mode (mode=1). Irrelevant in autosync mode (mode=2). The end date for the data to be exported in UTC. The date should be in the format YYYY-MM-DDTHH:MM:SSZ.

                              • Note:
                                • Ensure that both start_date and end_date are within the retention policy of DataStore. Refer DataStore (readme) for more details.

                                • When in autosync mode, start_date and end_date do not matter and can be removed.

                                • label_score: Optional. Specific to object_detection. Refers to key-value pairs mapping bounding box labels to the score at or above which the frame metadata is exported. If any one bbox is found violating its label score, the export for the entire frame is dropped.

                                • All filter values are optional. If omitted, they simply are not part of the WHERE clause in the search query, thus widening the search criteria.

                                • Export behavior

                                  Depending upon the image_recognition type chosen, the exported directory structure can vary. For both, topic wise parent directories are created within which data is exported.

                                • Classification

                                  For the classification type of export, inside the topic directory, a class-wise folder structure is created containing images tagged to that particular class. This method of segregating images is also known as the ImageNet data format for classification. Before exporting to the relevant label directory, we identify whether the export is a binary-class export or a multi-class export, using the labels present in the categories list in the annotation section of the microservice's config.json. The directory assignment logic for export is as follows.

                                • If only one category is present in the categories list, the export is considered a binary-class export:

                                • For an image handle, if the label (from the categories list) is found to be of the NEGATIVE class (i.e., < threshold), the image is moved into the non-<label> dir.

                                • For an image handle, if the label (from the categories list) is found to be of the POSITIVE class (i.e., >= threshold), the image is moved into its <label> dir.

                                • However, if multiple categories are present in the categories list, the export is considered a multi-class export:

                                • For an image handle, if all the labels found in the DB are of the NEGATIVE class (i.e., < threshold), the image is moved to the no_class dir.

                                • For an image handle, if any/all labels are found to be of the POSITIVE class (i.e., >= threshold), the image is copied to the respective <label> dir(s).

                                  For example, consider metadata in InfluxDB for the table edge_video_analytics_results with an img_handle column and a class score column such as person; a sketch of this directory-assignment logic is shown below.
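                                  A rough sketch of this directory-assignment logic (function and variable names are illustrative; thresholds come from classification_threshold with 0.5 as the default):

                                  def export_dirs(scores, categories, thresholds, default_threshold=0.5):
                                      """Return the label directories an image should be exported to (illustrative only)."""
                                      def thr(label):
                                          return thresholds.get(label, default_threshold)

                                      if len(categories) == 1:                       # binary-class export
                                          label = categories[0]
                                          return [label] if scores.get(label, 0) >= thr(label) else [f"non-{label}"]

                                      # multi-class export
                                      positives = [lbl for lbl in categories if scores.get(lbl, 0) >= thr(lbl)]
                                      return positives if positives else ["no_class"]

                                  # e.g. export_dirs({"anomalous": 0.7}, ["anomalous"], {"anomalous": 0.5}) -> ["anomalous"]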

                                • Object Detection

                                  For the object detection type of export, the data format used is MS-COCO. Similar to the classification export, data is grouped inside a topic directory. The data is segregated into day-wise subdirectories depending on the ingestion time into InfluxDB (YYYYMMDD format). Inside each, the data is exported in COCO format. See the sample structure below.

                                  └── edge_video_analytics_results
                                      ├── 20241702
                                         ├── annotations
                                            └── instances_20241702.json
                                         └── images
                                             ├── x.jpg
                                             └── y.jpg
                                      └── 20241703
                                          ├── annotations
                                             └── instances_20241703.json
                                          └── images
                                              ├── a.jpg
                                              ├── b.jpg
                                              └── c.jpg
                                  
                                • Note: if data is exported with snapshot mode on, the above exported data is kept in a separate directory whose folder name is the time of the run, to distinguish it from other snapshot exports, if any.

                                • Unsupported features

                                  Both object detection and classification data export are supported with following exclusions.

                                • Unsupported features for classification type data export

                                • device type azure is not supported when type is classification.

                                • Category Labels

                                  Ensure that the labels provided in classification_threshold or in label_score are present in the annotation.categories list in Edge Data Collection's config (e.g., anomalous in the given example config).

                                • Permissions for storage dir

                                  If remote storage is enabled in config, set the target path for storage and enable permissions as follows. In the .env file, set DCAAS_STORAGE_DIR to the directory where you wish to remotely save the annotations and images (ex: /path/to/persistent/remote/storage). As a better alternative to sudo chmod 777 /path/to/persistent/remote/storage (since this allows anyone to read/write the data), please ensure that write permissions are given to the persistent remote storage path for the user mentioned in env variable EII_USER which defaults to eiiuser.

                                  Configuring other options

                                  Configuring “options” in config.json([WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json)

                                • input_queue

                                • Stores raw frames from video ingestion.

                                • data_filter_queue

                                • Stores frames from data filter.

                                • output_queue

                                • Stores processed frames that are ready to be published.

                                • max_size for input_queue, data_filter_queue and output_queue.

                                • Sets the size of the mentioned queues.

                                • Consider giving it a higher value if you face frame drop issues.

                                • If value is <=0, queue size is infinite.

                                • Mandatory Metadata Fields & Compatible UDFs

                                  Features like Annotation and Remote Storage provided by Edge Data Collection require certain fields to be present in the frame metadata to operate. These fields carry annotation information, track data export status and other important details. Since User Defined Functions (UDFs) are usually attached to an ingestion source such as EdgeVideoAnalyticsMicroservice (EVAM), we let UDFs add these fields along with any other fields they already add as part of their functionality. Users need to ensure that these UDFs are added to EVAM's configuration file.

                                • Mandatory Fields

                                  In case you want to add your custom UDF and make it compatible with Edge Data Collection, ensure that the following fields are present in the metadata published by the UDF. Any missing key here will result in an error. Keys with sample values.

                                  "last_modified": time_ns()
                                  "export_code": 0
                                  "annotation_type": "auto"
                                  "annotations": {"objects":[]}
                                  
                                • last_modified: Mandatory. Refers to the last time the metadata was modified. The value should be in nanoseconds.

                                • export_code: Mandatory. Refers to the status of the data export. The following export codes are supported:

                                • 0: Data has not been exported

                                • 1: Data has been exported

                                • annotation_type: Mandatory. Refers to the type of annotation. The following annotation types are supported:

                                • auto: for auto annotation

                                • human: for human annotation

                                • annotations: Mandatory. Refers to the annotations for objects detection in the image.

                                • objects: A list of objects detected in the image. If present, each object should have the following keys:

                                • label: The label of the object

                                • score: The confidence score of the object

                                 • bbox: The bounding box of the object. It is a list of bounding box coordinates in top-left and bottom-right format, for example [x1, y1, x2, y2]

                                 • It is important that the objects within the annotations dictionary have bounding box dimensions in top-left and bottom-right format. Some models predict the bounding box as top-left coordinates plus width and height; in that case, the UDF must convert it to the expected top-left and bottom-right format to avoid incorrect bounding box dimensions during data export or visualization. For example, a prediction of [x=100, y=50, width=40, height=80] must be converted to [100, 50, 140, 130] (x2 = x1 + width, y2 = y1 + height).

                                • Classification label fields

                                   The annotations key-value pair only carries bounding box information from an object detection model. If the UDF is a classification model, ensure that a classification label key with its score value is also present in the metadata published by the UDF, in addition to the mandatory fields above. For example, if a UDF classifies an image as the Person class with a confidence score of 0.45, the published metadata must contain the key-value pair "Person": 0.45. If a data export is to be done, the classification label Person should be present in the categories field of [WORK_DIR]/IEdgeInsights/DataCollectionMicroservice/config.json; see the Remote Storage section for more details. The annotations field is still required for classification-type UDFs but can have an empty objects list, as in the sample metadata shown below.
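
                                   For illustration, the metadata published by such a classification UDF could look like the following minimal sketch (it combines the mandatory fields listed earlier with the hypothetical Person label from the example above):

                                   "last_modified": time_ns()
                                   "export_code": 0
                                   "annotation_type": "auto"
                                   "annotations": {"objects":[]}
                                   "Person": 0.45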

                                • Sample compatible UDFs

                                   Certain sample UDFs provided with EVAM already add these metadata fields, making them Edge Data Collection compatible:

                                • Anomalib UDF

                                • Person Detection UDF

                                   Viewing logs

                                 • To check Edge Data Collection container logs:

                                • Run docker logs ia_data_collection_microservice --follow

                                 • To verify that the corresponding image metadata (e.g., human/auto annotation) is being saved properly in the local database:

                                  docker exec -it ia_datastore /bin/bash
                                  influx -precision rfc3339 -database datain -username <INFLUXDB_USERNAME_provided_in_.env_file> -password <INFLUXDB_PASSWORD_provided_in_.env_file> -ssl -unsafeSsl
                                  select * from dcaas_data # Or any desired influx query that helps validate your use case
                                  

                                  Annotation Filtering

                                  A script has been provided to perform offline aggregation and filtering of images and their corresponding annotations and can be found under helpers/filter_annotations.py.

                                  Before running the script, install cocojson using pip install cocojson

                                • The script provides the following command line arguments

                                • --input_path (required): Path to the Edge Data Collection remote storage directory

                                • --output_path (required): Path to the directory where output should be written

                                 • --topics: A list of 0 or more subscribed topics that should be included in the output

                                • --labels: A list of 0 or more categories of objects identified in the images that should be included

                                 • --annotation_types: A list of 0 or more methods of how the images were annotated, e.g. human or auto. If left empty, all types will be included.

                                 • --creation_start_date: The earliest date when images were ingested by Edge Data Collection. Should be in the format yyyy-mm-ddTHH:MM:SS

                                • If this parameter is provided, --creation_end_date must also be provided

                                 • --creation_end_date: The latest date when images were ingested by Edge Data Collection. Should be in the format yyyy-mm-ddTHH:MM:SS

                                • If this parameter is provided, --creation_start_date must also be provided

                                 • --last_modified_start_date: The earliest date when images were last modified. Should be in the format yyyy-mm-ddTHH:MM:SS

                                • If this parameter is provided, --last_modified_end_date must also be provided

                                 • --last_modified_end_date: The latest date when images were last modified. Should be in the format yyyy-mm-ddTHH:MM:SS

                                • If this parameter is provided, --last_modified_start_date must also be provided

                                • --log_level: The level of logs that should be displayed to the user. Defaults to INFO

                                • Example Usage

                                  python3 filter_annotations.py \
                                  --input_path='/home/user/dcaas_remote_store/' \
                                  --output_path '/home/user/Downloads' \
                                  --topics dcaas_data \
                                  --annotation_types auto human \
                                  --creation_start_date=2023-02-28T00:00:00 \
                                  --creation_end_date=2023-02-28T23:59:59 \
                                  --last_modified_start_date=2023-02-28T00:00:00 \
                                  --last_modified_end_date=2023-02-28T23:59:59 \
                                  --labels box "No Object"
                                  

                                  Status Notification

                                   Edge Data Collection exports the statuses of task creation and of data export progress for snapshots in the form of a JSON file. For both, progress in terms of the number of images processed is continuously updated, and the status is set to COMPLETED or IN_PROGRESS depending on whether all the frames have been processed.

                                  The notification file can be found in ${EII_INSTALL_PATH}/data/dcaas/dcaas_notification.json

                                   Here is a sample of the notification JSON file (comments added for explanation).

                                  "services": [ # list of enabled services with status "remote_storage", "human_annotation" "remote_storage": { # remote storage specific status "snapshot_list": [ "2024-02-14-193903" # list of snapshot. currently only 1 i.e.latest is mentioned "2024-02-14-193903": { # snapshot instance details "time_utc": "2024-02-14T19:39:03", "config": { "mode": 1, "device": "disk", "auto_sync_interval_in_sec": 30, "ingestion_fps": 9, "img_recognition_configs": [ "type": "classification", "export_format": "imagenet", "filter": { "start_date": "2023-06-06T00:00:00Z", "end_date": "2024-12-30T13:00:00Z", "annotation_type": [ "auto", "human" "classification_threshold": { "anomalous": 0.4 "db_topics": [ "edge_video_analytics_results" "status": "IN_PROGRESS", # export status, 'COMPLETED' when done "export_dir": "/storage/2024-02-14-193903", # exported relative dir "processed_frames": 200, # frames exported till now "total_frames": 402 "human_annotation": { # human annotation specific status "time_utc": "2024-02-14T19:39:22.443201Z", "config": { "cvat": { "login_retry_limit": 10, "task_creation_policies": { "max_images_per_job": 5, "max_images_per_live_task": 5, "max_images_per_task": 100 "enabled": true, "image_selection_for_annotation_policies": { "db_topics": [ "edge_video_analytics_results" "end_date": "9999-12-30T13:00:00Z", "image_type": "annotated", "img_handles": [], "last_x_seconds_of_images": 0, "start_date": "2024-01-02T00:00:00Z" "task_sync_policies": { "auto_sync_interval_in_sec": 30, "is_auto_sync": true "status": "COMPLETED", # task creation status "processed_frames": 561, # net frames processed as task are being created "total_frames": 561

                                  Behavior With and Without other Edge Insights System services

                                • Sync of manually reviewed annotations and remote backup of annotations/images is dependent on availability of that particular data in DataStore as per its retention policy.

                                 • Edge Data Collection waits 5 minutes for DataStore to respond to a data query. After that time elapses, Edge Data Collection does not error out, in the expectation that the next iteration of the query API call to DataStore will return meaningful results.

                                 • In case of a fatal error in Edge Data Collection, there is no guarantee that the remote data backup will complete successfully.

                                 • Edge Data Collection requires certain bare-minimum key-value pairs to be present in the metadata. Since EdgeVideoAnalyticsMicroservice (EVAM) is the direct source of metadata, it is up to EVAM and its UDFs to ensure that these fields are present. To learn more, refer to the Mandatory Metadata Fields & Compatible UDFs section in this document.

                                   Useful statistics to consider before running Edge Data Collection

                                   The table below gives the approximate disk space consumption for the mentioned video ingestion (image) configuration with a single stream. Based on how long you plan to run the Edge Data Collection microservice, ensure that a sufficient amount of disk space is available on the edge node.

                                   Mode of execution                 | Image resolution (height x width) | Image encoding type | Image encoding level | Disk space consumption
                                   ----------------------------------|-----------------------------------|---------------------|----------------------|-----------------------
                                   Simple Storage or Auto Annotation | 432 x 768                         |                     |                      | ~30 GB for ~24 hours
                                   Human Annotation*                 | 432 x 768                         |                     |                      | ~70 GB for ~24 hours

                                  *Human Annotation configured with “image_type: all” and with live task creation turned ON.
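
                                   As a rough worked example based on the figures above: ~30 GB per ~24 hours is about 1.25 GB per hour, so a 72-hour run in Simple Storage or Auto Annotation mode would need on the order of 90 GB free, while Human Annotation at ~70 GB per ~24 hours (about 2.9 GB per hour) would need roughly 210 GB for the same period.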

                                  Unit tests

                                   To run the unit tests, navigate to the Edge Data Collection repo root directory and run the following:

                                  make build
                                  make test
                                  

                                  Note for developers

                                 • Disclaimer: The details below on gracefully shutting down Edge Data Collection are valid only until Intel® Edge Insights System supports the ‘depends_on’ flag in builder.py while generating the consolidated docker-compose.yml file.

                                   To gracefully shut down the Edge Data Collection container after completing all the necessary data sync, the other Intel® Edge Insights System containers need to be up and running (e.g., cvat, ia_imagestore). These dependencies can be manually added under the ‘depends_on’ flag of the consolidated docker-compose.yml, as in the sketch below.
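
                                   For illustration, a minimal sketch of such a manual edit to the consolidated docker-compose.yml could look like the following (verify the exact service names against your generated file; cvat and ia_imagestore are the examples mentioned above):

                                   services:
                                     ia_data_collection_microservice:
                                       # ... service definition generated by builder.py ...
                                       depends_on:
                                         - ia_imagestore
                                         - cvat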

                                 • Edge Data Collection does not perform color conversion of the input frame (i.e., BGR to RGB or vice versa). Any desired color format for processing must be handled within the UDF that is plugged in to Edge Data Collection. All images are considered to be in BGR mode by default in Edge Data Collection.

                                • Retrieving Azure keys

                                   Before being able to retrieve the required Azure keys, make sure you have a valid Azure Storage subscription. If not, please contact your system administrator.

                                 • AZURE_STORAGE_CONNECTION_STRING: This connection string can be found in the Azure web portal. Log in to the Azure portal and navigate to Storage Accounts. Find the storage account available to you or your team. Under the Security + networking section, select Access keys. This links to a page where the key(s) associated with your storage account are available. If there are multiple keys, choose any one, click Show, and then copy its “Connection string”. This string should be set as AZURE_STORAGE_CONNECTION_STRING in the .env file inside your IEdgeInsights/build directory.

                                 • AZURE_STORAGE_CONTAINER: This is the name of the container to which Edge Data Collection will upload the exported annotations and images. A container is a directory in Azure Storage terms. To create one, log in to the Azure portal and navigate to Storage Accounts. Find the storage account available to you or your team. Under the Data storage section, select Containers. This opens a portal that lists all the containers created within this storage account. Click the + Container link at the top, enter the name you wish to give the container, and click Create. This string should be set as AZURE_STORAGE_CONTAINER in the .env file inside your IEdgeInsights/build directory. If you are using an existing container, set the value to that name. To learn more, refer to the Azure documentation on containers. The resulting .env entries are sketched below.

                                 • Logs from azcopy may contain proxy information about your network. If you do not want this or any similar information to be shared along with the logs, please redact it before sharing.
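
                                   As an illustrative sketch, the resulting entries in the .env file would look similar to the following (both values are placeholders for the connection string and container name retrieved above):

                                   AZURE_STORAGE_CONNECTION_STRING=<connection string copied from Access keys>
                                   AZURE_STORAGE_CONTAINER=<your container name>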

                                • General

                                • For any issues or abnormal behavior, it is a good idea to have a look at the container health status by running