Data Retention and Backup
Data retention
The SmartFace Platform allows you to set a data retention policy. This also allows you to keep the size of stored data under control.
What data is stored
The SmartFace stores it’s data in two types databases due to different nature of the data stored.
Min.IO
Files and objects are stored in the Min.io database. MinIO is a high-performance, S3 compatible object store. It is suitable for distributed, cloud or on-premises infrastructure. For more information please read project’s webpage.
SQL Databases
The text records and tabular data such as watchlists, watchlist members, templates, match results etc. are stored in SQL databases. The SmartFace supports two database providers:
Microsoft SQL Database
This is the default option for the Windows installation. For more information please read project’s webpage.
Postgre SQL Database
This is the default option for the Docker installation. For more information please read project’s webpage.
Configuring Data Storage In SmartFace
SmartFace Platform can store all data which it recognizes in the incoming live streams or video files. This data may be, for example, whether a face was detected, identified, when it disappeared from a video, on which camera occurred an event and when, etc. All this data can be stored together with cropped images of a person or an object and the full frame from the input.
In various situations, it is required that SmartFace doesn’t store this data. For example, because of the General Data Protection Regulation (GDPR) in European Union, lack of people’s consent for processing of their personal data or simply because of huge requirements on the data storage.
SmartFace Platform allows you to enable or disable the collecting and storing of this data. By disabling data storage, data processed from live streams and video files won’t be stored. Disabling data storage however, doesn’t affect data stored in watchlists.
Data storage options
To view the current configuration and to update the data storage option you can visit the /api/v1/Setup/DataStorage/Video endpoint.
Using the GET /api/v1/Setup/DataStorage/Video method you can see your current configuration. Per default it looks like below:
{
"storageMode": "All"
}
You can enable or disable video data storage by configuring one of the following options in the property storageMode
. If you configure the option None
, SmartFace Platform stops storing the processed data into the database and stops sending database notifications. Only direct notifications about match results sent from SmartFace Platform will be available to you. For more information, see the chapter Notifications explained.
If you configure the option All
, SmartFace Platform stores the events for detected people (regardless if they are registered in a watchlist or not). Data together with the face images and full frames are saved into the database and database notifications are triggered as well. The option All
is the default value.
Face Save Strategy
When storageMode
is set to All
, you should define the configuration FaceSaveStrategy to specify which data will be stored. The following table describes options for FaceSaveStrategy configuration:
Save Strategy Option | Description |
---|---|
Balanced | After the first successful detection, everytime a new best biometric score or detection quality is achieved a new image is saved within the tracklet. |
All | All faces for the detected person are stored. |
MatchedOnly | Only faces which are matched against a watchlist member are stored. |
The default value for FaceSaveStrategy is Balanced
.
Full Frame Saving Strategy
It is also possible to configure whether SmartFace stores full frames for a detected person or not. Configuration of this option is in the property SaveFrameImageData
with values true
or false
. The default value is true
.
Pedestrian Save Strategy
If a Pedestrian Detector Resource is set for a camera, you can then set the configuration regarding the saving strategy for the pedestrian detector. Depending on your configuration you will either save all the images from a pedestrian tracklet, or you will use the balanced strategy, adding a new image into the tracklet when a better pedestrian detection occurs. Therefore there are two options: Balanced
and All
. The default value is Balanced
.
Object Save Strategy
If an Object Detector Resource is set for a camera, you can then set the configuration regarding the saving strategy for the object detector. Depending on your configuration you will either save all the images from an object tracklet, or you will use the balanced strategy, adding a new image into the tracklet when a better pedestrian detection occurs. Therefore there are two options: Balanced
and All
. The default value is Balanced
.
Data Retention Preset’s
Please see below the most common presets, with suggested settings for each preset/use case.
unlimited storage capacity solution
- faceSaveStrategy:
all
- saveFrameImageData:
true
- imageQuality:
100
privacy / GDPR solution
- faceSaveStrategy:
matchedOnly
- saveFrameImageData:
false
compact data solution
- faceSaveStrategy:
FirstFace&BestFace
- saveFrameImageData:
false
- imageQuality:
70
no data solution
- videoDataStorageConfig:
none
Disable SQL Data storage
An additional option is to disable usage of the SQL Data storage. This means that no images are being preserved. This applies to notifications, history and watchlists. The detection, identification, extraction, matching and liveness functionality are preserved, however no image is being stored, so visualization of described events is not possible.
To apply, update the .env
file and set the NoSqlDataStorageDisabled
variable to true. Apply changes using docker-compose up -d
. For Windows version update the appsettings.json
file and restart the services.
NoSqlDataStorageDisabled=true
Automatized Database Cleanup
The SmartFace Platform has DB Cleanup functionality. You are able to setup a daily occuring database cleanup. It can remove both images and records - Smartface Platform allows you to manage the deletion of captured data in the databases.
The settings for the DB Cleanup is available in the REST API available on the port 8098
of your installation.
The /api/v1/Setup/DbCleanup endpoint allows two method of interaction:
- GET /api/v1/Setup/DbCleanup
Using the GET method you can get the information regarding the current configuration of the DB Cleanup.
The default configuration is as below:
{
"cleanupAmount": null,
"enabled": true,
"maxFramesCount": null,
"maxImageDataAge": null,
"cleanupStart": null,
"deleteSql": false,
"deleteMatchResults": false
}
- enabled - defines whether the DB Cleanup is enabled. You can use
true
andfalse
values - cleanupAmount - the amount of image objects to be removed. This value is currently deprecated.
- maxFramesCount - the maximum number of Frames to keep in database (oldest frames are deleted first). If DeleteSql option is disabled, only blob/image data of these frames will be deleted, SQL data will stay in database.
- maxImageDataAge - specifies maximum time (in days) to keep frames in database. E.g. when set to 7, all data older than 7 days will be deleted. If DeleteSql option is disabled, only blob/image data of these frames will be deleted, SQL data will stay in database.
- cleanupStart - CleanupStart defines the time of the day when cleanup should be performed (in UTC). Example value:
01:00:00
which means cleanup will be performed at 1 AM of UTC time. - deleteSql - If set to true, cleanup will delete all data related frames from SQL and NoSQL database. If set to false, cleanup will delete only image data of related frames (full frames, crop images of detected faces/pedestrians). SQL data will be persisted.
- deleteMatchResults - A boolean value indicating whether to also delete match results for faces that are no longer in database.
- PUT /api/v1/Setup/DbCleanup
Using the PUT method you can set the new configuration of the DB Cleanup. Please use a JSON format to set the values to match your needs.
Backup and restore data
This section is Docker specific
Docker Volumes
The SmartFace Platform stores it’s data on Docker volumes. The architecture using volumes are useful for backups, restores, and migrations. When the system is installed, 4 docker volumes are automatically created:
- sf_dependencies_miniodata - stores object and file data, such as images and videos
- sf_dependencies_mssqldata - stores SQL tabular data for the Microsoft SQL Database
- sf_dependencies_pgsqldata - stores SQL tabular data for the PostreSQL Database
- sf_dependencies_rmqdata - stores data for the RabbitMQ message broker
Boths SQL database volumes are created, even though depending on your setup, you will use only one SQL database engine at once. The default SQL database for the Docker version is the PostgreSQL.
Volume Backup
Please follow the steps mentioned below to backup a docker volume. This guide might not solve restoring data backup between all versions. As mentioned below if you are restoring data between two versions you can use the run.sh script for data migration between versions. If in doubt, please contact us for support.
1. Stop the containers
We recommend to stop all docker containers (both dependencies and the main SmartFace set of containers) so there would not be any changes while the backup and/or restore process is ongoing.
You can run the command below in each folder with used docker-compose.yml
- both the main and the sf-dependencies
folder.
docker-compose stop;docker-compose down
It is possible to stop ALL docker containers at once, but this might interfere with other containers than SmartFace’s, if such containers run on the same server. The command is:
docker stop $(docker ps -a -q)
2. Map an external data folder into a volume
Once the containers are stopped we need to open the docker-compose.yml
file in the sf-dependencies
folder with your preferred text editor. Please find the definition of the container you would like to backup in the file. We will show the step on a Min.io container. Per default it will look like this:
minio:
image: "minio/minio:RELEASE.2022-06-02T02-11-04Z"
container_name: minio
command: server /data --console-address ":9001"
ports:
- "9000:9000"
- "9001:9001"
restart: unless-stopped
volumes:
- miniodata:/data
Please create a folder you would like to use for a backup of the volume. For the ease of use we can create such folder directly under the sf_dependencies
folder. To allow us to backup the minio data we need to add new line under the volumes:
section of the Min.io setup, where we can map the backup folder. If the new folder is called backup_minio
, the new line would look like this:
- ./backup_minio:/backups
After adding the new like the Min.io container definition will look like this:
minio:
image: "minio/minio:RELEASE.2022-06-02T02-11-04Z"
container_name: minio
command: server /data --console-address ":9001"
ports:
- "9000:9000"
- "9001:9001"
restart: unless-stopped
volumes:
- miniodata:/data
- ./backup_minio:/backups
In the case of the other containers than the MinIO the paths will be different. For example for the PostreSQL backup mapping it can be ./backups_pgsql:/backups instead.
3. Run the Container
To initiate the mapping we need to run the SmartFace dependencies. In this example we are afer the Min.io. To avoid any changes during the copying, we recommend to keep the SmartFace Platform containers to stay stopped until the whole backup process is completed. You can run the dependencies by going into the sf_dependencies
folder and by running the command:
docker-compose up -d
If the docker containers were initiated successfully, the terminal output will look like below:
user@smartface-server:/srv/smartface/sf_dependencies$ docker-compose up -d
Creating pgadmin ... done
Creating mssql ... done
Creating jaeger ... done
Creating pgsql ... done
Creating minio ... done
Creating rmq ... done
4. Log into the Container
You can now log into the minio container using command:
docker exec -it minio bash
You will see such output in the terminal with prompt ready to accept commands:
user@smartface-server:/srv/smartface/sf_dependencies$ docker exec -it minio bash
[root@87cf176b4f00 /]#
Now we can copy the content of the data folder into the backups on the minio container.
cp -r /data/* /backups
The reason we copy from the data folder is because the line - miniodata:/data
in the docker-compose.yml
states so. This means that the Min.io data is mapped to the /data
folder inside the Container. The newly created backup folder is mapped as /backups
in the Container. If it was for the PostgreSQL container it would be var/lib/postgresql/data/
instead. So the PostgreSQL command would be:
cp -r /var/lib/postgresql/data/* /backups/
Once everything is copied you can actually see the content in the sf-dependencies
and it’s backup_minio
folders. Please check the content is there to verify the backup process. If no other changes are needed you can run the SmartFace containers again running the below command in the folder where the docker-compose.yml
file is located.
docker-compose up -d
Restore volume data
Before trying to restore/update the volume data, please ensure the containers are down, both the main and the dependencies:
docker-compose stop;docker-compose down
Ensure the mapping is set for each container, using steps from the Volume Backup.
Run the containers:
docker-compose up -d
Enter the container (for Min.io as below, for PostreSQL use pgsql
instead of minio
)
docker exec -it minio bash
Copy and replace the data from the backups into the data of the container. Ensure that only the backed up data are in the new folder. Do not mix old and new data.
cp -r /backups/* /data/
It is possible that such operation won’t be allowed by the container as the container is up and running and can occupy/use a file/folder that you would like to override/restore into. In such case you need to setup the container the way that it is up and running however it is not yet running for it’s main purpose - the functionality and services inside are not yet available. You can do so by altering the entrypoint temporarily. This change needs to be removed once the restoration is completed.
To adjust the entrypoint you need to add the line below to the container setup in the `docker-compose.yml`` file:
entrypoint: sleep 1h
The Min.io container definition will look like this:
minio:
image: "minio/minio:RELEASE.2022-06-02T02-11-04Z"
container_name: minio
command: server /data --console-address ":9001"
ports:
- "9000:9000"
- "9001:9001"
restart: unless-stopped
volumes:
- miniodata:/data
- ./backup_minio:/backups
entrypoint: sleep 1h
The value for the sleep command is by default in seconds, i.e. sleep 100 equals to 100 seconds. However you can use units too, where sleep 1h will equal to 1 hour etc.
Once it is set run all the containers by command in the main docker-compose.yml folder
docker-compose up -d
If the versions between the backup and restore do not match - you are restoring data to a newer version you need to run the run.sh script that will migrate the data. Script is located with the main SmartFace docker-compose.yml file in the same directory.
run.sh