Priorities and funded actions
To carry out the preparatory work needed for the creation of high-quality large-scale shareable data sets for innovation, including work on interoperability e.g. through standardisation and privacy e.g. through anonymization.
Priority 1 – Build the proposed activities to the extent possible on solutions and ecosystems already existing and available at EU level.
Among financed actions
Preparatory work shall include some of the following elements:
- Mapping of the families of datasets relevant to innovation in the field of security (e.g. photos, videos, voices samples, unstructured text (e.g. forums), unstructured hybrid data (e.g. web scraping or emails), structured data (e.g. telecommunication signalisation data), etc.
- Analysis of technologies that would allow the sharing of operational data, including but not limited to data anonymisation and generation of synthetic data sets, as a possible solution in cases of legal restrictions on collection and sharing of law enforcement datasets (e.g. need for additional consent to allow new processing of data for the purposes of innovation)
- Analysis and definition of requirements for tools for classification and annotation of data sets, including mapping of existing solutions
Analysis and definition of requirements for tools for data quality assessment including:
- Safety check features (i.e. a user uploading a dataset containing anonymized (e.g. blurred) facial images checking to ensure that no un-anonymized facial image is left in the dataset by accident)
- Quality or coherence checks, version control of the datasets and initial data set for testing and evaluation of AI systems (including synthetic data sets)
- Analysis (including mapping of existing solutions) and definition of requirements for data management tools
- Analysis and definition of requirements for a code repository for these tools, taking into account existing solutions and initiatives at EU level (e.g. Europol’s developed European Code Repository, or the European Anti Cybercrime Development Association’s work on tools for annotation of data)
- Analysis of the estimate cost of developing/acquisition of the above-mentioned solutions, while taking into consideration existing initiatives in order to avoid duplications
- Definition of requirements for User Access Management for the national components of the Data Space
- Defining a list of Requirements for the national components
- Definition of the micro/macro-services
- This calls for proposals is an open call for proposals
- Initial duration of the action: 24 months
- Be legal entities (public bodies from EU Member States’ law enforcement authorities; or private bodies, including profit and non-profit private entities)
- Schengen Associated Countries shall only be able to participate in the consortium as associated observers, upon agreement of the grant beneficiaries
- A consortium composed of at least 3 entities, established in at least 3 different EU Member States or associated countries to the programme must be constituted