Logo Pleamar

RETURN – Operationalization and exploitation of the information obtained from images of fish in the fish market using artificial intelligence.

Pleamar program

Description

The FOTOPEIX and FOTOPEIX 2.0 projects, beneficiaries of previous calls of the Pleamar Programme, have made it possible to develop the artificial intelligence tools necessary to automatically generate data on the size of up to four species from images of fish boxes. Both projects have not only led to a methodological change, but have also made it possible to strengthen a close collaboration with the company that markets the catches of the Mallorca fleet, OPMALLORCAMAR, establishing a solid channel of collaboration that allows the two-way transmission of knowledge between the fishing and scientific sectors.

The RETORNO project aims to close the conceptual and practical framework that began with the granting of the FOTOPEIX projects and to provide the institutions and companies involved with useful information both for the monitoring and evaluation of the populations and for possible internal improvements in the first sale process.

The RETORNO project focuses on the case of size data only as a proof of concept, i.e. to verify that the underlying concept or theory is likely to be exploited in a useful way. In essence, RETORNO aims to develop the basic core of a comprehensive system that can grow and evolve in the future. The fundamental objective of the integral RETORNO system, modular and dynamic, is to structure and automate the flow of information between the observations and the information demanded by the end user, necessary to solve each specific problem. To channel the flow of information, we have identified four phases:

  1. Data collection: a quality control protocol for the taking of images will be articulated, together with periodic sampling at the fish market to validate the size estimates
  2. Processing: application of the deep learning tools developed in the FOTOPEIX projects for the extraction of sizes.
  3. Generation of new information: a long-term time series will be maintained, with a weekly cadence, of the average size of each of the species considered. Likewise, a short-term predictive model (one week ahead) will be implemented. In this second case, it is intended to demonstrate that one of the most relevant potentialities of RETORNO is the iterative and automated updating of the models as time goes by. Finally, a Bayesian Belief Network will be implemented for one of the species considered (Coryphaena hippurus). This type of model allows the relationship between time series of landings, sizes, prices, production costs and other economic variables, together with environmental variables.
  4. Storage on three levels: images (all images of the boxes of fish sold each day), size data (size of each fish measured, with box identifier, day, species…) and output generated in the third phase (models and the weekly time series). The repositories in which the information will be stored will be those of the research centre (IMEDEA) and access will be restricted to project participants. This phase is of vital importance, as it will allow the information to be reviewed retrospectively.

A size bulletin will be published, which will be distributed among the fishermen, agreeing on the final format and exact content with OPEMALLORCAMAR. As a complementary activity, meetings will also be organised with other fish markets and fishermen’s organisations from outside Mallorca, in which the advantages of a comprehensive system such as the one implemented in RETORNO will be shown. The integral system will be structured in R language, emulating a recent proposal (White et. to the , 2019). Likewise, the advantages and disadvantages of alternative languages will be explored.

An example of information flow in the case of fish sizes would start with automatic imaging (Phase 1) whenever there is a sale at the fish market. Each image will be evaluated and, if it contains any specimen of any of the four species considered, the size of those individuals will be extracted (Phase 2). Each day the sizes obtained will be stored together with a box, species and day identifier (Phase 4) to maintain a long-term time series. These data will be automatically processed once a week to update the average size time series for each species and the expected size distribution prediction for the following week (Phase 3). Likewise, an automatic quality control will be carried out, comparing the estimates and observations of each week. The predictive model that will be implemented as a proof of concept will be a first-order autoregressive process, but as many alternative models as appropriate can be integrated into this module. Likewise, the quality control module will allow you to compare the history of the predictions of the different models. In parallel, for models involving predictive variables, an automatic process will be articulated to incorporate new values of these variables.

See the project

Line of action:

Marine ecosystems

Status:

Finalizado

Execution date:

2019

General:

Return to the institutions and companies involved useful information both for the monitoring and evaluation of the catches and for possible internal improvements in the first sale process.

Specific:

1. Design a quality control protocol for the taking of images in Lonja.

2. Application of the deep learning tools developed in the FOTOPEIX projects for the automatic extraction of sizes.

3. Generation of quasi-real-time information and predictions.

4. Storage of information.

5. Autonomous generation of information bulletins.

6. Dissemination of the developed system.

A1. Design of protocols for the correct storage of daily images. 1.1 Implementation of algorithms for image filtering.1.2 Analysis of fish market data for filtering validation.

A2. Automation of image entry into networks for size extraction.

2.1 Development of algorithms for the automation of the size extraction process.

2.2 Sampling at the fish market.

2.3 Data collection in the laboratory.

2.4 Validation of automatisms using real data.

A3. Generation of a long-term time series with weekly cadence.

3.1 Development of algorithms for obtaining weekly data

A4. Implementation of a short-term predictive model (one week ahead).

4.1 Development of a predictive size model.

4.2 Validation of the predictive model

A5. Implementation of a Bayesian Belief Network for the llampuga.

5.1 Implementation of a Bayesian Belief Network for the llampuga.

5.2. Communication of results to companies and regulatory bodies

A6. Implementation of the storage of the generated data.

6.1 Design and development of the storage system for the data generated. Sizes and models.

A7. Development of the contents of Newsletters.

7.1 Review of similar work carried out at national and international level.

7.2 Writing newsletter content

A8. Holding meetings with the sectors involved. 8.1 Meetings with administration, OPMALLORCAMAR and fishermen’s guilds

A9. Dissemination of the project and its results.

9.1 Dissemination in traditional media and social networks.

9.2 Visit to other fish markets to present the project.

9.3 Presentation of the results in the scientific field.

The RETORNO project has developed a system that allows information to be automatically extracted from the images collected at the fish market. To this end, a quality control protocol for the taking of images in the fish market was developed thanks to three filtering algorithms to identify the images that include fish boxes, discard the repeated ones and select the boxes of llampugas (Coryphaena hippurus). The new imaging protocol allows each image to be automatically associated with the weight metadata of the box. A tool has also been developed for the automation of the extraction of information on sizes.

On the other hand, it has managed to obtain time series, with weekly cadence, of the sizes (furcal length) extracted, as well as one-week forecasts of the sizes of llampuga through a statistical model and its comparison with the average size actually reported. Likewise, size bulletins have been sent weekly to fishermen with the information recorded.

In addition, for the generation of real-time information, a Bayesian Belief Network (BBN) has been used that incorporates up to 12 different socioeconomic and climate change scenarios for the forecast of sizes. This type of model allows the relationship between time series of landings, sizes, prices, production costs and other economic variables, together with environmental variables.

The system is prepared to continue operating automatically and indefinitely and, although it has been applied to the llama species, this system is suitable for use with other species, with the corresponding adaptation of the net and the estimation of the size.

programa
linea de actuación

RETURN – Operationalization and exploitation of the information obtained from images of fish in the fish market using artificial intelligence.