Technical ReportPDF Available

Report of the second workshop on an alternative approach to the segmentation of fishing fleets

Authors:

Abstract and Figures

Our second workshop on a newly developed, alternative approach to the segmentation of fishing fleets was held from 28th to 30th of March 2022. Due to travel restrictions caused by the global Covid-19 pandemic, the workshop took place online. Cisco Webex provided the online platform. The workshop was hosted by the Thünen Institute for Sea, Fisheries, part of the Johann Heinrich von Thünen Institute, Germany's Federal Research Insitute for rural areas, forestry, and fisheries. Thirty-seven experts representing 16 nations, the ICES, and DG MARE, participated in the workshop. Fifteen national fisheries data sets were analyzed. A new approach to the segmentation of fishing fleets was developed in a DCF pilot project at the Thünen Institute of Sea Fisheries and transferred to an R package, referred to as "FS-package" in the following. In March 2021, the first workshop on the novel approach and the FS-package was held with 34 experts representing 15 nations, and major progress in improving the package was made. After implementing the suggestions made by the attendants of the first workshop and identifying the next urgent steps for further developing the novel approach, the creators of the approach organized a second workshop.
Content may be subject to copyright.
Edited by Erik Sulanke and Dr. Jörg Berkenhagen
Workshop organized and hosted by:
Thünen Institute of Sea Fisheries, Bremerhaven, Germany, part of
Johann Heinrich von Thünen Institute
Federal Research Institute for
Rural Areas, Forestry and Fisheries
28 30 March 2022
Online Workshop
REPORT
OF THE SECOND WORKSHOP ON AN
ALTERNATIVE APPROACH TO THE
SEGMENTATION OF FISHING FLEETS
Introduction
Our second workshop on a newly developed, alternative approach to the segmentation of shing
eets was held from 28th to 30th of March 2022. Due to travel restrictions caused by the global
Covid-19 pandemic, the workshop took place online. Cisco Webex provided the online platform.
The workshop was hosted by the Thünen Institute for Sea, Fisheries, part of the Johann Heinrich
von Thünen Institute, Germany's Federal Research Insitute for rural areas, forestry, and sheries.
Thirty-seven experts representing 16 nations, the ICES, and DG MARE, participated in the
workshop. Fifteen national sheries data sets were analyzed. The list of participants can be found
in Annex 1. Erik Sulanke and Jörg Berkenhagen chaired the workshop. The agenda can be found
in Annex 2.
Executive summary
A new approach to the segmentation of shing eets was developed in a DCF pilot project at the
Thünen Institute of Sea Fisheries and transferred to an R package, referred to as "FS-package" in
the following. In March 2021, the rst workshop on the novel approach and the FS-package was
held with 34 experts representing 15 nations, and major progress in improving the package was
made. After implementing the suggestions made by the attendants of the rst workshop and
identifying the next urgent steps for further developing the novel approach, the creators of the
approach organized a second workshop and formulated the following ToRs:
1. Harmonize the data preparation and develop a standardized protocol.
2. Clarify the application of the eet segmentation approach and evaluate newly developed
tools.
3. Establish regionally consistent eet segments over multiple member states operating in the
same shing regions.
Besides the mentioned ToRs, various technical issues needed to be addressed before the workshop,
the most signicant being the publication of the FS-package in the publically available repository
GitHub. Regarding ToR 1, newly established, standardized thresholds in a key function assigning
stocks to catch data as well as a novel function for automatic data preparation were well received
by the participants. However, an appropriate way to pre-separate eet data to improve the
clustering result was not agreed upon. The remaining issues include the grouping of demersal
seiners and demersal trawlers in the DCF, no clear agreement on appropriate length pre-
segmentation, and the possibility of a regional pre-segmentation, e.g., ICES and non-ICES area
fleets.
The workshop participants gave overall positive feedback on the additional new functions of the
FS-package. Especially the new functions making use of target assemblages were highlighted, as
they have the potential to foster comparability between economic and metiér-based biological eet
classication methods. The regional group work, which was specied in ToR3, comprised the
- 1 -
major work conducted during the workshop. A full list of the analyzed regions can be found in
Annex 4. Due to the composition of workshop participants, some regions were more well-
represented than others, i.e., a comprehensive analysis could be conducted for, e.g., the North Sea
and the Northeast-Atlantic, but not, e.g., for the Black Sea. Despite this imbalance in the analysis,
the overall feedback of the regional group analyses was positive, and regionally consistent eet
segments could be identified in all adequately represented regions. Nevertheless, more detailed,
extensive, and inclusive work in separate regional sessions is necessary for an adequate regional
analysis of shing eets.
In conclusion, we highlight the necessity of a standardized workow to classify and name eet
segments. In addition, an articial intelligence analysis approach developed in cooperation with the
technical university of Kaiserlautern signicantly improves the applicability of the FS-package. A
corresponding publication is in preparation, and the AI approach needs to be included in the terms
of reference of a future eet segmentation workshop. All considerations and discussions of
reforming the DCF eet segmentation procedure need to take into account the criteria of the
suitability of a eet segmentation procedure. We identied a close connection of segments to
sheries, a homogenous cost structure of the resulting eet segments, and the feasibility of the
approach as the most important.
In the following, we present the Terms of Reference.
Terms of Reference
The overarching aim of our workshop was to test the amendments made to the newly developed
approach to the segmentation of shing eets. This approach was created in a DFC pilot project
of the Thünen Institute between September 2019 and December 2020 and transferred to an R
package. After testing the approach in cooperation with various national partners, a workshop on
the novel approach was held from March 29th to 31st, 2021. The results of this rst workshop led
to the following ToRs:
1. Harmonize the data preparation and develop a standardized protocol.
2. Clarify the application of the eet segmentation approach and evaluate newly developed
tools.
3. Establish regionally consistent eet segments over multiple member states operating in the
same shing regions.
Background
Under the Data Collection Framework (DCF), economic data of European fishing eets have to
be provided by eet segment. The current eet segmentation scheme is based on the vessel length
class and the main shing gear, which are both technical parameters of the vessels. This
segmentation method is well dened and easily applicable, but it does not adequately represent
- 2 -
target sheries. In addition, the length classes used for the classication often fail to group vessels
of matching operational scales correctly. Vessels with similar technical parameters are often active
in varying sheries that differ in terms of catch composition, shing activity, and cost structure.
To improve reporting with respect to individual target sheries, a transferable, systematic approach
based on multivariate statistics methods was developed in the pilot project 'Fleet Segmentation' at
the Thünen Institute for Sea Fisheries in Bremerhaven, Germany, from September 2019 to
December 2020. The statistical framework was transferred to a user-friendly function package for
the R statisticssoftware, which was tested by multiple partners in the developmental process. From
the 29th to the 31st of March 2021, the rst workshop on the novel approach to eet segmentation
was held. The workshop was well attended by national sheries experts, and based on the
discussions, feedback, and suggestions gathered, the ToRs for the second eet segmentation
workshop were formulated.
Statistical Framework and Technical Amendments
A detailed description of the statistical background of the alternative eet segmentation approach
can be found in the report of the rst eet segmentation workshop. For an overview, please see
gure 1. This section will focus on the amendments made to the approach prior to the workshop.
First and foremost, the R package containing the novel approach was made available in a public
programming repository, GitHub. This is in accordance with one major conclusion of the rst
workshop. Publishing the package makes it available to all potential users, simplies the
implementation of updates, and creates a citable source for reports and publications. On the
technical side, rstly, the standard distance measure used in the clustering algorithm was changed
from the Euclidean distance to a metric conversion of the Bray-Curtis dissimilarity. This distance
measure is less sensitive to pairs of zeros in the data. It, therefore, creates more balanced clusters
and helps reduce the abundance of single-ship clusters, which tend to be a problem, especially in
large, heterogenic data sets.
As suggested in the rst workshop, a function allowing standardized data preparation was created.
The user no longer needs to create multiple specific data sets, only one comprehensive data set is
needed from which all necessary subsets of data are created. This data preparation function and
possibilities for improving it further will be discussed in the next section regarding ToR 1;
additional newly developed or updated functions aiding the applicability of the package will be
discussed in the section of ToR 2.
- 3 -
           
ToR 1. Harmonize the data preparation and develop a standardized protocol
The hurdles in installing the FleetSegmentation R package from a .zip le that were observed in
the rst eet segmentation workshop illustrated the necessity of having the package publicly
available. Hence, the FleetSegmentation R package was publically released on the GitHub
repository in August 2021. As a consequence, updating the package has become considerably easier.
However, installing a package from a non-quality checked repository like GitHub can be an
obstacle for inexperienced users. Therefore, publishing the FleetSegmentation package in the
official, quality-checked R repositoryCRAN (The Comprehensive R Archive Network) is the next
step in improving availability and ensuring a high code quality that meets scientic standards.
During the rst eet segmentation workshop, the ICES stock database used to create the distance
matrices for the clustering algorithm was extended by stock lists of the ICES and the GFCM. This
and the commonly agreed thresholds for automatic creation of species-area combinations
considerably improved the clustering result. Nevertheless, especially the stock list of the
Mediterranean Sea appears to be still incomplete and requires validation from experts of
Mediterranean Fisheries Management.
The function to automatically assign stocks to the national data sets was nested in a new function
that was created to harmonize the data preparation procedure. Thisnew function requires only one
comprehensive data frame, including all necessary eet variables (vessel ID, main gear, species,
area, catch weight), and computes all data frames for subsequent analysis per gear class. While this
step reduces the preparatory work and the potential for errors, the pre-segmentation by main gear
class requires further elaboration. In the second eet segmentation workshop, most participants
used the DCF main gear, as an alternative gear classication was not yet agreed upon. This DCF
main gear is assigned based on gear use time, and any gear being used in more than 50% of the
shing time is considered the main gear. This not only conceals patterns of polyvalent gear use but
also classies some very heterogenic shing gears, specically demersal trawlers and demersal
seiners (DTS). Based on our observations of shing patterns and cost structure, we highly
- 4 -
recommend separating demersal trawlers and demersal seiners in the DCF just like pelagic trawlers
(TM) and pelagic purse seiners (PS). Regarding polyvalency, we emphasize the necessity to consider
polyvalent gear classes like polyvalent active gears (MGP), a mix of active and passive gears (PMP).
Thresholds for assigning polyvalent gear use based on time of use need to be agreed upon by
the experts of the STECF.
Especially the large, heterogenic eets of major shing nations analyzed in the workshop elucidated
the necessity of additional pre-segmentation measures. In these cases, analyzing vessel groups
based only on the gear classes might lead to the grouping of vessels with very different technical
parameters and cost structures, especially when these vessels are targeting widely distributed sh
stocks. Participants tested various strategies of pre-separation during the workshop, e.g., size-based
pre-separation of passive geares over and under 12m vessel length or area-based pre-separation
between vessels that operate in non-ICES areas and those that do not. The different measures of
pre-separation are currently collected from the participants, and surveys on which of those
measures appear to be most promising will be held. It might be advisable that member states with
large, diverse eets introduce additional characteristics to further segment the eet without
initiating condentiality issues.
Conversely, it was suggested to group different active gears or passive gears where necessary in a
way that there is evidence for a comparable cost structure within that group. One idea was to test
grouping all passive gears, all active gears, and all dredgers. However, this has to be further
analyzed. Moreover, it was suggested to apply a pre-segmentation based on the technical equipment
of the vessels, e.g., net drums, beams, net hauler, line hauler, etc. It is intended to include several
pre-separation protocols in an updated version of the FleetSegmentation package.
ToR 2. Clarify the application of the eet segmentation approach and evaluate newly
developed tools
Both the newly developed and the updated functions were generally reviewed positively by the
participants, especially by those who were familiar with the FleetSegmentation package from the
rst workshop. As suggested in the rst workshop, the bar plot displaying the stock-based catch
composition of the clusters (clustering_stockshares_plot) was added with the option to display the
number of vessels in the corresponding clusters. This feature helps identify major shing strategies
and supports the user in decision-making on which clusters should be joined. The generic plots
displaying cluster catch composition and technical variables (clustering_stockshares_plot and
clustering_plotgrid, as well as all sub-plot functions included in the latter) were updated with an option
to subset the plot to a selection of clusters to give additional advice for cluster joining and segment
identication. This helps the user not only in getting an overview of the cluster features but also in
saving results in a more structured and clear way. In addition to those updates, the target
assemblage was introduced as a new feature for analysis. It is now possible to create a bar plot
(clustering_assemblageshares_plot) and a multi-dimensional scaling (clustering_assemblage_MDS) of the
target assemblages linked to the species caught by vessels. The assemblages help identify vessels
with similar shing strategies (e.g., vessels targeting small pelagic sh), especially if the user is not
familiar with all species codes and stock units. Since ICES alone considers 269 stock units and
shing eets generally target several hundred species, this is quite common. Including the
assemblages in the segmentation procedure has the additional advantage that resulting eet
- 5 -
segments are more comparable to the métier-based biological analysis, where the target assemblage
is included as a feature of the six-level métier denition. A complete list of the target assemblages
can be found in Annex 3.
ToR 3. Establish regionally consistent eet segments over multiple member states
operating in the same fishing regions
The primary focus of the second eet segmentation workshop was the regional group work to
identify regionally consistent shing strategies and, ultimately, also eet segments between multiple
shing nations. Major shing regions considered relevant for the participant nations were the Baltic
Sea, the Black Sea, the Eastern Arctic, the Mediterranean, the Northeast Atlantic, the North Sea,
and Other shing regions (e.g., NAFO areas or Southern Atlantic). Due to the fact that not all
European shing nations were represented in the workshop, some regions were more diversely
analyzed than others. All available regional plot sheets and participant case studies can be found in
Annexes 5.1-5.4. The regional group work in the Mediterranean was mainly characterized by the
comparison of the Maltese and Greek shing eets. For the Baltic Sea analysis, eets from
Germany, Finland, Sweden, Denmark, Lithuania, and Estonia were compared. In the North Sea
region, analysis was carried out for shing eets from Germany, Denmark, the Netherlands,
Sweden, and France. The Northeast Atlantic shing eets hailed from Portugal, Spain, France,
Ireland, and Germany. In all shing regions, consistent eet segments could be, to some extent,
identied among shing nations. Of course, identifying targeted and distinct fishing patterns like,
e.g., the Brown Shrimp shery in the North Sea is less challenging than comparing diverse,
heterogenic segments like mixed demersal sheries. Yet, the regional group work highlighted that
different shing nations are in many cases engaged in comparable, regionally consistent shing
activities and that those activities can be identied using the novel eet segmentation approach.
We, therefore, emphasize the necessity for additional, detailed analysis of regional shing patterns,
ideally in regional subgroups. This analysis will help not only to identify major European shing
eets but also to nalize the application protocol of our alternative approach to the segmentation
of fishing eets.
Overall workow perspective
The alternative segmentation package is, in general, a necessary step but not entirely sufcient for
a regular segmentation routine. The main purpose of the segmentation package is to provide
support in dening and describing alternative, sheries-related segments. This step is a manual one
that requires expert knowledge. Expert knowledge is needed for pre-segmenting the eet and for
checks if the clusters generated are appropriate or can/need to be combined or split.
Thus far, participants were, in many cases, able to derive alternative eet segments from the output
of the segmentation package more or less intuitively. However, general principles for these steps
still need to be elaborated. The characteristics of alternative segments should be clearly
documented, e.g., in a table. The description should include
- 6 -
Segment name
Number of vessels
Main gears
Main stocks or stock assemblages targeted
Vessel size range
Engine power range
DCF segments included
Effort and/or landings measures were regarded as informative
Once the eet has been segmented for one year, this procedure should be repeated for 2-4 more
years. The results should be checked for consistency. Where necessary, segment denitions and
descriptions can be adjusted. For transparency reasons, it is advisable to cross-check the result of
this step within regional working groups.
As soon as appropriate segments are characterized, the raw data will be used in an Articial
Intelligence (AI) package, which can identify patterns in vessel, logbook, and landings data and
then assign vessels to alternative segments automatically that were dened in the aforementioned
way. The AI package was developed by Verena Dully and Prof. Thorsten Stoeck of the Technical
University of Kaiserslautern. The results of this tool are highly convincing, and a corresponding
publication is in preparation. The AI tool will be part of a future workshop.
Further considerations criteria for evaluating the suitability of segmentation principles
The existing segmentation approach, which is based on the dominant gear class and vessel length,
has been in place for several years, with an introduction of two additional length class thresholds
(10m and 18m) and a split of one gear class (PTS to TM and PS) in 2008 and a shift in length
threshold from 10m to 8m in the Baltic in 2022. Length classes are easy to derive from the eet
register, and the main gear is usually determined using logbook information, where available. This
approach has been used for many years and has proven to be feasible for all eets. Still, the only
application of eet economics data with the same resolution as provided in the DCF has been the
Annual Economic Report. Hence, the same resolution is used in the "Balance Report".
However, sheries management often requires eet economic data at a different resolution. One
way to address this mismatch is to re-allocate economic data proportional to, e.g., effort or landings.
In order to overcome this auxiliary step, the existing segmentation principle should be reviewed
with a focus on linking segments more closely to stocks or target assemblages.
It appears desirable to aim at keeping the total number of segments similar to the DCF
segmentation. If this is the case, the approach with a closer link to stock is to be preferred. If
deemed necessary, the number of segments can be increased, though. Even in that case, the
sampling effort might not need to be increased as alternative segments can be less heterogeneous
- 7 -
in cost structure and thus require lower sampling rates per segment. In order to compare an
alternative segmentation approach with the existing (DCF) one, several criteria should be taken
into consideration. The following criteria should be taken into account but do not claim to be
complete:
     
Any alternative approach should aim for a closer link of segments to stocks or groups of stocks.
This aim can compete with the requirements of data condentiality and potentially with cost-
effectiveness. The closer the link to specic sheries, the higher the number of segments and the
smaller the number of vessels per segment. If segments get too small, data might become
condential.
   
The need for an amendment of the current DCF eet segmentation is based on the observation
that DCF segments, in some cases, combine vessels with different cost structures. Hence, any
alternative approach should result in segments with a more homogeneous cost structure. Like the
DCF segmentation, any segmentation will combine vessels with a broad range of landings and
effort. These characteristics are therefore not in the focus of an analysis of homogeneity. Likewise,
cost items should not be analyzed based on absolute values but rather on indicators or proxies.
E.g., if a group of shrimp beam trawlers has a broad range of fuel cost per year, this will be mainly
due to a broad range of shing days and probably engine power. Therefore, the indicator fuel
cost/kwday might be appropriate to describe homogeneity.
  
The segmentation procedure has to be clear, doable without excessive extra burden, and repeatable.
  
It is desirable if the segmentation is compatible with any existing time series. However, a time series
is only as good as the information that comes with it. The link between the current DCF
segmentation and particular sheries can be quite loose. In this case, the time series is not very
informative in terms of fisheries management. If the link between the DCF segment and certain
sheries is close, then the link to the alternative segmentation is close as well, and the time series
is more or less stable. It has to be borne in mind that the recent EUMAP legislation has introduced
a length class threshold at 8m instead of 10m for the Baltic Sea, thus introducing a break in time
series anyway.
- 8 -
Annex
1. List of participants
2. Workshop agenda
3. List of target assemblages
4. List of analyzed regions
5. Case studies
5.1. Finland
5.2. Greece
5.3. Romania
5.4. France
5.5. Germany
    
              
Name Institu�on Nation
  
     
      
    
 

       
     
  
 

       
        
        
      
    
       
         
        
 
      
  
          
 
    
 
    
  
          
- 9 -
5.6. Portugal
      
     
        
       
 
        
   
           
        
     
     
 

       
     
  

       
        

   
        
 
        
 
        
 
      
  
        
        


        
 
      
        
  
      
 
- 10 -
Second workshop on a novel
approach to the segmentation of fishing fleets
Agenda*
* all times are in CEST and might be changed in the progression of the workshop
Monday
10:00 Welcome and housekeeping
10:30 Brief introduction to the current fleet
segmentation system and its issues (Jörg
Berkenhagen)
11:00 Summary of the last fleet segmentation
workshop and ammendments made to the
fleet segmentation approach (Erik Sulanke)
11:30 Data preparation seminar and getting started
13:00 Lunch break
14:00 Regional subgroups forming and running
national analyses, if missing
17:00 End of the first day
Tuesday
09:30 Regional subgroup work
13:00 Lunch break
13:00 Regional subgroup work cont.
16:30 Impressions from data work
17:00 End of the second day
Wednesday
10:00 Summary and accomplishments of data work
day (Erik Sulanke)
10:30 Regional subgroups present their results
13:00 Lunch break
14:00 Experience presentation, pt. II
15:00 Key results and implications for the future
16:00 End of the workshop
   
- 11 -
   
             
Target assemblage - code Target assemblage Species (example.)
      
     
      
      
      
     
     
      
     
     
    
   
Region Extent / Remarks
      
 
 

 
      
            
- 12 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 1/13
Alternative Approach to Fleet Segmentation
Workshop report - Finland
Antti Sykkö, Natural Resources Institute Finland (antti.sykko@luke. (mailto:antti.sykko@luke.)), Joonas Valve,
Natural Resources Institute Finland (joonas.valve@luke. (mailto:joonas.valve@luke.))
2022-04-20
Experiments with The FIN 2020 catch data
The following clustering analysis was carried out via eetSegmentation() -package and with the 2020 catch data by Finnish trawlers ( )
and vessels which used passive gears (n). The average vessel lengths in catch data were 25 and 6 meters for trawlers and vessels
using passive gears, respectively (Figure 1).
The pelagic trawlers gear class TM consist of trawlers, which mostly target Baltic herring (Clupea harengus membras) in the Northern Baltic
Sea, with additional target species being sprat (Sprattus sprattus), smelt (Osmerus eperlanus) and vendace (Coregonus albula). Other species
are caught in minor amounts and, such as Herring, are mostly sold as sh fodder. All the vessels sh in the Baltic Sea, (27.d), with majority of
the shing occurring in sub-areas 27.d.32,27.d.31, 27.d.29 and 27.d.30.
The passive gear class PG involves small coastal vessels targeting their shing to, e.g., Baltic herring (Clupea harengus), atlantic salmon
(Salmo salar), pikeperch (Stizostedion lucioperca) and European perch (Perca uviatilis), which are economically the most important target
species for the coastal sheries. The vessels using passive gears sh in the Northern Baltic Sea, mostly in Gulf of Bothnia and Gulf of Finland at
the coast of Finland.
Figure 1: The length of ships in Finnish eet in 2020 in two gear classes - Passive Gears (PG) and Pelagic Trawlers (TM).
Pelagic trawlers - TM
We utilized the segmentation_datapreparation() -function for trawler data preprocessing. The amount of catch in kilograms was chosen as the
catch measurement. We exploited numberclust_table() and numberclust_plot() to receive statistics related to the optimal k, and visualized
the suggested clusters via numberclust_dendrogram() and numberclust_clustree() .
n = 4 3
=1245
- 13 -
    
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 2/13
Figure 2: Visual diagnostics for optimal number of clusters in TM catch data.
The initial analysis were performed in order to nd the optimal number of clusters k. Based on statistics (Avg. Silhouette score for
, Mantel score for ), the optimal choice would have been either or . The visual diagnostics (Figure 2) show
that elbow suggests to place ksomewhere between four and eight, silhouette proposes two or three while Mantel encourages to try between
four and seven. However, since we only had 43 vessels and we knew beforehand that the vast majority of herring would be shed, we wanted to
examine the clustering with a sufciently large k. Thus, we chose for further analysis.
Figure 3: Clustering dendogram with range-method for pelagic trawlers
=0 .699
k = 2 = 0 . 8 4 k = 6 k = 2 k = 6
k = 8
- 14 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 3/13
Figure 4: Clustering tree for TM trawler data
TM with
The clustering analysis was performed with number of clusters . We deployed segmentation_clustering() with thee catch data of
trawlers and examined the visualizations provided by clustering_stockshares_plot() ,clustering_assemblageshares_plot() ,
cluster_assemblages_MDS() and clustering_plotgrid() . The results were in line with our initial thoughts as of the vessels
were placed into one, herring-weighted, cluster.
Figure 5: Stockshare plot of TM catch data (n=43)
k = 8
k = 8
31/43≈70%
- 15 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 4/13
Figure 6: Assemblageshares plot of TM catch data (n=43)
Figure 7: Assemblages MD5 plot of TM catch data (n=43)
We looked the properties of the vessels in each cluster (gure 8) before nailing down the nal clustering. While the clusters 1,2,3,6,7 and 8 are
not similar in terms of catch composition (gure 7), they are very similar in terms of vessel diagnostics. We can see that with respect to vessel
length, annual catch per ship and annual catch per cluster, our choice to combine the previously mentioned group of clusters is well justied.
- 16 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 5/13
Figure 8: Diagnostics and statistics of TM catch data vessels (n=43)
TM Final clustering
Final clustering for pelagic trawlers was formed by combining the initial clusters 1,2,3,6 and 7 together and leaving clusters 4 and 5 as they were
suggested by the algorithm. We identied three different clusters.
Herring cluster having 31 trawlers of length between 20-40 meters and targeting their shing heavily on herring (initial cluster 4, gure 9).
Herring & Sprat cluster, which contains 5 relatively large vessels targeting on herring and sprat (initial cluster 5, gure 10).
Mixed cluster holding 6 vessels with length less than 20 meter. These vessels sh a quite of a number of different species, none of which
seems to surpass the others (initial clusters 1,2,3,6,7 , gure 11).
It is interesting to notice that these three nal clusters are in practice identical when comparing the nal clusters dened in these analysis to the
nal clustering determined in previous workshop (2021) for .
k = 1 0
- 17 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 6/13
Figure 9: TM Cluster - Herring
Figure 10: TM Cluster - Herring + sprat
- 18 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 7/13
Figure 11: TM Cluster - Mixed
Passive Gears - PG
The passive gear class analysis was implemented by rst running the segmentation_datapreparation() -function with PG data ( )
followed by the actual clustering analysis. We used the amount of catch in kilogram as a cath measurement. The diagnostics and statistics were
then examined in order to choose kfor further analysis. Based on elbow and silhouette means (score for and ), we can
see that, overall, the optimal number of clusters seems to be either or (gure 1). However, Mantel test makes an exception
suggesting to try clustering with clusters (highest score for ). Moreover, we also deployed numberclust_clustree() -
function to create a clustering tree to provide additional visualizations.
Figure 12: Visualization - Estimated number of clusters for passive gear (PG) class.
n = 1 2 4 5
0 . 3 8 8 k = 6 k = 7
k = 6 k = 7
k = 1 5 0 . 7 4 9 k = 1 5
- 19 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 8/13
Figure 13: Clustering three with PG data
PG with
We chose clusters for further analysis. We utilized passive gear catch data and performed clustering with segmentation_clustering() -
function. We started our experiments by looking the stock shares and assemblage shares over the seven clusters. The initial results were found
quite promising, as we can see (gure 14) that there is relatively clear distinctive specie representation among the clusters.
Figure 14: Stockshares of Passive Gear data with n=1245 and k=7
k = 7
k = 7
- 20 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 9/13
Figure 15: Assemblage shares plot of PG data
We utilized cluster_assemblages_MDS() and clustering_MDS() functions to gain more information of the cluster candidates. From the left
hand side (of gure 16) can be seen that cluster 2 stands out from the crowd, but clusters 5,6,7 and 1,3,4 appear to form relatively similar
groups. The same phenomena can be seen from the right hand side (gure 16)). That is, it looks like there is a major overlap among vessels.
However, if we look at the three dimensional plot (gure 17) and rotate it appropriately, we can notice that the ostensible overlap is just a result
of visualizing the ships in two dimensional space.
Figure 16: Cluster assemblages and clustering MD5 of PG gear class - Multidimensional scaling to two dimensions.
cluster 7
cluster 6
cluster 5
cluster 4
cluster 3
cluster 2
cluster 1
- 21 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 10/13
Figure 17: Clustering MD5 with PG catch data - Multidimensional scaling to three dimensional space
Then we looked at the properties of the clusters in terms of the vessel length, cluster size and annual catches (gure 18). From the annual catch
point of view, it is interesting to see that the biggest share of catch is landed by cluster 2 (gure 18) and the vessels in this cluster are the
longest. This notable amount of catch is natural, as the vessels in cluster 2 are targeting herring, and herring forms undoubtedly the largest
share of annual landings among Finnish sheries.
The vessels in clusters 1,3,4 and 5 have approximately equal lengths, but the annual catch in cluster 1 is considerably larger. However, this is
clearly explained by the number of vessels in cluster 1 versus other clusters. Furthermore, it is interesting to see that there are some outliers in
clusters 1 and 3 in terms of ship length. Thus, we can notice that, from the similarity of shing activities point of view, the vessel length is not a
one classication feature above the others yet it is an important factor in many extent.
Figure 18: PG Vessel properties over different clusters
PG Final clustering
We introduce the nal PG clustering in this section. Based on the previous diagnostics and analysis, the seven clusters provided by
eetSegmentation were categorized as follows:
Herring cluster holding in total of 84 medium size coastal vessels (initial cluster 2, gure 19), which use passive gears and target their
shing heavily on herring.
Salmon cluster having 128 shing small coast ships (initial cluster 3, gure 20) targeting mostly salmon but other species, e.g, european
perch, beam and roach, to some extent as well.
Northern pike cluster consists of 28 small coastal vessels (initial cluster 4, gure 21) targeting their shing primarily to northern pike and
secondary to burbot.
Smelt & Mixed cluster is by far the largest cluster with 986 vessels (initial cluster 1, gure 22) of which most are small coast ships. These
vessels catch mostly smelt, but also sh other species (bream, perch, roach) on side.
Mixed cluster involves 19 vessels (initial clusters 5,6,7, gure 23) of which 15 of them are targeting mostly unknown species and smelt.
Three vessels are catching cod heavily. Furthermore, this cluster has one ship shing solely ounder.
- 22 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 11/13
Figure 19: PG Cluster - Herring
Figure 20: PG Cluster - Salmon
- 23 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 12/13
Figure 21: PG Cluster - Northern pike
Figure 22: PG Cluster - Smelt & Mixed
- 24 -
29.4.2022 12.03 Alternative Approach to Fleet Segmentation
le:///G:/Luke2/Stat_kala_yleinen/PG_ECON_Fleet_segmentation_workshop_03_2022/eet_wsreport_FIN.html 13/13
Figure 23: PG Cluster - Mixed
Discussion
We found the eetSegmentation -package and the results provided by the package very interesting. The problems related to package’s wider
usage in ocial reporting and applications are most likely to be related to small cluster sizes (privacy and applicability of the analysis). In spite of
this, we encourage the further development and usage of the package. We think that utilizing this tool should not be restricted only to ofcial
reporting. Instead, this approach could have a potential analysis usage at those stages where, e.g., number of vessels is invisible to end users.
One example could be to examine the performance in imputation with, e.g., economical data - the traditional way to form stratum versus
clustering approach.
Some keynotes that were not considered in these analysis due to time limitations but good to take account further.
Value of landings Would we see more variety if we include the value of landings into the analysis. This would take into account whether
herring is used to feed or food production, which can lead to drastic differences in price and value. We are aware that value-option is
already available in the package, but reliable results would require some data processing in terms of value calculations. Unfortunately we
had no time to experiment this during the workshop.
Twin-trawlers It would be interesting to test if twin-trawlers could be identied as an independent cluster.
Merging PG data We know there are few small trawlers that report their catch with coastal landings report and are omitted from the data
used here. They should be included.
PG’s targeting Cod It would have been nice to separate cod sheries into their own cluster, but unfortunately the amount of vessels
targeting cod is relatively small.
PG shing on ice We have an additional PG data containing the shing without a ship ( ). This is mostly shing on ice, but these
were not analyzed in the workshop because of time limitations.
Finally, some suggestions for the future development.
Clustering features Some alternative clustering variables would be nice to include in the future. Our understanding is that currently the
package supports a xed list of clustering variables. Would be interesting to see what type of results we would get by using, e.g., a
combination of horse power and catch measurement.
Combining the clusters The strategy in combining clusters likely involves using the vessel properties at least to some extent. An
additional function, which would leverage the cluster combining automatically w.r.t. vessel properties would be an interesting extension to
the package.
n = 3 7 3
- 25 -
   
  
    
- 26 -
    
- 27 -
- 28 -
- 29 -
- 30 -
   
  
    
- 31 -
- 32 -
- 33 -
- 34 -
- 35 -
  

   
 
 
- 36 -
    
    
         
          
          
          
           
           
        
       
- 37 -
    
 
- 38 -
      
 
- 39 -
    
 
- 40 -
     
 
- 41 -
   
 
- 42 -
  
 
- 43 -
       
    
 
- 44 -
       
    
 
- 45 -
   
 
- 46 -
   

 
- 47 -
*Please note that this case study is an extract of the original Ifremer document:
Demaneche Sebastien, Guyader Olivier, La Grand Christelle, Merzereaud Mathieu, Vigneau Joel,
Quentin Laurent (2022). Alternative approaches to the segmentation of EU shing eets. Workshop
II - 28-30th March 2022. Previous experiences, tests for application in the French context and
recommendations. PDG-RBE-HISSEO, PDGRBE-EM, PDG-RBE-HMMN-LRHPB.
https://doi.org.10.13155/89336
- 48 -
*
    
- 49 -
- 50 -
- 51 -
- 52 -
- 53 -
- 54 -
- 55 -
- 56 -
- 57 -
- 58 -
- 59 -
- 60 -
- 61 -
- 62 -
- 63 -
- 64 -
Workshop on alternative approaches to the segmentation of the EU fishing fleets (II) - 28-30th March 2022. Previous experiences, tests for
application in the French context and recommendations. Page 58 sur 88
Figure 31 Flowchart proposal to define a set of rules to segment the EU fleet
- 65 -
      

        
 
   
              
                
                
               
              
               
    
    
     
      
      
   
     
   
    
    
     
    
    
    
    
    
     
     
   
     
    
     
      
  
             
              
              
                
            
            
                
                  
                
          
            
   
- 66 -
    
Segmentation of the Demersal Trawlers (DTS) operating in Iberian Waters
(ICES Divisions 27.9.a and 27.8)
Authors: Ana Cláudia Fernandes, Suzana Faria Cano
Description of analyzed population
The Portuguese Demersal Trawlers (DTS) have annual fixed licenses to operate with bottom otter
trawl and, considering the mesh sizes used, they can target mainly demersal species (65-69mm or
≥70mm) or crustacean species (55-59mm and ≥70mm). The areas considered for this group reflect
their fishing behavior: it is known that some vessels may operate in ICES Div 27.8.c (Spanish
waters) part of the year. The data used in the analysis is from 2021 and includes 90 vessels with
vessel length range between 9-34m (mean=23 m; median=24m).
In Portugal there are fisheries targeting species that are not assessed by ICES but are very
important at national level (e.g. Atlantic chub mackerel). In this work we removed the ICES stock
assignment limitation to enable the classification of all ‘species versus area’ as a new ‘stock’. The
high number of species*area present in the data (276) reflects the multispecificity of this fleet and
the possible difficulty in assigning the target assemblages to some of the vessels (or fishing trips).
The objective of this case study is to analyze the differences observed in the results, if we consider
different number of clusters in the data.
Analysis of the data
The analysis was performed using the ‘FleetSegmentation package.
The clustering diagnosis used to analyze the optimal number of clusters were the average
silhouettes, the Mantel test, the Davis-Bouldin index, the SD index and the Calinski-Harabasz index
(Figure 1). The optimal number of clusters varies between 2 and 15, depending on the indices
considered. Also the clustering dendrograms presented in Figure 2 support that 1 to 24 distinct
clusters are present in the data. These results reflect well the diversity of species assemblages
observed in this fleet. In our study we will start to consider the optimal cluster number obtained
from the average silhouette (6 clusters).
    
- 67 -
- 68 -
- 69 -
- 70 -
- 71 -
- 72 -
- 73 -
... Clustering of catch data : To arrive at a set of métiers for the Icelandic fleet, we performed a clustering analysis on the described data using the clustering methods of Sulanke and Berkenhagen ( 2022 ) to identify métiers (Baranowska et al. 2024 ). However, our methods deviate slightly from previously published methods by using Ward's method for hierarchical clustering (Ward 1963 ) and an Euclidean distance matrix. ...
... From here on, we will call these clusters métiers. While doing so, we used a set of algorithms to calculate the optimal number of clusters that summarized the data used in this analysis well, relying on the average silhouettes and Mantel test (Sulanke and Berkenhagen 2022 ). Due to computational memory constraints, we ran these algorithms for the optimal number of clusters for each year of the data separately. ...
... Due to computational memory constraints, we ran these algorithms for the optimal number of clusters for each year of the data separately. In combination with relying on these algorithms, we also studied the clustering tree and the data visually to arrive at a good number of clusters (Sulanke and Berkenhagen, 2022 ). ...
Article
Full-text available
Marine ecosystem-based management requires the understanding of species interactions and what species are harvested together. This study combines two major questions: the first regarding what drives the probability that a métier (species assemblages, with spatial distribution and seasonality) will be observed as catch, and the second regarding the level of control fishers have over this catch mix. To address these questions, we analysed highly resolved logbook records of an Arctic and sub-Arctic industrial demersal fishery operating in Icelandic waters. The study employs a multi-class random forest model to identify predictors of métier occurrence and consistency of predictions using a dataset of > 100 000 hauls over 4 years (20 16-20 19). The overall accuracy of the random forest model is 69-70%, indicating moderate predictability of catch mix based on known en vironmental, vessel, and compan y characteristics. We find that habitat-related variables (depth and temperature) are most important to predict catch mix. Still, company, trip, and vessel characteristics are also very important (e.g. vessel and trip length, distance to port). Beyond these more traditional bio-economic variables, important predictors include variables related to harvesting strategies, such as quota di ver sity and a vessel's mobility. These findings contribute to a fuller picture of fisher decision-making in mixed fisheries.
... The analysis followed the methods used to identify métiers in the Iceland bottom trawl fishery (Baranowska et al. 2024;Oostdijk et al. in review). Briefly, we performed a clustering analysis following a previously reported clustering protocol (Sulanke et al. 2022) to identify métiers. As preliminary analyses defined many clusters based on single hauls, we used Ward's method for hierarchical clustering (Oksanen et al. 2022;Ward 1963). ...
Technical Report
Full-text available
While Iceland has an advanced and well-developed fisheries management system most species are managed individually, that is without consideration for their role in and interactions with the whole ecosystem or with the fact that fisheries target multiple species in different areas and at different times of the year. But long-term sustainability of these economically and culturally important resources requires consideration of a broader system. End-to-end ecosystem models provide one way to account for complex interactions and evaluate how changes in one species influences other species. Here we continue parameterization of the Iceland Atlantis end-to-end ecosystem model to include multispecies fisheries for the most important groundfish fleets in Iceland. A métier analysis was performed to determine how the bottom trawl fleet targets species spatially and temporally. The Iceland Atlantis model was then parameterized with historic fishing effort to include five different bottom trawl fleets and one longline fleet which target five of the most important commercial species in Iceland: Atlantic cod (Gadus morhua), haddock (Melanogrammus aeglefinus), saithe (Pollachius virens), redfish (Sebastes sp.), and Greenland halibut (Reinhardtius hippoglossoides). The updated model will serve as basis from which management decisions can be evaluated in the context of other harvested species as well as the ecosystem as a whole.
Article
Full-text available
The appropriate segmentation of fishing fleets is controversially discussed in fisheries research and management and a variety of approaches has been introduced. The present approach, developed in a pilot study funded by the European Commission – Data Collection Framework (DCF), introduces a standardized multivariate approach for characterizing fisheries fleet segments by hierarchical agglomerative cluster analysis (HAC) of their catch composition. We chose data from 2021 of the Romanian fishing fleet as the basis of our analysis. Statistical analyses were performed using the program RStudio V3.6.1 by running the fleet segmentation package script. The specific indices, tests, and visual validation methods of the package were applied to determine the optimal number of clusters. The procedure was finalized by a post-hoc validation of the clustering result to identify the actual fleet segments. From the basic data, 6 fleet segments for Vessels using active and passive gears (PMP) were highlighted, representing 52 boats, where it was noted that fishing at Rapa whelk (RPW) prevailed with 91.94% of the total catches on the segment, respective 5 fleet segments for Vessels using passive gears (PG) only for vessels <12m, representing 78 vessels where the main catches or recorded at European anchovy (ANE) 25.16%, turbot (TUR) 13.08%, horse mackerel (HMM) 8.75%, thus 130 vessels from two classes of different gears in total. We detected mixed fishing, especially on various assemblages of demersal and pelagic fish, as well as target fishing on demersal and pelagic fish, Rapa whelk, and mussels. For a better understanding of the approach, further research is needed.
ResearchGate has not been able to resolve any references for this publication.