indmed3 (former version of indmed documention)

CALCULATION OF ABUNDANCE INDICES AND LENGTH FREQUENCIES IN THE MEDITS SURVEY

by Arnauld Souplet IFREMER 1, rue Jean Vilar F-34200 Sète March 1998

Introduction

The first exploitation of the MEDITS data collected during the 1994 and 1995 surveys has been made by calculating abundance indices by stratum and area and length frequencies by stratum. This paper describes the sampling scheme, the method used to calculate indices and finally the computer program.

1. The sampling scheme

The survey area has been divided into 11 areas, each of them being divided into strata. The definition of the areas has been made in reference to the GFCM statistical areas. The following table gives the list of these areas together with the number of strata in each.

Country	Area	Nb. of strata
Spain	111	5
	112	5
	113	5
France	121	10
France	131	10
Italy	132	20
	133	35
	134	15
	221	40
	211	10
Greece	222	20
Total		175

Due to the bathymetry of the continental shelf and of the slope, the strata have been defined using the depth lines and, when necessary, some lines perpendicular to the coast. The depth strata are : 10 - 50 m, 50 -100 m, 100 -200 m, 200 - 500 m and 500 - 800 m. In each stratum the sampling is made following a simple random scheme.

2. The abundance indices

2.1. Computation

The surfaces of the areas and the strata are well known. Furthermore, it is possible to know the surface covered by the trawl during each haul. Hence these surfaces have been used as a basis for the stratification. For the various measured values (total weight caught, total number, number by sex, etc.), the index has been calculated as follow (Cochran, 1977)

Notations :

A total surface of the area

N number of strata in that area

Ai surface of the stratum i

Wi relative weight of the stratum i in the area

ni number of hauls in the stratum i

Ai,j surface trawled by the haul j in the stratum i

fi sampling fraction in the stratum i

xi,j measured value in the haul j

with and

It is possible to choice between two options. The first one is to calculate for each haul a value by surface unit and to average those values over all the hauls made in the stratum :

This estimate is unbiased but it is possible to reduce its variance. Actually, the surface covered by the trawl is large compared to the micro-distribution structures. Therefore, the variance of the catch is proportional to the covered surface. In other words, the number of individuals or the total weight by unit of surface, which can be considered as a CPUE, has a variance in 1/A_i,j (F. Gauthiez - IFREMER Nantes, to be published). It seems then more realistic, especially if the hauls are of different lengths, to get estimates using weighted least squares with the following formulae

mean value of x by unit of surface in the stratum i :
Variance of the value in the stratum i :
Variance of the estimate of the mean :
Abundance index in the area :
Variance of that index :

The term (1- fi) could be neglected because fi is in general very small.

2.2. Practical problems encountered

To know the surface actually covered by the trawl, it is indispensable to know both the real distance covered by it on the bottom and the horizontal opening of the trawl, either the door spread or the wing spread. The distance can be measured with enough accuracy by navigation devices such as GPS and, for example the wing spread by SCANMAR. Nevertheless some vessels participating in the survey did not have these equipments

The covered distance can be calculated by a simple algorithm (Annex 1) from the shooting and hauling positions. Obviously this computation gives correct results only if the course during the haul has been rectilinear. This has to be assumed, even if it was not the case but, to avoid too big errors, it is important to make as rectilinear courses as possible.

As far as the wing spread is concerned, the data available after the 1994 survey show that there is an obvious non linear relationship between the warp length and the trawl horizontal opening, at least for warp length less than around 500 meters. For longer warp lengths the wing spread stabilize around 17 meters. To take this relationship into account, it has been decided to fit a von Bertalanffy function to the data provided by France, Italy and Greece. The asymptotic relationship so obtained has been applied when the wing spread was unknown. It is as follow (with S = wing spread and L = warp length) :

3. The length frequencies

These are calculated by stratum doing a simple sum of individuals caught by length class. For fish and cephalopods, the length class interval is 1 centimetre while it is 1 millimetre for crustaceans. For practical purpose, the span of these length frequencies has been limited to 100 length classes, i.e. 1 meter for fish and cephalopods and 10 cm for crustaceans. The total number caught in the strata is also calculated and this value is printed out together with the histograms.

4. The INDMED program

The program to perform the above mentioned calculations has been written in standard FORTRAN 77 Microsoft® and run under MS-DOS®. The version to be used now is Version 3.0, dated 02/96. It needs four reference files and the three data files by country/area.

4.1. The reference files

Two of them are common to all participants. The first one contains the list of species with their Rubbin code and scientific name (see Annex 2) ; it has been derived from the official LISTFM file used for the 1995 survey (506 species). Due to the rather small memory currently available under MS-DOS, it is not possible to extend it to much. Nevertheless, it is expected that computations will not be performed for new species added to the LISTFM file each year. However, if the user wants to run the program for one of these new species, it is possible to do it. In that case the user will be asked to give the scientific name of any species absent from the file. The file second contains only a list of species to be taken into account within a single run of the program. This file is normally limited to the 31 reference species but can be modified by any user following his own needs. The usual 31 species file is shown in Annex 3.

The two other files are peculiar to each participant in the survey. They contain respectively the list of the strata in each country/area with their respective surface and the list of the hauls with the stratum in which they have been made (see Annex 4 and Annex 5).

4.2. File names

To correctly use the program, the user has to be sure that the file names comply with the following rules :

Species file :		MEDIESP.DAT
list of species to be analyzed :		free but currently ESPECE.REF
list of strata :	char. 1-2	ST
	char. 3-4	country code
	char. 5-6	year
	char. 7-8	area (replaced by __ in case of one area for a country)
	extension	.DAT
list of hauls	char. 1-2	TR
	the rest	as the previous one
data files	char. 1-2	TA, TB or TC
	char. 3-4	country code
	char. 5-6	year
	char. 7-8	area (replaced by __ in case of one area for a country)
	extension	.TXT

Examples (country : France, year 1995) :

STFR95__.DAT

TRFR95__.DAT

TCFR95__.TXT

4.3. Running the program

The user is asked to give the country code, the year and the area code (if any). Then the program asks if the user wants to make the calculation either for a single species or for some of them. In the first case the program asks the Rubbin code of the species and in the latter case the name of the file with those species. If a species is absent from the species reference file, the user has to give its scientific name.

Then the user has to choice whether he wants to calculate length frequencies or abundance indices. The possible indices are given in Annex 6. In the case of calculation of indices, an other choice is between full or reduced output. The user has to give one output file name for a full output and two for a reduced output.

4.4. Results

4.4.1. Length frequencies

The output file names are fixed : the first character is H, the following seven are the species Rubbin code, the two first of the extension are the country code and the last one is the second character of the area code. Example : Results for Merluccius merluccius, country Italy, area M3 : HMERLMER.IT3

The files are in CSV (comma separated values) format and can be easily imported into any spreadsheet. An example is given in Annex 7.

4.4.2 Abundance indices

The output file names are free but it is recommended to use easy-to-remember names. The two types of file (full and reduced output) are simple ASCII files which can be easily imported into any word processor. The full output file contains two separate tables by area. The first one gives the detailed results by stratum and the second the results for the whole area, the shelf and the slope separately. In the case of a reduced output, the first file contains the same table as in the previous versions of the program giving the abundance index and its coefficient of variation by stratum and for the whole area and the second file contains a table giving index and coefficient of variation for the whole area, the shelf and the slope. Examples of these files are given in Annex 8 and Annex 9.

5. Further developments

The INDMED program could obviously and quite easily been extended to other computations from the MEDITS data files. Being written in a standard and normalized language, its subroutines can been incorporated in new programs.

However some theoretical points could be raised. Is the calculated index relevant ? Is there a better index ? For the time being, the surfaces of the strata are estimated from maps following a ground plan. It could be better to try to estimate the actual surfaces by taking into account the various slopes. These question should be addressed to the Group of Co-ordination of the MEDITS program and/or to some of the Working Groups which have been set.