Seven ways MarineTraffic ensures AIS accuracy

This blog post was updated on 4/6/2021 by Sotiria Kontopoulou

The sheer amount of data generated by the global AIS network is truly overwhelming: more than half a billion positions are received daily from various sources around the world.

But as this comes from a huge range of aggregated sources with varying degrees of quality, from dedicated receiver stations to maritime enthusiasts and AIS satellites, this raw data flow may contain errors, incomplete information and contradictions. Even ships might be misidentified, jump backwards and forwards in time, or disappear altogether and re-appear.

To provide users with accurate and actionable data, MarineTraffic uses real-time filtering and analysis combined with post-processing. The chaotic raw data flow is filtered down to 30 million positions each day: one position per ship, per minute.

Below, we have listed the seven ways MarineTraffic ensures AIS accuracy:

1. Making sure all ‘junk’ data is removed

AIS information is transmitted every few seconds from each vessel by VHF radio signals which typically have an average terrestrial range of just 20-30 miles.

When they are picked up by receivers, packets may be incomplete or not properly formatted.

The first stage of when all data is tested for invalid AIS transmissions and through filtering is put in groups based on the type of invalid, corrupted, incomplete information it suffers from. Duplicate MMSI numbers are also identified and station restrictions also apply.

Then, as data errors are compiled, it is necessary that they be fully identified and grouped according to where and how often they occur.

2. Identifying each vessel correctly

The AIS system transmits each vessel’s unique identifier such as the International Maritime Organisation (IMO) number, the Maritime Mobile Service Identity (MMSI) number and the vessel Call Sign.

As ships change names or operators, update their AIS information or are decommissioned, MarineTraffic is proactively updating and removing any outdated or erroneous records to maintain an accurate database of over 500,000 AIS-equipped vessels, each with valid supplementary information.

A few years ago, MarineTraffic introduced the Ship ID scheme, which aims to provide a much more authoritative and unique identifier for each sea-going vessel. This has gained traction with the industry as users are increasingly asking for the MarineTraffic Ship ID over other common identifiers which can be duplicated or manipulated. Increasingly, the quality of this data set is all part of efforts to ensure the correct identity and position allocation to each vessel.

MarineTraffic data accuracy

3. Doing real-time spatial and temporal correlation analysis

MarineTraffic operates the largest AIS terrestrial receiving station network in the world.

In addition, MarineTraffic has data-sharing agreements with a number of third parties in order to extend the reach. This variety of collection methods, coupled with inherent AIS protocol deficiencies, affects the quality of data received.

Related: Building the world’s most comprehensive real-time vessel tracking service

AIS transmissions from the best providers can reach MarineTraffic servers almost immediately, while more remote stations with slower connections to the global network take longer to be received. AIS satellites sometimes wait hours before their data can be offloaded to the next ground station in range.

Combined with the fact that AIS timestamps are synchronised locally rather than globally, the MarineTraffic system can be presented with wildly ranging spatial and temporal information about each vessel. The system weighs the quality of each data source and attempts to create a coherent timeline in order to establish a clear picture of where each vessel is at a given time. Each vessel’s movement is constantly evaluated and if its speed, course or any other factor falls outside accepted boundaries, the data is flagged and automatically or manually removed.

4. Post-processing

With each stage of evaluation, the rules get stricter and stricter.

Post-processing is a series of functions carried out on a daily basis to catch any data that is not valid, yet still managed to pass through the real-time filters.

By this stage, the system is focused on special cases, of very small deviations not picked up by the real-time processing. These even more stringent tests make sure that no erroneous positions are present before the data is re-checked, re-classified and ultimately stored in the historical data set.

Over time, as these combined processes have proven their effectiveness and become even more sophisticated, MarineTraffic has been able to move from reporting accurate positions for each vessel every five minutes to two minutes and now once per minute – achieving a more granular level of information, without sacrificing data quality.

Taken together, this multi-stage process results in being able to produce increasingly accurate positions that are guaranteed to be true to the best of MarineTraffic ability.

How we turn raw feeds to cleansed, accurate & reliable data: a complex process in short

Pre-identification
  • In this stage, we test for invalid AIS transmissions,
  • we continue by filtering by tuples (eg invalid ship – type length combination),
  • we check for MMSI deduplication
  • and start the application of station restrictions.
Identification
  • Next, we proceed with the field-based identification &
  • the frequency-based identification.
Post-identification
  • In the third stage, we work on message deduplication,
  • geospatial checks (over-land, spoofing)
  • and archiving rules (downsampling).

5. Continuously evolving

MarineTraffic has always welcomed collaboration either by sponsoring new station owners or by exploring new partnership opportunities and interesting project collaborations around the globe.

“Targeted expansion is the key to get the most value from each new station installation,” says Anastassis Touros, AIS Network Team Leader at MarineTraffic.

Focusing on collaborations enables the team to keep expanding coverage and provide high-quality AIS data feeds and related intelligence to the world, playing a key role in making the shipping world more transparent and efficient through added visibility and awareness.

6. Shaping the future

To ensure AIS accuracy and efficiency throughout the years, MarineTraffic actively monitors and participates in protocol-related workshops and initiatives to be up to date with—if not drive— the evolution of AIS while moving into an era of interconnected communications and technological enablement which provide new capabilities in the near future such as VDES, extended radio frequencies etc.

7. Staying close to the community

The team of experts at MarineTraffic take feedback seriously and utilise it to keep improving all cleaning, filtering as well as identification algorithms to ensure that their work can only get better and better.

Our dedicated team continue to build a strong and healthy relationship with our AIS station owners by providing timely support, proactive communications on a regular basis, and by engaging them through hosting exclusive contests to ensure the retention and good health of our community.

Adding to the above-mentioned techniques MarineTraffic uses to guarantee the accuracy of all published position data, we value our contributors, our users and our customers and always encourage them to provide us with their feedback, as this enables our dedicated team to lead the way in terms of comprehensive, fast, efficient and accurate AIS data streams, always presented in the most comprehensible way possible.

New call-to-action