Customer Success
Customer Success |
Gloucestershire Constabulary Cleans UpImproving data quality for operational efficiency and freedom of informationIn the United Kingdom there is no central crime database, any more than there is a single monolithic police force. It is up to county and metropolitan forces to take care of their own data resources, and they all face their own challenges. Gloucestershire Constabulary, a UK police force with 1,280 officers (plus 786 civilian staff and 170 Special Constables), is responsible for policing 560,000 people in 270,000 hectares (approximately 1,050 square miles) of urban and rural countryside. Community policing is at the heart of Gloucestershire Constabulary and in today's mobile society that depends on accurate and up-to-date information. However, criminals do not respect county boundaries, so Gloucestershire Constabulary also owes a duty of care to the other 43 police forces of England and Wales, to ensure that the data in its records are accurate.
Freedom of information
Moreover, since 1 January 2005, Gloucestershire Constabulary has had new obligations to keep its data in good order under the Freedom of Information Act (FoIA). "These are all good reasons to invest in a solution that would address our data quality problems," says Reg Barnard, who heads the Constabulary's Information Systems Development team in Cheltenham, Gloucestershire. "The trouble is that data quality is a bit like a healthy diet and regular exercise: we're all in favor of it, but we don't always find it that easy to achieve. Every organization faces data quality issues, but in the police force, we have all the usual problems such as bad spelling and mis-keying, magnified by the fact that many of the people we are dealing with are less than entirely honest. Criminals give us all kinds of false information that can't be verified on the street, such as wrong date of birth or misspelled first names." Of course, many of these tricks have been around for years, but with electronic data systems, it should in theory be easier to cross-check records. In practice, however, it's not so easy when you have several incompatible legacy systems, each implemented to serve different purposes.
An expanding user community
Under the FoIA, public authorities have to produce a "publication scheme", and the Association of Chief Police Officers (ACPO) set up a national team to develop a model publication scheme for the 44 police forces. It involves delivering an open and transparent service whenever possible, publishing information that the public has an interest in viewing and making specific information available on request, so long as this does not prejudice ongoing operations.
The SAS solution
That's a view echoed by Nick Churchill, who spent 31 years as a detective and is now the Constabulary's Database Administrator. In 2003 he was given the task of shifting the data from the legacy systems into a data warehouse and making it available on the intranet. He chose SAS for a pilot project that would integrate three legacy systems: Unity, which is the core intelligence system containing data on crime and custody; VPFPO, which stores data on vehicle parking and fixed penalty offences; and the Domestic Violence database, which contains details of offenders and victims. "We did not make things easy for SAS. All three systems are implemented with incompatible software: Oracle, Ingres and SQL," says Churchill. All of the information extracted from the source systems is subjected to SAS data quality procedures to profile, cleanse and standardize the data, using match coding. Gloucestershire Constabulary found that with DataFlux dfPower Studio you can adjust the sensitivity on matching fields, such as name or street address. That way you can identify the biggest problems first and work downwards. It will quickly pick up obvious alternative spellings of the same name, such as Stephen and Steven or Uddin and Udden, but it will also pick up less obvious matches. For example, the surname Ferry might be accurately recorded on the street, but then wrongly entered as Perry back at the station. Likewise, car registration numbers might include one or two wrong digits. You would have a hard time finding such mistakes with other software, let alone manually.
Safety first
Consequently Gloucestershire Constabulary is using SAS to identify possible quality issues, but feeds the questionable records back to the owners for auditing and correction. A typical problem is wrong postcodes and addresses, which a police officer on the beat has no means of validating. It is relatively easy to identify mismatches, but then the question arises, is the postcode wrong or the street address? One way of isolating such issues is to match coding between the different data sources. "Much to our surprise, a quarter of the vehicles on the VPFPO system were also on the Unity system, meaning they were used in a crime of some sort, with or without the knowledge of the owner. Yet, very often, the least reliable and up-to-date information is on Unity, our main operational database. "SAS is very powerful when it comes to clustering the near-matches to help us correct data and bring it up to date, so it saves a huge amount of time and resources," says Churchill. "What's more, SAS Data Integration Studio is really easy to use, considering how powerful it is. The drag-and-drop technology means you don't need any specialist programming knowledge, just some basic knowledge of the data you are interrogating. In no time you are producing reports that will amaze senior managers." Police officers typically work in difficult circumstances where mistakes are inevitable; mistakes that ordinary database software will not identify, even when it is working properly. SAS offers the additional layer of intelligence that will now enable Gloucestershire Constabulary not only to spot individual mistakes, but also to correct procedures and identify training needs. "With poor quality data, we were wasting a lot of time trying to find out things we already knew. SAS is putting us on track to achieve the operational efficiency mandated in Vision 5 and to meet our obligations under FoIA. In practical terms, it could ensure that a police officer arrives at the right address when trying to apprehend a criminal," concludes Barnard. Copyright © SAS Institute Inc. All Rights Reserved. |
Gloucestershire Constabulary
Challenge:
Improve data quality and meet Freedom of Information Act requirements.
Solution:
SAS Enterprise Data Integration Server addresses data quality concerns and lays the foundation for business intelligence, providing more reliable and consistent data. “SAS is putting us on track to achieve the operational efficiency mandated in Vision 5 and to meet our obligations under FoIA. In practical terms, it could ensure that a police officer arrives at the right address when trying to apprehend a criminal. ” Reg Barnard Information Systems Development Manager Read more:
|