This is the final installment of the Data Loss Prevention (in a rational, sane way without new blinky boxes) series. In this post I'm going to bring up one of the most interesting topics (at least to me) which is gathering actionable intelligence from all of your existing investments. Since you probably have at least 100 devices across your network generating security log information - not to mention other types of useful bits - it's imperative that as you think about doing DLP you utilize this wealth of existing information available to you... or is it?
Remember - mountains of information being generated by security devices is only useful if you can transform it into actions which can increase the security posture of your organization.
What's a SIRM?
SIRM stands for Security Information and Risk Management, and it's typically served up as a platform of technologies, and can be consumed as an in-house product or service. SIRM is the continuing evolution of the SIM platform that many years ago started as a log aggregator which of course did not one any good because no one I know had any time to actually do anything with it. Why the new term SIRM? Simple - the industry needs to evolve into risk management beyond just the traditional security event management. In short, there is more to your organization than what the firewall and IPS generates. While lots of events may mean an influx in attack, or simply noise ... it does not adequately express or correlate business risk.
You see, today's security dashboards and consoles focus on exactly that - security - and security tends to have a very myopic view of the enterprise. Security tends to care about bad things happening, and for good reason too! If the security team were to start looking at the totality of organizational "events" the odds of an information security team being overwhelmed in seconds is a a sure bet.
So here we have the crux of the issue with data loss prevention - too much information to process in a meaningful way without advanced insider knowledge of your specific organization. DLP is a Catch 22, because you never know what you're looking for (or in what format) to tell the systems you have in place to look for it. If you knew what you were looking for, you wouldn't need the big fancy systems to look for it ... so this gets complicated and SIRM technologies combined with some good 'ol fashioned brain power can actually rescue you from drowning.
Finding a needle in a stack of needles
The difficulty in DLP is that you're looking for patterns that range from obvious to downright 007-style sneaky. What I mean by this is that sometimes you're looking for the accidental email that sends out a boat-load of social security numbers, while other times it's a trickle of events that alone don't raise suspicion but are exfiltrating data from your organization.
There are really 3 main questions when you're thinking about the mountains of information you have at your fingertips for the purposes of avoiding leaking data from your organization. Often times, when I've seen security teams simply "dive in" to a DLP effort it turns into an exercise of trying to find the one needle they're looking for not in a haystack, but in a haystack of needles. Information can be our biggest asset, and our greatest adversary when we're looking at preventing data loss. On one hand you have information being generated (in the form of events) on every single piece of hardware and software in your organization. Starting at the badge readers at the front door, to the access terminals (PC, laptop, mobile device or terminal), to the software - every kind of software - there are billions upon billions of events being generated every day. This mountain of information is a fantastic asset - that is until you start to think about how you're going to process those events and figure out what they mean in real-time. You see, with the way that business moves these days, you don't get the luxury of running a log analysis engine overnight to figure out that you've had information stolen yesterday - you need to be able to do this in real-time (or very nearly real time). The challenge of course is if you turn the logging knob to maximum and point it at your log aggregator (or SIRM if you've got a copy of ArcSight [or some other SIRM platform] sitting around humming) things tend to go ka-boom quickly. So on one end you have this wealth of information and on the other is those few events when strung together which tell you something is going wrong right now.
So here we go, let's take a look at how you can find the right needle, in that haystack of needles...
Now you're reading this wondering - how you can possibly implement this type of system without buying one of those "solutions" that comes in 4 rack-mountable 2U boxes right? Odds are you've got a SIM or SIEM or maybe if you're lucky one of the more advanced SIRM platforms already in-house. Leveraging those platforms, and building out capability is more important than probably anything else you'll do, and brings together everything else we've talked about so far. Knowing where your critical assets are, how they traverse your business platforms, and how your users use them is the key to plugging the holes in the boat before it sinks. You can do this ... just don't buy into the hype around DLP and understand it's like anything else - baby steps until you have a working system.
Good luck!