Need advice about which tool to choose?Ask the StackShare community!
Splunk Cloud vs Stackdriver: What are the differences?
Introduction: When choosing a logging and monitoring solution for your cloud environment, it's essential to understand the key differences between Splunk Cloud and Stackdriver. Both platforms offer unique features and capabilities that cater to different needs, making it crucial to compare them before making a decision.
Integration Capabilities: Splunk Cloud provides seamless integration with various third-party systems and applications, allowing for easy data ingestion from multiple sources. On the other hand, Stackdriver is tightly integrated with Google Cloud Platform services, providing native monitoring and logging solutions for GCP resources. This makes Splunk Cloud a better choice for organizations with diverse cloud environments, while Stackdriver offers a streamlined experience for GCP users.
Data Retention and Storage: Splunk Cloud offers flexible data retention options, allowing users to store and analyze logs for an extended period. Stackdriver, on the other hand, has limitations on data retention and storage, making it more suitable for real-time monitoring and short-term analysis. Organizations that require long-term log retention may prefer Splunk Cloud for its scalability and storage capabilities.
Machine Learning and AI Capabilities: Splunk Cloud provides advanced machine learning and AI capabilities for anomaly detection, predictive analytics, and root cause analysis. Stackdriver also offers machine learning features for log analysis and monitoring, but its capabilities may be more limited compared to Splunk Cloud. Organizations seeking robust AI-driven insights may find Splunk Cloud more suitable for their needs.
Customization and Extensibility: Splunk Cloud offers extensive customization options, allowing users to create custom dashboards, alerts, and reports tailored to their specific requirements. Stackdriver provides a more standardized monitoring experience with predefined metrics and alerts, limiting the level of customization available. Organizations looking for highly tailored monitoring and reporting solutions may lean towards Splunk Cloud for its flexibility.
Cost and Pricing Model: Splunk Cloud follows a consumption-based pricing model, where users pay based on the volume of data ingested and stored. In contrast, Stackdriver offers a more predictable pricing structure, with pricing tiers based on the features and capabilities used. Organizations with fluctuating data volumes may find Splunk Cloud's pricing model more cost-effective, while those with consistent monitoring needs may prefer Stackdriver's predictable pricing.
Community Support and Ecosystem: Splunk Cloud boasts a large community of users, developers, and partners, providing a robust ecosystem for support, plugins, and integrations. Stackdriver, being a Google Cloud Platform service, offers strong integration with GCP services and Google's wider ecosystem. Organizations looking for a vibrant user community and extensive plugin support may opt for Splunk Cloud, while those heavily invested in the Google Cloud ecosystem may choose Stackdriver for its seamless integration.
In Summary, understanding the key differences between Splunk Cloud and Stackdriver is crucial in selecting the right logging and monitoring solution for your cloud environment.
We would like to detect unusual config changes that can potentially cause production outage.
Such as, SecurityGroup new allow/deny rule, AuthZ policy change, Secret key/certificate rotation, IP subnet add/drop. The problem is the source of all of these activities is different, i.e., AWS IAM, Amazon EC2, internal prod services, envoy sidecar, etc.
Which of the technology would be best suitable to detect only IMP events (not all activity) from various sources all workload running on AWS and also Splunk Cloud?
For continuous monitoring and detecting unusual configuration changes, I would suggest you look into AWS Config.
AWS Config enables you to assess, audit, and evaluate the configurations of your AWS resources. Config continuously monitors and records your AWS resource configurations and allows you to automate the evaluation of recorded configurations against desired configurations. Here is a list of supported AWS resources types and resource relationships with AWS Config https://docs.aws.amazon.com/config/latest/developerguide/resource-config-reference.html
Also as of Nov, 2019 - AWS Config launches support for third-party resources. You can now publish the configuration of third-party resources, such as GitHub repositories, Microsoft Active Directory resources, or any on-premises server into AWS Config using the new API. Here is more detail: https://docs.aws.amazon.com/config/latest/developerguide/customresources.html
If you have multiple AWS Account in your organization and want to detect changes there: https://docs.aws.amazon.com/config/latest/developerguide/aggregate-data.html
Lastly, if you already use Splunk Cloud in your enterprise and are looking for a consolidated view then, AWS Config is supported by Splunk Cloud as per their documentation too. https://aws.amazon.com/marketplace/pp/Splunk-Inc-Splunk-Cloud/B06XK299KV https://aws.amazon.com/marketplace/pp/Splunk-Inc-Splunk-Cloud/B06XK299KV
While it won't detect events as they happen a good stop gap would be to define your infrastructure config using terraform. You can then periodically run the terraform config against your environment and alert if there are any changes.
Consider using a combination of Netflix Security Monkey and AWS Guard Duty.
You can achieve automated detection and alerting, as well as automated recovery based on policies with these tools.
For instance, you could detect SecurityGroup rule changes that allow unrestricted egress from EC2 instances and then revert those changes automatically.
It's unclear from your post whether you want to detect events within the Splunk Cloud infrastructure or if you want to detect events indicated in data going to Splunk using the Splunk capabilities. If the latter, then Splunk has extremely rich capabilities in their query language and integrated alerting functions. With Splunk you can also run arbitrary Python scripts in response to certain events, so what you can't analyze and alert on with native functionality or plugins, you could write code to achieve.
Well there are clear advantages of using either tools, it all boils down to what exactly are you trying to achieve with this i.e do you want to proactive monitoring or do you want debug an incident/issue. Splunk definitely is superior in terms of proactively monitoring your logs for unusal events, but getting the cloudtrail logs across to splunk would require some not so straight forward setup (Splunk has a blueprint for this setup which uses AWS kinesis/Firehose). Cloudtrail on the other had is available out of the box from AWS, the setup is quite simple and straight forward. But analysing the log could require you setup Glue crawlers and you might have to use AWS Athena to run SQL Like query.
Refer: https://docs.aws.amazon.com/athena/latest/ug/cloudtrail-logs.html
In my personal experience the cost/effort involved in setting up splunk is not worth it for smaller workloads, whereas the AWS Cloudtrail/Glue/Athena would be less expensive setup(comparatively).
Alternatively you could look at something like sumologic, which has better integration with cloudtrail as opposed to splunk. Hope that helps.
I'd recommend using CloudTrail, it helped me a lot. But depending on your situation I'd recommed building a custom solution(like aws amazon-ssm-agent) which on configuration change makes an API call and logs them in grafana or kibana.
Pros of Splunk Cloud
- More powerful & Integrates with on-prem & off-prem7
- Free3
- Powerful log analytics3
- Pci compliance1
- Production debugger1
Pros of Stackdriver
- Monitoring19
- Logging11
- Alerting8
- Tracing7
- Uptime Monitoring6
- Error Reporting5
- Multi-cloud4
- Production debugger3
- Many integrations2
- Backed by Google1
- Configured basically with GAE1
Sign up to add or upvote prosMake informed product decisions
Cons of Splunk Cloud
Cons of Stackdriver
- Not free2