Rakesh Jain Profile picture
Jan 4, 2022 β€’ 16 tweets β€’ 6 min read β€’ Read on X
Linux Diagnostics and Troubleshooting Series -

Thread 1 πŸ‘‡

Understanding what Troubleshooting is!

#Linux #diagnostics #troubleshooting #security
What is troubleshooting?

Troubleshooting is the art of taking a problem, gathering information about it, analyzing it, and finally solving it.

While some problems are inherently β€œharder” than others, the same basic approach can be taken for every problem. PC: https://knowmax.ai/wp-content/uploads/2020/09/Troublesho
Not just fixing!

While fixing a problem is one of the major parts of troubleshooting, there are other parts that cannot be neglected: documenting the problem (and fix), and performing a root cause analysis (RCA). PC: https://knowmax.ai/wp-content/uploads/2020/09/Troublesho
Documenting the problem (and the fix) can help in the future when another (or possibly the same) administrator is faced with the same, or a similar, problem.

Performing a root cause analysis can help in preventing similar problems in the future. PC: https://www.foodengineeringmag.com/ext/resources/FE-Late
Using the scientific method -

A good schema to follow wen troubleshooting is d scientific method:

1.Clearly define d issue-
Take a step & view d larger picture, den clearly define d actual problem. Most of d problems reported r symptoms of another problem, not d actual problem. PC: https://www.wikihow.com/images/thumb/c/c9/Define-a-Probl
For eg, a user might call about a problem signing into a machine. While this is a problem for d user, the actual problem can be a forgotten passwd, an incorrectly configured machine, a nw issue, or something else entirely.

Further investigation is needed to determine d cause. PC: https://www.elegantthemes.com/blog/wp-content/uploads/20
2. Collect information -

The next step is collecting as much (relevant) as possible. This information can come from a wide variety of sources: reading log files, information displayed on screen or in a GUI, follow-up questions for the original reporter, etc. PC: https://www.gocanvas.com/content/images/image-uploads/fa
3. Form a hypothesis -
After looking at all gathered info, & d symptoms observed/reported, it is time to form a hypothesis abt the cause of d problem. PC: https://slideplayer.com/5772059/19/images/slide_1.jpg
Sometimes this can be easy; for example, when a user has forgotten his password. Other times, it can be harder; for example, when a single service in a high-availability cluster fails to start on Mondays during months with an "e" in their name.
The key to remember during this step is that the hypothesis is just that, a hypothesis: a best guess as to what can be the cause of the issue. During the following steps, this hypothesis will be tested. If it turns out the hypothesis was wrong, a new one can be formed.
4. Test the hypothesis -
With an initial hypothesis formed, it can be tested for validity. How this testing happens depends on d problem & d hypothesis. PC: www.statisticsteacher.org/files/2021/03/HypothesisTestin
For example, when d hypothesis for a login problem states, β€œThe nw connection between d workstation & d KDC is being interrupted by a firewall,” the testing will be different from a hypothesis for a spontaneously rebooting server including a faulty UPS.
5. Fixing the problem -
If a hypothesis was not found to be invalid, an attempt can be made to fix the problem. During this stage, it is vital to only change one variable at a time, documenting all changes made, and testing every change individually. PC: https://www.slideteam.net/media/catalog/product/cache/96
Keeping backups of any changed configuration files, and reverting to those backups if a change was found to be ineffective, is also crucial. Modifying multiple configurations at once typically only leads to further issues, not fixes. PC: https://iosafe.com/wp-content/uploads/2020/04/3-types-of
6. Rinse & repeat -
If the proposed fixes did not actually resolve the issue, the process will need to be restarted from the top. This time, any new information discovered during this cycle can be added to the mix to form a new hypothesis. PC: https://miro.medium.com/max/1024/1*FJ20mlOOkJFH6py_-pvRz
Hope you like this thread. If yes Retweet it!
More to come.

Follow me for more such content.

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with Rakesh Jain

Rakesh Jain Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @devops_tech

Nov 21
Platform Engineer vs. SRE vs. DevOps Engineer!

A Thread 🧡
1/ What’s the difference? πŸ€”
All these roles focus on improving software delivery and reliability but have different goals. Here’s a quick breakdown:
2/ Platform Engineer πŸ‘·
πŸ‘‰ Goal: Make life easier for developers.
β€’ Build tools & platforms (e.g. CI/CD pipelines, Kubernetes)
β€’ Standardize workflows for efficiency
β€’ Create self-service systems so developers can work faster

πŸ› οΈ They focus on improving developer productivity.
Read 9 tweets
Sep 12
AWS service limitations to keep in mind!

A List of critical #AWS services and their limitations πŸ‘‡ Image
1. EC2 – Instance limits by region, instance type restrictions.
2. RDS – Max database storage limits, instance size restrictions.
3. S3 – Max object size is 5TB, bucket policies can limit access.
4. EBS – Volume size max of 64TB, 20,000 IOPS for io1/io2 volumes.
5. IAM – Max 5,000 roles per account, policy size limits.
6. Lambda – Max execution timeout of 15 minutes, memory max 10GB.
7. DynamoDB – Partition throughput limits, item size max of 400KB.
8. CloudFormation – 200 resources limit per stack.
Read 13 tweets
Sep 9
Mastering Ansible command line!

A Thread of 10 key #Ansible commands and concepts all #DevOps and #Linux Administrators should focus on πŸ‘‡πŸ‘‡ Image
1️⃣ Check Ansible Version

Command:
ansible --version

Explanation:
Verify your Ansible installation and check version details.
2️⃣ Ping All Hosts

Command:
ansible all -m ping

Explanation:
Ping all hosts in your inventory to ensure they are reachable.
Read 18 tweets
Sep 5
🧡 Mastering Docker Troubleshooting: 15 Key Tips for Developers and DevOps Engineers!

A Thread πŸ‘‡πŸ‘‡ Image
1/ 🐳 Check Container Status
Use docker ps -a to view all containers and their statuses. A container may have exited unexpectedly.
Look at STATUS and RESTART policies to identify potential issues.
2/ πŸ” Inspect Logs
Run docker logs <container_name> to see the container logs.
This helps troubleshoot crashes, errors, or other issues within the app or service.
Read 18 tweets
Sep 3
🧡 Mastering Dockerfile: A Breakdown of 15 most important Parameters!

A thread πŸ‘‡πŸ‘‡ Image
1️⃣ FROM:
The first instruction in any Dockerfile. It defines the base image for subsequent instructions.

FROM ubuntu:20.04 - Starts with an Ubuntu 20.04 image.
2️⃣ LABEL:
Adds metadata to the image, such as author or version.

LABEL maintainer="you@example.com" - Tags the image with your contact info.
Read 18 tweets
Aug 30
🧡 Kubernetes Security Master Guide! πŸ” Image
1️⃣ Role-Based Access Control (RBAC)

πŸ“œ Limit access with RBAC! Define who can do what in your cluster by setting up roles and bindings. Keep permissions tight!

Example:
kubectl create rolebinding developer --clusterrole=edit --user=dev@example.com -n dev-namespace
2️⃣ Network Policies

🌐 Control traffic flow between pods using Network Policies. Limit communication to what's needed, reducing the attack surface.
Example: A policy that only allows inbound traffic from specific pods: Image
Read 18 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(