Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Rakesh Jain

@devops_tech

Jan 22, 2022 • 32 tweets • 9 min read • Read on X

Scrolly

Linux Diagnostics and Troubleshooting Series -

Thread 4 👇

IDENTIFYING HARDWARE ISSUES!

#Linux #diagnostics #troubleshooting #security

Identifying hardware -
An important step in troubleshooting potential hw issues is knowing exactly which hw is present in a system. For virtual systems, this might seem less useful than for a physical system, but it can tell an admin if the correct virtual devices have been added

Identifying CPUs -

The CPU(s) in a running system can be identified with the lscpu command from the util-linux package.

# lscpu

Another useful piece of info is what flags a CPU supports. These flags indicate whether a CPU supports certain extended technologies, such as AES acceleration, hw-assisted virtualization, & many more. These flags can be inspected in /proc/cpuinfo.

# cat /proc/cpuinfo

Point to Note -
The fact that a CPU supports a certain flag doesn't always mean that the feature is available. For eg, the vmx flag on a Intel CPU indicates that d processor is capable of supporting hw virtualization, but d feature itself might be disabled in the system firmware.

Identifying memory -

The dmidecode tool can be used to retrieve info about physical memory banks, including the type, speed, and location of the bank. To retrieve this information, use the command

# dmidecode -t memory

Identifying disks -

To identify physical disks, an administrator can use the command lsscsi from the lsscsi package. This tool can list all physical SCSI (and USB, SATA, and SAS) drives attached to a system.

# apt-get install lsscsi
# lsscsi -v

For more information, the hdparm command from the hdparm package can be used on individual disks.

# hdparm -I /dev/sda

Identifying PCI hardware -

Attached PCI hardware can be identified with the lspci command. Adding one or more -v options will increase the verbosity.

# lspci

Identifying USB hardware -

USB hardware can be identified using the lsusb command. Just like with lspci, verbosity can be increased by adding -v options.

# lsusb

Hardware error reporting -

Modern systems can typically keep a watch on various hw failures, alerting an admin when a hw fault occurs. While some of these solutions r vendor-specific, and require a remote management card, others can be read from the OS in a standardized fashion.

There are two mechanisms for logging hardware faults, mcelog and rasdaemon.

mcelog -
mcelog provides a framework for catching, and logging machine check exceptions on x86 systems. On supported systems, it can also automatically mark bad areas of RAM so that they will not be used

To install and enable mcelog, follow the following procedure:

1. Install the mcelog package.
# apt-get install mcelog

or

# yum install mcelog

Note - On Ubuntu 18.04 onwards The mcelog package functionality has been replaced by rasdaemon.

2. Start and enable the mcelog.service service.

root@lco-linux-worker1:~# systemctl enable mcelog
root@lco-linux-worker1:~# systemctl start mcelog

From now on, hw errors caught by the mcelog daemon will show up in the system journal. Messages can be queried using the cmd journalctl -u mcelog.service. If the abrt daemon is installed and active, it will also trigger on various mcelog messages.

Alternatively, for administrators who do not wish to run a separate service, a cron is set up, but
commented out, in /etc/cron.hourly/mcelog.cron that will dump events into /var/log/mcelog.

rasdaemon -
A modern replacement for mcelog dat hooks into d kernel trace subsystem. It stands for Reliability, Availability, & Serviceability. It hooks into d Error Detection & Correction (EDAC) mechanism for DIMM modules & reports dem to user space & RAS msgs dat come from kts.

To enable rasdaemon, use the following steps:

1. Install the rasdaemon package.
# apt-get install rasdaemon

or

# yum install rasdaemon

2. Start and enable the rasdaemon.service service.

[root@lco-linux-worker1:~# systemctl enable rasdaemon

[root@lco-linux-worker1:~# systemctl start rasdaemon

Information about the various memory banks can be queried using the ras-mc-ctl tool.

Of special interest are ras-mc-ctl --status to show the current status, and ras-mc-ctl -- errors to view any logged errors

Memory testing -

When a physical memory error is suspected, an administrator might want to run an exhaustive
memory test. In these cases, the memtest86+ package can be installed.

Since memory testing on a live system is less than ideal, the memtest86+ package will install a
separate boot entry that runs memtest86+ instead of a regular Linux kernel.

The following steps outline how to enable this boot entry -

1. Install the memtest86+ package; this will install the memtest86+ application into /boot.

[root@lco-linux-worker1:~# yum install memtest86+

or

root@lco-linux-worker1:~# apt-get install memtest86+

2. Run the command memtest-setup. This will add a new template into /etc/grub.d/ to enable memtest86+.

# memtest-setup

There is another utility called memtester.

# apt install memtester

3. Update the grub2 boot loader configuration.

# grub2-mkconfig -o /boot/grub2/grub.cfg

Digging into multiple loggings -

Dmesg allows you to figure out errors and warnings in the kernel's latest messages. For example, here is output of the dmesg | more command:

# dmesg | more

You can also look at all Linux system logs in the /var/log/messages or syslog file, which is where you'll find errors related to specific issues. It's worthwhile to monitor d msgs via the tail cmd in real time when you make modifications to your hw.

# tail -f /var/log/messages

Analyzing networking functions -

You may have hundreds of thousands of cloud-native applications to serve business services in a complex networking environment; these may include virtualization, multiple cloud, and hybrid cloud.

This means you should analyze whether networking connectivity is working correctly as part of your troubleshooting. Useful commands to figure out networking functions in the Linux server include ip addr, traceroute, nslookup, dig, and ping, among others.

Conclusion -
Troubleshooting Linux hw requires considerable knowledge, including how to use powerful command-line tools and figure out system loggings. You should also know how to diagnose the kernel space, which is where you can find the root cause of many hardware problems.

Hope you like the thread. If yes, retweet it. You can follow me for more such content.

Thanks!

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @devops_tech

Rakesh Jain

@devops_tech

Oct 18, 2025

🧵 Free & Best Tools to Scan Containers, K8s, Terraform & Jenkins Configs

Security isn’t just about firewalls — it’s about catching misconfigs before they hit prod 🚨

Here’s a thread on the best free tools every DevOps engineer should know 👇

1️⃣ Container Image Scanning – Trivy (by Aqua Security)

🔹 Scans Docker, Podman, and OCI images for vulnerabilities & secrets
🔹 Also supports IaC & SBOMs
🔹 Fast, single binary

📦 Example:
trivy image nginx:latest

👉 github.com/aquasecurity/t…

2️⃣ Kubernetes Config Scanning – Kubescape / kube-score

🔹 Scans YAML/Helm manifests against NSA & CIS benchmarks
🔹 Detects privilege issues, missing limits, etc.

🧩 Example:
kubescape scan --submit false .
OR
kube-score score manifests/

👉 github.com/kubescape/kube…

Read 11 tweets

Rakesh Jain

@devops_tech

Aug 9, 2025

Top 20 Linux networking commands explained with examples!

A Thread 👇

1⃣ ifconfig: Displays network interface configuration.

For example,
ifconfig eth0

shows the configuration details of the Ethernet interface. #LinuxNetworkingExample

2⃣ ip: Versatile command to manage network interfaces, addresses, and routes.

For instance,
ip addr show

displays IP addresses assigned to all interfaces. #LinuxNetworkingExample

Read 23 tweets

Rakesh Jain

@devops_tech

Aug 7, 2025

Understanding sudo, su, su - and sudo su !

A Thread with examples 👇

1/8 🐦 Welcome to today's thread!

Let's dive into the world of user privileges on Linux systems. We'll explore the differences between sudo, su, and sudo su.

#Linux #UserPrivileges

2/8 🐦 First up, sudo!

sudo stands for "Superuser Do." It allows regular users to perform administrative tasks by temporarily gaining root (superuser) privileges. Just add "sudo" before a cmd to execute it with elevated privileges. eg: sudo apt-get update updates packages.

Read 23 tweets

Rakesh Jain

@devops_tech

Aug 5, 2025

Load Balancer vs Reverse Proxy vs API Gateway

A Thread 🧵

1/ 💡 Let's dive into the world of networking and infrastructure components: Load Balancer, Reverse Proxy, and API Gateway.

They play distinct roles in managing web traffic.

2/ 🔄 Reverse Proxy:
A reverse proxy is like a middleman between clients and servers. It handles requests on behalf of servers, often providing benefits like security, load balancing, and caching.

Example: Nginx, Apache.

Read 26 tweets

Rakesh Jain

@devops_tech

Jul 25, 2025

All possible reasons a Kubernetes Pod can go into CrashLoopBackOff 🧵👇

1/🧵 What causes a Kubernetes Pod to go into CrashLoopBackOff?
Here’s a deep-dive thread on ALL the possible reasons and how to fix them. 🚑🐳
#Kubernetes #DevOps #CrashLoopBackOff

2/ Container Exit Code != 0
Your container crashed due to an error in the app.

🛠️ Fix: Check logs with kubectl logs <pod> and fix code/config causing the error.

Read 36 tweets

Rakesh Jain

@devops_tech

Jun 30, 2025

🧵 10 Things Every DevSecOps Engineer Must Know About Kubernetes Security — with real examples 👇

1/
🔐 RBAC > cluster-admin
Grant access based on roles, not titles.
✅ Example: Allow devs to view pods only:

2/
🕵️ Enable Audit Logs
Track who deleted a service or changed a config.

✅ Example: Enable auditing via kube-apiserver:
--audit-log-path=/var/log/k8s-audit.log

Read 18 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Rakesh Jain

Try unrolling a thread yourself!

More from @devops_tech

Rakesh Jain

Rakesh Jain

Rakesh Jain

Rakesh Jain

Rakesh Jain

Rakesh Jain

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!