Discover and read the best of Twitter Threads about #pakdatasci

Most recents (8)

Here's a list of almost ~all regression models used in machine learning. Make sure to try them out.

A Thread ๐Ÿงต๐Ÿ‘‡

#MachineLearning #regression #codanics #ArtificialIntelligence #DataScience #analysis #DataAnalytics #model #algorithm #pakdatasci Image
Image
Image
Read 7 tweets
Sequences.
A core concept in Biopython is the biological sequence, and this is represented by the Seq class. A Biopython Seq object is similar to a Python string in many respects: it supports the Python slice notation, can be concatenated with other sequences and is immutable.
Sequence annotation.
The SeqRecord class describes sequences along with information such as name description and features in the form of SeqFeature objects. Each SeqFeature object specifies the type of the feature and its location.
Read 8 tweets
Biopython: an open-source collection of non-commercial Python tools for computational biology and bioinformatics It contains classes to represent biological sequences and sequence annotations
Thread
#pythonkachilla2 #pythonkachilla #pakdatasci #DataScience #MachineLearning
- History:
Began in 1999 and it was first released in July 2000. It was developed during a similar time frame and with analogous goals to other projects that added bioinformatics capabilities to their respective programming languages, including BioPerl, BioRuby and BioJava.
- Design:
Follows the conventions used by the Python programming language to make it easier for users familiar with Python. For example, Seq and SeqRecord objects can be manipulated via slicing, in a manner similar to Python's strings and lists. #DataVisualization
Read 4 tweets
Browsing Data Compendia:
This is a good strategy if you are not sure what types of variables exist or what data would be relevant for your project
- Select a data compendia
- Determine the subject area or data type that your topic or variable falls under
- Read the descriptions
Searching by Topic:
This guide provides several links to data sources by topic. These links are by no mean exhaustive, but can be a good place to start and can help you get a sense of who are some of the major collectors of data in your topic area.
Read 5 tweets
A beginner's guide to interpreting Box plot.

A thread ๐Ÿงต

#pythonkachilla2 #pythonkachilla #pakdatasci #DataScience #MachineLearning #DataVisualization #Statistics
Box plot is also called box and whiskers plot.

It gives us information about different things which help us get insights of the data, these include median (Q2), first quartile (Q1), third quartile (Q3), minimum value, maximum value, outliers, and interquartile range (IQR). Image
1. The line in the middle of the box is the median value.
50% of the data is on one side of the median, and 50% of the data is on the other side of the median.

2. The first quartile (Q1) is the middle value between the median and minimum value in the data set. (See figure)
Read 6 tweets
Shortcut keys for Microsoft Excel.

A Thread ๐Ÿ‘‡

#pythonkachilla2 #pythonkachilla #pakdatasci #DataScience #MachineLearning #DataVisualization
1: Alt+Q, then enter the search term.
Move to the Tell me or Search field on the ribbon and type a search term for assistance or Help content.
2: Alt+F
Open the File menu.
Read 10 tweets
STATISTICS ๐Ÿ“ˆis the most important subject to know if you are jumping into data science.

I have put together top 3 youtube playlists to get yourself started with statistics.

A thread ๐Ÿงต(1/4)
1. Statistics Tutorials (by 365 Data Science)

youtube.com/playlist?list=โ€ฆ

(2/4)
2. Statistics 101 (by Cognitive Class)

youtube.com/playlist?list=โ€ฆ

(3/4)
Read 6 tweets
5 ๐™๐™Š๐™‹ ๐™‹๐™ฎ๐™ฉ๐™๐™ค๐™ฃ ๐™‡๐™ž๐™—๐™ง๐™–๐™ง๐™ž๐™š๐™จ that you should master to become a ๐˜ฟ๐™–๐™ฉ๐™– ๐™Ž๐™˜๐™ž๐™š๐™ฃ๐™ฉ๐™ž๐™จ๐™ฉ

A Thread๐Ÿงต

#pythonkachilla2 #pythonkachilla #pakdatasci #DataScience #MachineLearning #DataVisualization #DataAnalytics Image
๐™‰๐™ช๐™ข๐™ฅ๐™ฎ (๐™‰๐™ช๐™ข๐™š๐™ง๐™ž๐™˜๐™–๐™ก ๐˜พ๐™ค๐™ข๐™ฅ๐™ช๐™ฉ๐™–๐™ฉ๐™ž๐™ค๐™ฃ)
Numpy is used for numerical computations including matrix calculations and multidimensional array operations. Performing numerical computation in Python is time-consuming so we use NumPy for fast computation.
๐™‹๐™–๐™ฃ๐™™๐™–๐™จ (๐˜ฟ๐™–๐™ฉ๐™– ๐˜ผ๐™ฃ๐™–๐™ก๐™ฎ๐™จ๐™ž๐™จ)
In Data Science, when you get the data you have to preprocess it to make it suitable for analysis. Pandas library provides us the tools to manipulate the data and make sense of the data. It is used for data cleaning/wrangling
Read 7 tweets

Related hashtags

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!