ф
Data Engineering
Pipeline Orchestration
Tools
Cron [J]
Airflow
Luigi
Prefect
Rundeck
Dagster
Metaflow
Cloud
AWS Step Functions [M]
Google Cloud Composer
Azure Data Factory
Data Storage
RDBMS
Classic
MS SQL Server [J]
PostgreSQL [M]
MySQL [J]
Oracle
Distributed
Cloud
MS Synapse
Google BigQuery
Snowflake
YDB
Arenadata
On premise
Citus
Apache Hive
Clickhouse
NON - RDBMS
Apache Hadoop
MongoDB [M]
MS Cosmos DB
Apache Ignite
Apache Cassandra
AWS Dynamo DB
Google Firebase
Neo4J
Arango DB
Storage
Cloud
AWS S3 [J]
Google Drive
Minio S3
MS Sharepoint
MS OneDrive
Ya Disk
Sber Disk
Protocols
FTP [J]
SFTP [J]
SCP [J]
S3 [J]
WebDAV
HTTP [M]
ETL
Languages
Python
requests [M]
sqlalchemy [M]
pandas
psycopg2 / 3
bonobo
BeautifulSoup
lxml
dask
ScraPy
connectorX
Scala
Catz
Java
R
Go [J]
Tools
Pentaho DI
TalenD
MS Data Factory
MS SSIS
Databricks
Informatica
AWS Glue
AWS DMS
MQ
Rabbit
Kafka
AWS SQS
IBM MQ
Software Engineering
Deploy & Code Maintanance
CI / CD
Gitlab CI CD
Jenkins
Team City
Git
GitHub [M]
Bitbucket [J]
GitLab [J]
Code quality
Linters
Python
Flake8 [J]
wemake
mypy
pycodestype
Testing
Python
pytest [J]
pytest-coverage
hypothesis
unittest
Formatters
Python
black [J]
pre-commit
MLOps
Data tracking & Quality
DVC
CML
pandera
pydantic [J]
Experiment tracking
ClearML
MLFlow
Serving
bentoml
flask [MJ]
FastAPI [MJ]
Languages
Scala
Python [MJ]
Data Science
IT Infrastructure
Hosting & Serverless calculation
Cloud
Linux [M]
Windows
Azure Windows Server
AWS EC2 [J]
AWS Lightsail
SberCloud
Selectel
Containers
docker-compose
AWS ECS [J]
Azure Databricks
On premise
Linux [M]
Windows [M]
Authorization
MS Active Directory [J]
LDAP [J]
AWS IAM
Load Balancer
Azure Traffic Manager
Nginx [J]
AWS Elastic Load Balancing
Citrix ADC
HAProxy
Kubernetes
DNS [J]
LAN [J]
WAN [J]
Server
AWS Lightsail
AWS Route 53
Google Public DNS
Yandex DNS
Domain
Registration
Routing delegation
Structure
Zones
DNS Records
A
AAA
CNAME
MX
NX
PTR
SSL Certificates
Monitoring [J]
Zabbix
Grafana
Kibana
Prometheus
IaC