Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
BigData Praktikum
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Requirements
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Test cases
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Insights
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Franziska Roepke
BigData Praktikum
Compare revisions
47e77d3cd461026ba5f8b10117728ca37c5f5427 to 595807dba50e08f3d20bf4954fbf8c40efd3f1a1
Compare revisions
Changes are shown as if the
source
revision was being merged into the
target
revision.
Learn more about comparing revisions.
Source
iu71daji/BigData_Praktikum
Select target project
No results found
595807dba50e08f3d20bf4954fbf8c40efd3f1a1
Select Git revision
Branches
dev
main
test
Swap
Target
iu71daji/BigData_Praktikum
Select target project
iu71daji/BigData_Praktikum
1 result
47e77d3cd461026ba5f8b10117728ca37c5f5427
Select Git revision
Branches
dev
main
test
Show changes
Only incoming changes from source
Include changes to target since source was created
Compare
Commits on Source (5)
notes appended
· d8414ed3
franzi - schranzi
authored
1 year ago
d8414ed3
Merge branch 'main' of
https://git.informatik.uni-leipzig.de/iu71daji/BigData_Praktikum
into main
· aad79851
franzi - schranzi
authored
1 year ago
aad79851
Merge branch 'dev' into 'main'
· 8d9714c4
Paul Kuehnel
authored
1 year ago
Dev See merge request
!1
8d9714c4
create .gitlab-ci.yml file for CI/CD pipelines
· 0e2b0b3d
Paul Kuehnel
authored
1 year ago
0e2b0b3d
edit gitlab-ci
· 595807db
Paul Kuehnel
authored
1 year ago
595807db
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
.gitlab-ci.yml
+53
-0
53 additions, 0 deletions
.gitlab-ci.yml
docs/source/Notizen.md
+16
-5
16 additions, 5 deletions
docs/source/Notizen.md
with
69 additions
and
5 deletions
.gitlab-ci.yml
0 → 100644
View file @
595807db
image
:
python:3.9
# Change pip's cache directory to be inside the project directory since we can
# only cache local items.
variables
:
PIP_CACHE_DIR
:
"
$CI_PROJECT_DIR/.cache/pip"
# Pip's cache doesn't store the python packages
# https://pip.pypa.io/en/stable/topics/caching/
#
# If you want to also cache the installed packages, you have to install
# them in a virtualenv and cache it as well.
cache
:
paths
:
-
.cache/pip
-
venv/
before_script
:
-
python --version ; pip --version
# For debugging
-
pip install virtualenv
-
virtualenv venv
-
source venv/bin/activate
#test:
# script:
# - pip install ruff tox # you can also use tox
# - pip install --editable ".[test]"
# - tox -e py,ruff
test
:
script
:
-
pip install -r requirements.txt
-
python test.py
#artifacts:
# paths:
# - build/*
pages
:
script
:
-
pip install sphinx sphinx-rtd-theme
-
cd docs
-
make html
-
mv build/html/ ../public/
artifacts
:
paths
:
-
public
rules
:
-
if
:
$CI_COMMIT_BRANCH == $CI_DEFAULT_BRANCH
#deploy:
# stage: deploy
# script: echo "Define your deployment script!"
# environment: production
This diff is collapsed.
Click to expand it.
docs/source/Notizen.md
View file @
595807db
# Big Data Praktikum Autoencoder
# Big Data Praktikum Autoencoder
## 0. Test Data Frame
## 0. Test Data Frame
*
auseinandersetzung mit loom-file
*
auseinandersetzung mit loom-file
(loompy verwenden, loom Datei einlesen & pandas erstellen)
*
checken wie ein repräsentatives Pandas Data Frame aussehen kann
*
checken wie ein repräsentatives Pandas Data Frame aussehen kann
*
eins zum testen erstellen.
*
eins zum testen erstellen.
### 0.1 Datenformat
### 0.1 Datenformat
...
@@ -9,10 +9,10 @@
...
@@ -9,10 +9,10 @@
*
ever row is a gene which can be expressed or not..
*
ever row is a gene which can be expressed or not..
## 1. Data-Access
## 1. Data-Access
1.
Get Data from API
1.
Get Data from API
2.
get it into a "good" format
2.
get it into a "good" format
-> what do we need ?
-> what do we need ?
##
# 1.
2
. Loom-Files
# 1.
1
. Loom-Files
*
Idee: Meta Data durch ID behalten, Daten ohne Metadaten nehmen & in Autoencoder reintun.
*
Idee: Meta Data durch ID behalten, Daten ohne Metadaten nehmen & in Autoencoder reintun.
*
Idee: Pandas Data Frames extrahieren für den Autoencoder.
*
Idee: Pandas Data Frames extrahieren für den Autoencoder.
also jeder Data Frame = 1 Input für Encoder.
also jeder Data Frame = 1 Input für Encoder.
...
@@ -26,5 +26,16 @@
...
@@ -26,5 +26,16 @@
## 2. Auto Encoder
## 2. Auto Encoder
*
reinlesen
*
welche bibliotheken ?
*
scientific computing resources, wie verwendet man die ?
## 3. Visualisation
## 3. Visualisation
Ziel: Visualisierung des Clusters
*
latent space
*
Idee: Autoencoder output, z.B. 50 Dimensionen
*
Darauf Dimensionsreduktions-Algorithmen anwenden
*
z.B. t-SNE und uMap was macht Sinn, auf wie viele Dimensionen wollen wir runter ? Glaube auf 2
*
Clusteranalyse dann auf 2-dimensionalem anwenden ?
*
welche.. kMeans z.B. ?
*
[
Dokument zu Clusteralgorithmen
](
https://www.kde.cs.uni-kassel.de/wp-content/uploads/ws/LLWA03/fgml/final/Kirchner.pdf
)
This diff is collapsed.
Click to expand it.