Entrepreneurship Hamburg, Startup Hamburg, Deep Tech Hamburg, Entrepreneurship Science, Technology Entrepreneurship, TUHH Entrepreneurship, Gründung Hamburg, Startup Forschung Hamburg, Technologie-Entrepreneurship, Startup Engineering, Entrepreneurial Finance, Innovation Hamburg
Dock-1, © Victor Klassen
We advance innovation and entrepreneurship through rigorous empirical research,
a scientific approach to entrepreneurship education,
and practice-oriented collaborations
— shaping the next generation of Startup Engineers.
The Science Data Lake: A Unified Open Infrastructure Integrating 293 Million Papers Across Eight Scholarly Sources with Embedding-Based Ontology Alignment Scholarly data are largely fragmented across siloed databases with divergent metadata and missing linkages among them. We present the Science Data Lake, a locally-deployable infrastructure built on DuckDB and simple Parquet files that unifies eight open sources - Semantic Scholar, OpenAlex, SciSciNet, Papers with Code, Retraction Watch, Reliance on Science, a preprint-to-published mapping, and Crossref - via DOI normalization while preserving source-level schemas. The resource comprises approximately 960GB of Parquet files spanning ~293 million uniquely identifiable papers across ~22 schemas and ~153 SQL views. An embedding-based ontology alignment using BGE-large sentence embeddings maps 4,516 OpenAlex topics to 13 scientific ontologies (~1.3 million terms), yielding 16,150 mappings covering 99.8% of topics (≥ 0.65 threshold) with F1 = 0.77 at the recommended ≥ 0.85 operating point, outperforming TF-IDF, BM25, and Jaro-Winkler baselines on a 300-pair gold-standard evaluation. We validate through 10 automated checks, cross-source citation agreement analysis (pairwise Pearson r = 0.76 - 0.87), and stratified manual annotation. Four vignettes demonstrate cross-source analyses infeasible with any single database. The resource is open source, deployable on a single drive or queryable remotely via HuggingFace, and includes structured documentation suitable for large language model (LLM) based research agents. 2026 Working Paper Jonas Wilinski
Organizing Entrepreneurial Teams: A Field Experiment on Autonomy over Choosing Teams and Ideas Organization Science, 34(6), 2097-2118 2023 Journal Article Viktoria Boss, Linus Dahlander, Christoph Ihl, Rajshri Jayaraman
More than words! How narrative anchoring and enrichment help to balance differentiation and conformity of entrepreneurial products Journal of Business Venturing, 35(6), 106050 2020 Journal Article Alexander Vossen, Christoph Ihl