Skip to content

Monthly Merge To production#660

Merged
cristian-tamblay merged 453 commits into
productionfrom
develop
May 29, 2026
Merged

Monthly Merge To production#660
cristian-tamblay merged 453 commits into
productionfrom
develop

Conversation

@cristian-tamblay
Copy link
Copy Markdown
Member

No description provided.

Irozuku and others added 30 commits May 15, 2026 10:24
…converter

- Switch pa.memory_map to pa.OSFile in all four read sites to release
  Windows file lock (WinError 1224) so converter job can write data.arrow
- Add mtime check in _FilteredTableCache.get so cache auto-invalidates
  when data.arrow is written by a converter job, preventing stale previews
…e cache

shutil.copytree uses copy2 which preserves the source file's mtime. When
deleting the only converter (no previous ones to re-run), the restored
data.arrow has an older mtime than the cache entry, so the mtime-based
cache invalidation never fires and the table still shows the old transformed
data. Touching data.arrow after the copy ensures a fresh mtime.
- Return actual job ID from delete_converter so the frontend polls
  for re-run completion before refreshing (previously job_ids was
  always empty due to hasattr check before put())
- Refresh column types via handleStatusChange (ConverterBox path) and
  FormConverterSection onSuccess (job-polling path) instead of a
  redundant useEffect in DatasetPreviewNotebook
- DatasetPreviewNotebook syncs localColumnTypes from context only,
  eliminating the extra type fetch on initial notebook load (3→1)
…apse-animation

fix: replace Paper with Accordion for model comparison panel
Add Portuguese language support for various components and tours
fix: clear notebook state and update SaveDatasetModal style on dataset save
fix: implement search threshold for displaying search bars in components
Reduce redundant dataset fetches and standardize notebook API endpoints
…oarding

Feat/fix and redesign onboarding
Reduce redundant HTTP requests in the notebooks/datasets UI
…dal-and-ordering

Fix/generative session modal and ordering
Fix stale dataset cache and converter updates for Arrow-backed datasets
Irozuku and others added 29 commits May 28, 2026 10:12
MultilingualString.get() already falls back to en when de=None,
so wrapping plain strings with de=description was redundant and
caused frozen-dataclass equality failures in test_registry.py.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Introduces SUPPORTS_NATIVE_TYPES, NATIVE_TYPE_MAPPING, and
extract_native_types() on the dataloader base so self-describing
formats can expose column types directly instead of going through
statistical inference. Default returns None, preserving existing
behavior for all current loaders. get_metadata() now surfaces the
capability flag for the frontend.
ARFF files declare each attribute as NUMERIC, INTEGER, REAL, NOMINAL,
STRING, or DATE in the header. extract_native_types now reads the
scipy meta object (previously discarded) and builds the DashAI column
type dict directly: numeric kinds map to Float/Integer, nominal maps
to Categorical with categories taken verbatim from the header.
Refactors _read_arff_file to share the raw scipy call.
When the request sets use_native_types and the chosen dataloader
declares SUPPORTS_NATIVE_TYPES, call extract_native_types on the
prepared file and short-circuit the DashAIPtype/Dummy inference
loop. The returned dict reuses the inferred_types response field,
so existing frontend and dataset_job paths consume it unchanged.
DataloaderConfigBar fetches the selected dataloader's metadata and,
when supports_native_types is true, renders a Switch above the
Inference Rows input. The toggle defaults to on so the first preview
already runs in native mode, propagates use_native_types in the
params payload, and swaps the row-count input label to Preview Rows
when active. New i18n keys land in en, es, and pt locales.
Adds tests for SUPPORTS_NATIVE_TYPES metadata exposure, the full
schema returned by extract_native_types over a mixed-attribute
ARFF (NUMERIC/REAL/INTEGER/NOMINAL with categories), shape parity
with DashAIPtype.infer_types, and the negative path where loaders
without an override return None.
BaseDataLoader.get_metadata now exposes the supports_native_types
flag for the frontend toggle. Update test_components_api fixtures
to match the new metadata dict shape (defaults to False for stock
test loaders).
feat: add German (Deutsch) language support
Add Hub for browsing and importing datasets from external sources
…torage-state

Clear stale localStorage state when deleting or recreating a run
Add native type extraction support to BaseDataLoader and ARFF
…nction

- Replaced individual dataset classes with a common `_make_image_dataset` function across multiple classifiers (Torchvision, CNN, LeNet5, MLP).
- Updated model definitions to utilize the new dataset function.
- Simplified model architecture definitions by introducing `_build_*_model` functions for CNN, LeNet5, and MLP.
- Removed redundant imports and organized code for better readability.
- Enhanced training loops to include evaluation metrics for both training and validation datasets.
…tion-models

Add image classification models
Fix/hpo with one-hot encoder and score system
…ator

Fix type change validation for hub import datasets without local files
Improve performance and usability of analysis tabs and insights
Include translation function in model fetching dependencies
@cristian-tamblay cristian-tamblay merged commit 6660ce9 into production May 29, 2026
21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants