archivebox.core.models
Module Contents
Classes
- Tag: Old tag model, loosely based on django-taggit model + ABID base.
- SnapshotManager
- Snapshot
- ArchiveResult
Functions
API
- class archivebox.core.models.Tag(*args: Any, **kwargs: Any)[source]
Bases: archivebox.base_models.models.ModelWithReadOnlyFields, archivebox.base_models.models.ModelWithSerializers, archivebox.base_models.models.ModelWithUUID, archivebox.base_models.models.ABIDModel
Old tag model, loosely based on django-taggit model + ABID base.
Being phased out in favor of archivebox.tags.models.ATag.
Initialization
Overridden init method ensures we have a stable creation timestamp that fields can use within initialization code before saving to the DB.
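The idea of fixing a creation timestamp inside `__init__` can be illustrated with a minimal sketch. It is not ArchiveBox's implementation: `TimestampedRecord`, its `name` argument, and `derived_id()` are hypothetical names invented for this example, and a plain class stands in for a Django model.

```python
from datetime import datetime, timezone
from typing import Any, Optional


class TimestampedRecord:
    """Hypothetical stand-in for a model; NOT the actual Tag class."""

    def __init__(self, *args: Any, created_at: Optional[datetime] = None, **kwargs: Any) -> None:
        # Capture the timestamp once, at construction time, so that anything
        # computed before the first save sees the same value every time.
        self.created_at = created_at or datetime.now(timezone.utc)
        self.name = kwargs.get("name", "")  # hypothetical field

    def derived_id(self) -> str:
        # Any value derived from created_at stays stable for the life of
        # the object, because created_at is never recomputed.
        return f"{self.name}-{self.created_at.isoformat()}"
```

Because the timestamp is stored on the instance rather than recomputed on each access, repeated calls to `derived_id()` return the same string even before the object is persisted.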
- class archivebox.core.models.SnapshotManager[source]
Bases: django.db.models.Manager
- filter(*args, **kwargs)[source]
Add support for .filter(domain='example.com') to the Snapshot queryset.
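One way such a `filter()` override could work is to pull the `domain` keyword out of the arguments and translate it into an ordinary lookup on the `url` field. The sketch below shows only that translation step in plain Python, so it runs without Django. The helper name `split_domain_kwarg` and the choice of a `url__icontains` substring match are assumptions made for illustration, not a description of what ArchiveBox does.

```python
from typing import Any, Dict, Optional, Tuple


def split_domain_kwarg(
    kwargs: Dict[str, Any],
) -> Tuple[Dict[str, Any], Optional[Dict[str, str]]]:
    """Separate a `domain` keyword from the ordinary filter lookups.

    Returns (remaining_kwargs, extra_lookup). extra_lookup is None when
    no domain was requested.
    """
    remaining = dict(kwargs)  # copy so the caller's dict is not mutated
    domain = remaining.pop("domain", None)
    if domain is None:
        return remaining, None
    # Assumption: a domain match is approximated by a case-insensitive
    # substring match on the URL, anchored on the scheme separator.
    return remaining, {"url__icontains": f"://{domain}"}
```

Inside a manager's `filter(*args, **kwargs)` override, the two results would typically be applied in sequence: first `super().filter(*args, **remaining)`, then a second `.filter(**extra_lookup)` when it is not `None`.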
- class archivebox.core.models.Snapshot(*args: Any, **kwargs: Any)[source]
Bases: archivebox.base_models.models.ModelWithReadOnlyFields, archivebox.base_models.models.ModelWithSerializers, archivebox.base_models.models.ModelWithUUID, archivebox.base_models.models.ModelWithKVTags, archivebox.base_models.models.ABIDModel, archivebox.base_models.models.ModelWithOutputDir, archivebox.base_models.models.ModelWithConfig, archivebox.base_models.models.ModelWithNotes, archivebox.base_models.models.ModelWithHealthStats, workers.models.ModelWithStateMachine
- read_only_fields[source]
('id', 'abid', 'created_at', 'created_by_id', 'url', 'timestamp', 'bookmarked_at', 'crawl_id')
- crawl: crawls.models.Crawl[source]
'ForeignKey(…)'
- archiveresult_set: django.db.models.Manager[ArchiveResult][source]
None
- as_link() -> archivebox.index.schema.Link [source]
- as_link_with_details() -> archivebox.index.schema.Link [source]
- pending_archiveresults() -> django.db.models.QuerySet[archivebox.core.models.ArchiveResult] [source]
- create_pending_archiveresults() -> list[archivebox.core.models.ArchiveResult] [source]
- class archivebox.core.models.ArchiveResult(*args: Any, **kwargs: Any)[source]
Bases: archivebox.base_models.models.ModelWithReadOnlyFields, archivebox.base_models.models.ModelWithSerializers, archivebox.base_models.models.ModelWithUUID, archivebox.base_models.models.ModelWithKVTags, archivebox.base_models.models.ABIDModel, archivebox.base_models.models.ModelWithOutputDir, archivebox.base_models.models.ModelWithConfig, archivebox.base_models.models.ModelWithNotes, archivebox.base_models.models.ModelWithHealthStats, workers.models.ModelWithStateMachine
- EXTRACTOR_CHOICES[source]
(('htmltotext', 'htmltotext'), ('git', 'git'), ('singlefile', 'singlefile'), ('media', 'media'), ('a…
- read_only_fields[source]
('id', 'abid', 'created_at', 'created_by', 'snapshot', 'extractor', 'pwd')
- snapshot: archivebox.core.models.Snapshot[source]
'ForeignKey(…)'
- keys[source]
('snapshot_id', 'extractor', 'cmd', 'pwd', 'cmd_version', 'output', 'start_ts', 'end_ts', 'created_a…
- embed_path() -> str | None [source]
Return the runtime-calculated path to the on-disk file that should be used for user-facing iframe embeds of this result.
- canonical_outputs() -> Dict[str, Optional[str]] [source]
Predict the expected output paths that should be present after archiving.
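A mapping of the shape `Dict[str, Optional[str]]` lends itself to a simple check of which predicted outputs are actually present on disk. The sketch below is only an illustration of consuming such a mapping: `missing_outputs`, the `snapshot_dir` argument, and the example keys are hypothetical, and it assumes the predicted paths are relative to a single output directory.

```python
from pathlib import Path
from typing import Dict, List, Optional


def missing_outputs(predicted: Dict[str, Optional[str]], snapshot_dir: Path) -> List[str]:
    """Return the names whose predicted file does not exist under snapshot_dir.

    Hypothetical helper, not part of archivebox.core.models.
    """
    missing: List[str] = []
    for name, rel_path in predicted.items():
        if rel_path is None:
            # No path was predicted for this entry, so nothing to verify.
            continue
        if not (snapshot_dir / rel_path).exists():
            missing.append(name)
    return sorted(missing)
```

With a mapping such as `{"example_a": "a.html", "example_b": None}` (made-up keys), only `example_a` is checked, and it is reported as missing unless `a.html` exists in the given directory.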