Commit 2bd7d8df authored by Greg Kroah-Hartman's avatar Greg Kroah-Hartman

Merge tag 'misc-habanalabs-next-2020-05-19' of...

Merge tag 'misc-habanalabs-next-2020-05-19' of git://people.freedesktop.org/~gabbayo/linux into char-misc-next

Oded writes:

This tag contains the following changes for kernel 5.8:

- GAUDI ASIC support. The tag contains code and header files needed to
  initialize the GAUDI ASIC and run workloads on it. There are changes to
  the common code that are needed for GAUDI and there is the addition of
  new ASIC-dependent code of GAUDI.

- Add new feature of signal/wait command submissions. This is relevant to
  GAUDI only and allows the user to sync between streams (queues) inside
  the device.

- Allow user to retrieve the device time alongside the host time, to allow
  a user application to synchronize device time together with host
  time during profiling.

- Change ASIC's CPU initialization by loading its boot loader code from the
  Host memory (instead of it being programmed on the on-board FLASH).

- Expose more attributes through HWMON.

- Move the ASIC event handling code to be "common code". This will be
  shared between GAUDI and future ASICs. Goya will still use its own code.

- Fix bug in command submission parsing in Goya.

- Small fixes to security configuration (open up some registers for user
  access).

- Improvements to ASIC reset code.

* tag 'misc-habanalabs-next-2020-05-19' of git://people.freedesktop.org/~gabbayo/linux: (38 commits)
  habanalabs: update patched_cb_size for Wreg32
  habanalabs: move event handling to common firmware file
  habanalabs: enable gaudi code in driver
  habanalabs: add gaudi profiler module
  habanalabs: add gaudi security module
  habanalabs: add hwmgr module for gaudi
  habanalabs: add gaudi asic-dependent code
  uapi: habanalabs: add gaudi defines
  habanalabs: add gaudi asic registers header files
  habanalabs: get card type, location from F/W
  habanalabs: support clock gating enable/disable
  habanalabs: set PM profile to auto only for goya
  habanalabs: add dedicated define for hard reset
  habanalabs: check if CoreSight is supported
  habanalabs: add signal/wait to CS IOCTL operations
  habanalabs: handle the h/w sync object
  habanalabs: define ASIC-dependent interface for signal/wait
  uapi: habanalabs: add signal/wait operations
  habanalabs: add missing MODULE_DEVICE_TABLE
  habanalabs: print all CB handles as hex numbers
  ...
parents 57c76221 87eaea1c
...@@ -8,6 +8,16 @@ Description: Sets the device address to be used for read or write through ...@@ -8,6 +8,16 @@ Description: Sets the device address to be used for read or write through
only when the IOMMU is disabled. only when the IOMMU is disabled.
The acceptable value is a string that starts with "0x" The acceptable value is a string that starts with "0x"
What: /sys/kernel/debug/habanalabs/hl<n>/clk_gate
Date: May 2020
KernelVersion: 5.8
Contact: oded.gabbay@gmail.com
Description: Allow the root user to disable/enable in runtime the clock
gating mechanism in Gaudi. Due to how Gaudi is built, the
clock gating needs to be disabled in order to access the
registers of the TPC and MME engines. This is sometimes needed
during debug by the user and hence the user needs this option
What: /sys/kernel/debug/habanalabs/hl<n>/command_buffers What: /sys/kernel/debug/habanalabs/hl<n>/command_buffers
Date: Jan 2019 Date: Jan 2019
KernelVersion: 5.1 KernelVersion: 5.1
...@@ -150,3 +160,10 @@ KernelVersion: 5.1 ...@@ -150,3 +160,10 @@ KernelVersion: 5.1
Contact: oded.gabbay@gmail.com Contact: oded.gabbay@gmail.com
Description: Displays a list with information about all the active virtual Description: Displays a list with information about all the active virtual
address mappings per ASID address mappings per ASID
What: /sys/kernel/debug/habanalabs/hl<n>/stop_on_err
Date: Mar 2020
KernelVersion: 5.6
Contact: oded.gabbay@gmail.com
Description: Sets the stop-on_error option for the device engines. Value of
"0" is for disable, otherwise enable.
...@@ -10,6 +10,23 @@ KernelVersion: 5.1 ...@@ -10,6 +10,23 @@ KernelVersion: 5.1
Contact: oded.gabbay@gmail.com Contact: oded.gabbay@gmail.com
Description: Version of the application running on the device's CPU Description: Version of the application running on the device's CPU
What: /sys/class/habanalabs/hl<n>/clk_max_freq_mhz
Date: Jun 2019
KernelVersion: not yet upstreamed
Contact: oded.gabbay@gmail.com
Description: Allows the user to set the maximum clock frequency, in MHz.
The device clock might be set to lower value than the maximum.
The user should read the clk_cur_freq_mhz to see the actual
frequency value of the device clock. This property is valid
only for the Gaudi ASIC family
What: /sys/class/habanalabs/hl<n>/clk_cur_freq_mhz
Date: Jun 2019
KernelVersion: not yet upstreamed
Contact: oded.gabbay@gmail.com
Description: Displays the current frequency, in MHz, of the device clock.
This property is valid only for the Gaudi ASIC family
What: /sys/class/habanalabs/hl<n>/cpld_ver What: /sys/class/habanalabs/hl<n>/cpld_ver
Date: Jan 2019 Date: Jan 2019
KernelVersion: 5.1 KernelVersion: 5.1
......
...@@ -13,3 +13,6 @@ habanalabs-$(CONFIG_DEBUG_FS) += debugfs.o ...@@ -13,3 +13,6 @@ habanalabs-$(CONFIG_DEBUG_FS) += debugfs.o
include $(src)/goya/Makefile include $(src)/goya/Makefile
habanalabs-y += $(HL_GOYA_FILES) habanalabs-y += $(HL_GOYA_FILES)
include $(src)/gaudi/Makefile
habanalabs-y += $(HL_GAUDI_FILES)
...@@ -105,10 +105,9 @@ int hl_cb_create(struct hl_device *hdev, struct hl_cb_mgr *mgr, ...@@ -105,10 +105,9 @@ int hl_cb_create(struct hl_device *hdev, struct hl_cb_mgr *mgr,
goto out_err; goto out_err;
} }
if (cb_size > HL_MAX_CB_SIZE) { if (cb_size > SZ_2M) {
dev_err(hdev->dev, dev_err(hdev->dev, "CB size %d must be less than %d\n",
"CB size %d must be less then %d\n", cb_size, SZ_2M);
cb_size, HL_MAX_CB_SIZE);
rc = -EINVAL; rc = -EINVAL;
goto out_err; goto out_err;
} }
...@@ -211,7 +210,7 @@ int hl_cb_ioctl(struct hl_fpriv *hpriv, void *data) ...@@ -211,7 +210,7 @@ int hl_cb_ioctl(struct hl_fpriv *hpriv, void *data)
{ {
union hl_cb_args *args = data; union hl_cb_args *args = data;
struct hl_device *hdev = hpriv->hdev; struct hl_device *hdev = hpriv->hdev;
u64 handle; u64 handle = 0;
int rc; int rc;
if (hl_device_disabled_or_in_reset(hdev)) { if (hl_device_disabled_or_in_reset(hdev)) {
...@@ -223,15 +222,26 @@ int hl_cb_ioctl(struct hl_fpriv *hpriv, void *data) ...@@ -223,15 +222,26 @@ int hl_cb_ioctl(struct hl_fpriv *hpriv, void *data)
switch (args->in.op) { switch (args->in.op) {
case HL_CB_OP_CREATE: case HL_CB_OP_CREATE:
rc = hl_cb_create(hdev, &hpriv->cb_mgr, args->in.cb_size, if (args->in.cb_size > HL_MAX_CB_SIZE) {
&handle, hpriv->ctx->asid); dev_err(hdev->dev,
"User requested CB size %d must be less than %d\n",
args->in.cb_size, HL_MAX_CB_SIZE);
rc = -EINVAL;
} else {
rc = hl_cb_create(hdev, &hpriv->cb_mgr,
args->in.cb_size, &handle,
hpriv->ctx->asid);
}
memset(args, 0, sizeof(*args)); memset(args, 0, sizeof(*args));
args->out.cb_handle = handle; args->out.cb_handle = handle;
break; break;
case HL_CB_OP_DESTROY: case HL_CB_OP_DESTROY:
rc = hl_cb_destroy(hdev, &hpriv->cb_mgr, rc = hl_cb_destroy(hdev, &hpriv->cb_mgr,
args->in.cb_handle); args->in.cb_handle);
break; break;
default: default:
rc = -ENOTTY; rc = -ENOTTY;
break; break;
...@@ -278,7 +288,7 @@ int hl_cb_mmap(struct hl_fpriv *hpriv, struct vm_area_struct *vma) ...@@ -278,7 +288,7 @@ int hl_cb_mmap(struct hl_fpriv *hpriv, struct vm_area_struct *vma)
cb = hl_cb_get(hdev, &hpriv->cb_mgr, handle); cb = hl_cb_get(hdev, &hpriv->cb_mgr, handle);
if (!cb) { if (!cb) {
dev_err(hdev->dev, dev_err(hdev->dev,
"CB mmap failed, no match to handle %d\n", handle); "CB mmap failed, no match to handle 0x%x\n", handle);
return -EINVAL; return -EINVAL;
} }
...@@ -347,7 +357,7 @@ struct hl_cb *hl_cb_get(struct hl_device *hdev, struct hl_cb_mgr *mgr, ...@@ -347,7 +357,7 @@ struct hl_cb *hl_cb_get(struct hl_device *hdev, struct hl_cb_mgr *mgr,
if (!cb) { if (!cb) {
spin_unlock(&mgr->cb_lock); spin_unlock(&mgr->cb_lock);
dev_warn(hdev->dev, dev_warn(hdev->dev,
"CB get failed, no match to handle %d\n", handle); "CB get failed, no match to handle 0x%x\n", handle);
return NULL; return NULL;
} }
......
...@@ -170,24 +170,16 @@ int hl_ctx_put(struct hl_ctx *ctx) ...@@ -170,24 +170,16 @@ int hl_ctx_put(struct hl_ctx *ctx)
struct dma_fence *hl_ctx_get_fence(struct hl_ctx *ctx, u64 seq) struct dma_fence *hl_ctx_get_fence(struct hl_ctx *ctx, u64 seq)
{ {
struct hl_device *hdev = ctx->hdev;
struct dma_fence *fence; struct dma_fence *fence;
spin_lock(&ctx->cs_lock); spin_lock(&ctx->cs_lock);
if (seq >= ctx->cs_sequence) { if (seq >= ctx->cs_sequence) {
dev_notice_ratelimited(hdev->dev,
"Can't wait on seq %llu because current CS is at seq %llu\n",
seq, ctx->cs_sequence);
spin_unlock(&ctx->cs_lock); spin_unlock(&ctx->cs_lock);
return ERR_PTR(-EINVAL); return ERR_PTR(-EINVAL);
} }
if (seq + HL_MAX_PENDING_CS < ctx->cs_sequence) { if (seq + HL_MAX_PENDING_CS < ctx->cs_sequence) {
dev_dbg(hdev->dev,
"Can't wait on seq %llu because current CS is at seq %llu (Fence is gone)\n",
seq, ctx->cs_sequence);
spin_unlock(&ctx->cs_lock); spin_unlock(&ctx->cs_lock);
return NULL; return NULL;
} }
......
...@@ -970,6 +970,98 @@ static ssize_t hl_device_write(struct file *f, const char __user *buf, ...@@ -970,6 +970,98 @@ static ssize_t hl_device_write(struct file *f, const char __user *buf,
return count; return count;
} }
static ssize_t hl_clk_gate_read(struct file *f, char __user *buf,
size_t count, loff_t *ppos)
{
struct hl_dbg_device_entry *entry = file_inode(f)->i_private;
struct hl_device *hdev = entry->hdev;
char tmp_buf[200];
ssize_t rc;
if (*ppos)
return 0;
sprintf(tmp_buf, "%d\n", hdev->clock_gating);
rc = simple_read_from_buffer(buf, strlen(tmp_buf) + 1, ppos, tmp_buf,
strlen(tmp_buf) + 1);
return rc;
}
static ssize_t hl_clk_gate_write(struct file *f, const char __user *buf,
size_t count, loff_t *ppos)
{
struct hl_dbg_device_entry *entry = file_inode(f)->i_private;
struct hl_device *hdev = entry->hdev;
u32 value;
ssize_t rc;
if (atomic_read(&hdev->in_reset)) {
dev_warn_ratelimited(hdev->dev,
"Can't change clock gating during reset\n");
return 0;
}
rc = kstrtouint_from_user(buf, count, 10, &value);
if (rc)
return rc;
if (value) {
hdev->clock_gating = 1;
if (hdev->asic_funcs->enable_clock_gating)
hdev->asic_funcs->enable_clock_gating(hdev);
} else {
if (hdev->asic_funcs->disable_clock_gating)
hdev->asic_funcs->disable_clock_gating(hdev);
hdev->clock_gating = 0;
}
return count;
}
static ssize_t hl_stop_on_err_read(struct file *f, char __user *buf,
size_t count, loff_t *ppos)
{
struct hl_dbg_device_entry *entry = file_inode(f)->i_private;
struct hl_device *hdev = entry->hdev;
char tmp_buf[200];
ssize_t rc;
if (*ppos)
return 0;
sprintf(tmp_buf, "%d\n", hdev->stop_on_err);
rc = simple_read_from_buffer(buf, strlen(tmp_buf) + 1, ppos, tmp_buf,
strlen(tmp_buf) + 1);
return rc;
}
static ssize_t hl_stop_on_err_write(struct file *f, const char __user *buf,
size_t count, loff_t *ppos)
{
struct hl_dbg_device_entry *entry = file_inode(f)->i_private;
struct hl_device *hdev = entry->hdev;
u32 value;
ssize_t rc;
if (atomic_read(&hdev->in_reset)) {
dev_warn_ratelimited(hdev->dev,
"Can't change stop on error during reset\n");
return 0;
}
rc = kstrtouint_from_user(buf, count, 10, &value);
if (rc)
return rc;
hdev->stop_on_err = value ? 1 : 0;
hl_device_reset(hdev, false, false);
return count;
}
static const struct file_operations hl_data32b_fops = { static const struct file_operations hl_data32b_fops = {
.owner = THIS_MODULE, .owner = THIS_MODULE,
.read = hl_data_read32, .read = hl_data_read32,
...@@ -1015,6 +1107,18 @@ static const struct file_operations hl_device_fops = { ...@@ -1015,6 +1107,18 @@ static const struct file_operations hl_device_fops = {
.write = hl_device_write .write = hl_device_write
}; };
static const struct file_operations hl_clk_gate_fops = {
.owner = THIS_MODULE,
.read = hl_clk_gate_read,
.write = hl_clk_gate_write
};
static const struct file_operations hl_stop_on_err_fops = {
.owner = THIS_MODULE,
.read = hl_stop_on_err_read,
.write = hl_stop_on_err_write
};
static const struct hl_info_list hl_debugfs_list[] = { static const struct hl_info_list hl_debugfs_list[] = {
{"command_buffers", command_buffers_show, NULL}, {"command_buffers", command_buffers_show, NULL},
{"command_submission", command_submission_show, NULL}, {"command_submission", command_submission_show, NULL},
...@@ -1152,6 +1256,18 @@ void hl_debugfs_add_device(struct hl_device *hdev) ...@@ -1152,6 +1256,18 @@ void hl_debugfs_add_device(struct hl_device *hdev)
dev_entry, dev_entry,
&hl_device_fops); &hl_device_fops);
debugfs_create_file("clk_gate",
0200,
dev_entry->root,
dev_entry,
&hl_clk_gate_fops);
debugfs_create_file("stop_on_err",
0644,
dev_entry->root,
dev_entry,
&hl_stop_on_err_fops);
for (i = 0, entry = dev_entry->entry_arr ; i < count ; i++, entry++) { for (i = 0, entry = dev_entry->entry_arr ; i < count ; i++, entry++) {
ent = debugfs_create_file(hl_debugfs_list[i].name, ent = debugfs_create_file(hl_debugfs_list[i].name,
......
...@@ -256,6 +256,10 @@ static int device_early_init(struct hl_device *hdev) ...@@ -256,6 +256,10 @@ static int device_early_init(struct hl_device *hdev)
goya_set_asic_funcs(hdev); goya_set_asic_funcs(hdev);
strlcpy(hdev->asic_name, "GOYA", sizeof(hdev->asic_name)); strlcpy(hdev->asic_name, "GOYA", sizeof(hdev->asic_name));
break; break;
case ASIC_GAUDI:
gaudi_set_asic_funcs(hdev);
sprintf(hdev->asic_name, "GAUDI");
break;
default: default:
dev_err(hdev->dev, "Unrecognized ASIC type %d\n", dev_err(hdev->dev, "Unrecognized ASIC type %d\n",
hdev->asic_type); hdev->asic_type);
...@@ -603,6 +607,9 @@ int hl_device_set_debug_mode(struct hl_device *hdev, bool enable) ...@@ -603,6 +607,9 @@ int hl_device_set_debug_mode(struct hl_device *hdev, bool enable)
hdev->in_debug = 0; hdev->in_debug = 0;
if (!hdev->hard_reset_pending)
hdev->asic_funcs->enable_clock_gating(hdev);
goto out; goto out;
} }
...@@ -613,6 +620,7 @@ int hl_device_set_debug_mode(struct hl_device *hdev, bool enable) ...@@ -613,6 +620,7 @@ int hl_device_set_debug_mode(struct hl_device *hdev, bool enable)
goto out; goto out;
} }
hdev->asic_funcs->disable_clock_gating(hdev);
hdev->in_debug = 1; hdev->in_debug = 1;
out: out:
...@@ -1062,7 +1070,7 @@ int hl_device_reset(struct hl_device *hdev, bool hard_reset, ...@@ -1062,7 +1070,7 @@ int hl_device_reset(struct hl_device *hdev, bool hard_reset,
*/ */
int hl_device_init(struct hl_device *hdev, struct class *hclass) int hl_device_init(struct hl_device *hdev, struct class *hclass)
{ {
int i, rc, cq_ready_cnt; int i, rc, cq_cnt, cq_ready_cnt;
char *name; char *name;
bool add_cdev_sysfs_on_err = false; bool add_cdev_sysfs_on_err = false;
...@@ -1120,14 +1128,16 @@ int hl_device_init(struct hl_device *hdev, struct class *hclass) ...@@ -1120,14 +1128,16 @@ int hl_device_init(struct hl_device *hdev, struct class *hclass)
goto sw_fini; goto sw_fini;
} }
cq_cnt = hdev->asic_prop.completion_queues_count;
/* /*
* Initialize the completion queues. Must be done before hw_init, * Initialize the completion queues. Must be done before hw_init,
* because there the addresses of the completion queues are being * because there the addresses of the completion queues are being
* passed as arguments to request_irq * passed as arguments to request_irq
*/ */
hdev->completion_queue = hdev->completion_queue = kcalloc(cq_cnt,
kcalloc(hdev->asic_prop.completion_queues_count, sizeof(*hdev->completion_queue),
sizeof(*hdev->completion_queue), GFP_KERNEL); GFP_KERNEL);
if (!hdev->completion_queue) { if (!hdev->completion_queue) {
dev_err(hdev->dev, "failed to allocate completion queues\n"); dev_err(hdev->dev, "failed to allocate completion queues\n");
...@@ -1135,10 +1145,9 @@ int hl_device_init(struct hl_device *hdev, struct class *hclass) ...@@ -1135,10 +1145,9 @@ int hl_device_init(struct hl_device *hdev, struct class *hclass)
goto hw_queues_destroy; goto hw_queues_destroy;
} }
for (i = 0, cq_ready_cnt = 0; for (i = 0, cq_ready_cnt = 0 ; i < cq_cnt ; i++, cq_ready_cnt++) {
i < hdev->asic_prop.completion_queues_count; rc = hl_cq_init(hdev, &hdev->completion_queue[i],
i++, cq_ready_cnt++) { hdev->asic_funcs->get_queue_id_for_cq(hdev, i));
rc = hl_cq_init(hdev, &hdev->completion_queue[i], i);
if (rc) { if (rc) {
dev_err(hdev->dev, dev_err(hdev->dev,
"failed to initialize completion queue\n"); "failed to initialize completion queue\n");
...@@ -1325,11 +1334,12 @@ void hl_device_fini(struct hl_device *hdev) ...@@ -1325,11 +1334,12 @@ void hl_device_fini(struct hl_device *hdev)
* This function is competing with the reset function, so try to * This function is competing with the reset function, so try to
* take the reset atomic and if we are already in middle of reset, * take the reset atomic and if we are already in middle of reset,
* wait until reset function is finished. Reset function is designed * wait until reset function is finished. Reset function is designed
* to always finish (could take up to a few seconds in worst case). * to always finish. However, in Gaudi, because of all the network
* ports, the hard reset could take between 10-30 seconds
*/ */
timeout = ktime_add_us(ktime_get(), timeout = ktime_add_us(ktime_get(),
HL_PENDING_RESET_PER_SEC * 1000 * 1000 * 4); HL_HARD_RESET_MAX_TIMEOUT * 1000 * 1000);
rc = atomic_cmpxchg(&hdev->in_reset, 0, 1); rc = atomic_cmpxchg(&hdev->in_reset, 0, 1);
while (rc) { while (rc) {
usleep_range(50, 200); usleep_range(50, 200);
......
...@@ -6,20 +6,22 @@ ...@@ -6,20 +6,22 @@
*/ */
#include "habanalabs.h" #include "habanalabs.h"
#include "include/hl_boot_if.h"
#include <linux/firmware.h> #include <linux/firmware.h>
#include <linux/genalloc.h> #include <linux/genalloc.h>
#include <linux/io-64-nonatomic-lo-hi.h> #include <linux/io-64-nonatomic-lo-hi.h>
#include <linux/slab.h>
/** /**
* hl_fw_push_fw_to_device() - Push FW code to device. * hl_fw_load_fw_to_device() - Load F/W code to device's memory.
* @hdev: pointer to hl_device structure. * @hdev: pointer to hl_device structure.
* *
* Copy fw code from firmware file to device memory. * Copy fw code from firmware file to device memory.
* *
* Return: 0 on success, non-zero for failure. * Return: 0 on success, non-zero for failure.
*/ */
int hl_fw_push_fw_to_device(struct hl_device *hdev, const char *fw_name, int hl_fw_load_fw_to_device(struct hl_device *hdev, const char *fw_name,
void __iomem *dst) void __iomem *dst)
{ {
const struct firmware *fw; const struct firmware *fw;
...@@ -129,6 +131,68 @@ int hl_fw_send_cpu_message(struct hl_device *hdev, u32 hw_queue_id, u32 *msg, ...@@ -129,6 +131,68 @@ int hl_fw_send_cpu_message(struct hl_device *hdev, u32 hw_queue_id, u32 *msg,
return rc; return rc;
} }
int hl_fw_unmask_irq(struct hl_device *hdev, u16 event_type)
{
struct armcp_packet pkt;
long result;
int rc;
memset(&pkt, 0, sizeof(pkt));
pkt.ctl = cpu_to_le32(ARMCP_PACKET_UNMASK_RAZWI_IRQ <<
ARMCP_PKT_CTL_OPCODE_SHIFT);
pkt.value = cpu_to_le64(event_type);
rc = hdev->asic_funcs->send_cpu_message(hdev, (u32 *) &pkt, sizeof(pkt),
HL_DEVICE_TIMEOUT_USEC, &result);
if (rc)
dev_err(hdev->dev, "failed to unmask RAZWI IRQ %d", event_type);
return rc;
}
int hl_fw_unmask_irq_arr(struct hl_device *hdev, const u32 *irq_arr,
size_t irq_arr_size)
{
struct armcp_unmask_irq_arr_packet *pkt;
size_t total_pkt_size;
long result;
int rc;
total_pkt_size = sizeof(struct armcp_unmask_irq_arr_packet) +
irq_arr_size;
/* data should be aligned to 8 bytes in order to ArmCP to copy it */
total_pkt_size = (total_pkt_size + 0x7) & ~0x7;
/* total_pkt_size is casted to u16 later on */
if (total_pkt_size > USHRT_MAX) {
dev_err(hdev->dev, "too many elements in IRQ array\n");
return -EINVAL;
}
pkt = kzalloc(total_pkt_size, GFP_KERNEL);
if (!pkt)
return -ENOMEM;
pkt->length = cpu_to_le32(irq_arr_size / sizeof(irq_arr[0]));
memcpy(&pkt->irqs, irq_arr, irq_arr_size);
pkt->armcp_pkt.ctl = cpu_to_le32(ARMCP_PACKET_UNMASK_RAZWI_IRQ_ARRAY <<
ARMCP_PKT_CTL_OPCODE_SHIFT);
rc = hdev->asic_funcs->send_cpu_message(hdev, (u32 *) pkt,
total_pkt_size, HL_DEVICE_TIMEOUT_USEC, &result);
if (rc)
dev_err(hdev->dev, "failed to unmask IRQ array\n");
kfree(pkt);
return rc;
}
int hl_fw_test_cpu_queue(struct hl_device *hdev) int hl_fw_test_cpu_queue(struct hl_device *hdev)
{ {
struct armcp_packet test_pkt = {}; struct armcp_packet test_pkt = {};
...@@ -286,3 +350,232 @@ int hl_fw_get_eeprom_data(struct hl_device *hdev, void *data, size_t max_size) ...@@ -286,3 +350,232 @@ int hl_fw_get_eeprom_data(struct hl_device *hdev, void *data, size_t max_size)
return rc; return rc;
} }
static void fw_read_errors(struct hl_device *hdev, u32 boot_err0_reg)
{
u32 err_val;
/* Some of the firmware status codes are deprecated in newer f/w
* versions. In those versions, the errors are reported
* in different registers. Therefore, we need to check those
* registers and print the exact errors. Moreover, there
* may be multiple errors, so we need to report on each error
* separately. Some of the error codes might indicate a state
* that is not an error per-se, but it is an error in production
* environment
*/
err_val = RREG32(boot_err0_reg);
if (!(err_val & CPU_BOOT_ERR0_ENABLED))
return;
if (err_val & CPU_BOOT_ERR0_DRAM_INIT_FAIL)
dev_err(hdev->dev,
"Device boot error - DRAM initialization failed\n");
if (err_val & CPU_BOOT_ERR0_FIT_CORRUPTED)
dev_err(hdev->dev, "Device boot error - FIT image corrupted\n");
if (err_val & CPU_BOOT_ERR0_TS_INIT_FAIL)
dev_err(hdev->dev,
"Device boot error - Thermal Sensor initialization failed\n");
if (err_val & CPU_BOOT_ERR0_DRAM_SKIPPED)
dev_warn(hdev->dev,
"Device boot warning - Skipped DRAM initialization\n");
if (err_val & CPU_BOOT_ERR0_BMC_WAIT_SKIPPED)
dev_warn(hdev->dev,
"Device boot error - Skipped waiting for BMC\n");
if (err_val & CPU_BOOT_ERR0_NIC_DATA_NOT_RDY)
dev_err(hdev->dev,
"Device boot error - Serdes data from BMC not available\n");
if (err_val & CPU_BOOT_ERR0_NIC_FW_FAIL)
dev_err(hdev->dev,
"Device boot error - NIC F/W initialization failed\n");
}
int hl_fw_init_cpu(struct hl_device *hdev, u32 cpu_boot_status_reg,
u32 msg_to_cpu_reg, u32 cpu_msg_status_reg,
u32 boot_err0_reg, bool skip_bmc,
u32 cpu_timeout, u32 boot_fit_timeout)
{
u32 status;
int rc;
dev_info(hdev->dev, "Going to wait for device boot (up to %lds)\n",
cpu_timeout / USEC_PER_SEC);
/* Wait for boot FIT request */
rc = hl_poll_timeout(
hdev,
cpu_boot_status_reg,
status,
status == CPU_BOOT_STATUS_WAITING_FOR_BOOT_FIT,
10000,
boot_fit_timeout);
if (rc) {
dev_dbg(hdev->dev,
"No boot fit request received, resuming boot\n");
} else {
rc = hdev->asic_funcs->load_boot_fit_to_device(hdev);
if (rc)
goto out;
/* Clear device CPU message status */
WREG32(cpu_msg_status_reg, CPU_MSG_CLR);
/* Signal device CPU that boot loader is ready */
WREG32(msg_to_cpu_reg, KMD_MSG_FIT_RDY);
/* Poll for CPU device ack */
rc = hl_poll_timeout(
hdev,
cpu_msg_status_reg,
status,
status == CPU_MSG_OK,
10000,
boot_fit_timeout);
if (rc) {
dev_err(hdev->dev,
"Timeout waiting for boot fit load ack\n");
goto out;
}
/* Clear message */
WREG32(msg_to_cpu_reg, KMD_MSG_NA);
}
/* Make sure CPU boot-loader is running */
rc = hl_poll_timeout(
hdev,
cpu_boot_status_reg,
status,
(status == CPU_BOOT_STATUS_DRAM_RDY) ||
(status == CPU_BOOT_STATUS_NIC_FW_RDY) ||
(status == CPU_BOOT_STATUS_READY_TO_BOOT) ||
(status == CPU_BOOT_STATUS_SRAM_AVAIL),
10000,
cpu_timeout);
/* Read U-Boot, preboot versions now in case we will later fail */
hdev->asic_funcs->read_device_fw_version(hdev, FW_COMP_UBOOT);
hdev->asic_funcs->read_device_fw_version(hdev, FW_COMP_PREBOOT);
/* Some of the status codes below are deprecated in newer f/w
* versions but we keep them here for backward compatibility
*/
if (rc) {
switch (status) {
case CPU_BOOT_STATUS_NA:
dev_err(hdev->dev,
"Device boot error - BTL did NOT run\n");
break;
case CPU_BOOT_STATUS_IN_WFE:
dev_err(hdev->dev,
"Device boot error - Stuck inside WFE loop\n");
break;
case CPU_BOOT_STATUS_IN_BTL:
dev_err(hdev->dev,
"Device boot error - Stuck in BTL\n");
break;
case CPU_BOOT_STATUS_IN_PREBOOT:
dev_err(hdev->dev,
"Device boot error - Stuck in Preboot\n");
break;
case CPU_BOOT_STATUS_IN_SPL:
dev_err(hdev->dev,
"Device boot error - Stuck in SPL\n");
break;
case CPU_BOOT_STATUS_IN_UBOOT:
dev_err(hdev->dev,
"Device boot error - Stuck in u-boot\n");
break;
case CPU_BOOT_STATUS_DRAM_INIT_FAIL:
dev_err(hdev->dev,
"Device boot error - DRAM initialization failed\n");
break;
case CPU_BOOT_STATUS_UBOOT_NOT_READY:
dev_err(hdev->dev,
"Device boot error - u-boot stopped by user\n");
break;
case CPU_BOOT_STATUS_TS_INIT_FAIL:
dev_err(hdev->dev,
"Device boot error - Thermal Sensor initialization failed\n");
break;
default:
dev_err(hdev->dev,
"Device boot error - Invalid status code %d\n",
status);
break;
}
rc = -EIO;
goto out;
}
if (!hdev->fw_loading) {
dev_info(hdev->dev, "Skip loading FW\n");
goto out;
}
if (status == CPU_BOOT_STATUS_SRAM_AVAIL)
goto out;
dev_info(hdev->dev,
"Loading firmware to device, may take some time...\n");
rc = hdev->asic_funcs->load_firmware_to_device(hdev);
if (rc)
goto out;
if (skip_bmc) {
WREG32(msg_to_cpu_reg, KMD_MSG_SKIP_BMC);
rc = hl_poll_timeout(
hdev,
cpu_boot_status_reg,
status,
(status == CPU_BOOT_STATUS_BMC_WAITING_SKIPPED),
10000,
cpu_timeout);
if (rc) {
dev_err(hdev->dev,
"Failed to get ACK on skipping BMC, %d\n",
status);
WREG32(msg_to_cpu_reg, KMD_MSG_NA);
rc = -EIO;
goto out;
}
}
WREG32(msg_to_cpu_reg, KMD_MSG_FIT_RDY);
rc = hl_poll_timeout(
hdev,
cpu_boot_status_reg,
status,
(status == CPU_BOOT_STATUS_SRAM_AVAIL),
10000,
cpu_timeout);
/* Clear message */
WREG32(msg_to_cpu_reg, KMD_MSG_NA);
if (rc) {
if (status == CPU_BOOT_STATUS_FIT_CORRUPTED)
dev_err(hdev->dev,
"Device reports FIT image is corrupted\n");
else
dev_err(hdev->dev,
"Device failed to load, %d\n", status);
rc = -EIO;
goto out;
}
dev_info(hdev->dev, "Successfully loaded firmware to device\n");
out:
fw_read_errors(hdev, boot_err0_reg);
return rc;
}
# SPDX-License-Identifier: GPL-2.0-only
subdir-ccflags-y += -I$(src)
HL_GAUDI_FILES := gaudi/gaudi.o gaudi/gaudi_hwmgr.o gaudi/gaudi_security.o \
gaudi/gaudi_coresight.o
This diff is collapsed.
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2019-2020 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
#ifndef GAUDIP_H_
#define GAUDIP_H_
#include <uapi/misc/habanalabs.h>
#include "habanalabs.h"
#include "include/hl_boot_if.h"
#include "include/gaudi/gaudi_packets.h"
#include "include/gaudi/gaudi.h"
#include "include/gaudi/gaudi_async_events.h"
#define NUMBER_OF_EXT_HW_QUEUES 12
#define NUMBER_OF_CMPLT_QUEUES NUMBER_OF_EXT_HW_QUEUES
#define NUMBER_OF_CPU_HW_QUEUES 1
#define NUMBER_OF_INT_HW_QUEUES 100
#define NUMBER_OF_HW_QUEUES (NUMBER_OF_EXT_HW_QUEUES + \
NUMBER_OF_CPU_HW_QUEUES + \
NUMBER_OF_INT_HW_QUEUES)
/*
* Number of MSI interrupts IDS:
* Each completion queue has 1 ID
* The event queue has 1 ID
*/
#define NUMBER_OF_INTERRUPTS (NUMBER_OF_CMPLT_QUEUES + \
NUMBER_OF_CPU_HW_QUEUES)
#if (NUMBER_OF_INTERRUPTS > GAUDI_MSI_ENTRIES)
#error "Number of MSI interrupts must be smaller or equal to GAUDI_MSI_ENTRIES"
#endif
#define QMAN_FENCE_TIMEOUT_USEC 10000 /* 10 ms */
#define CORESIGHT_TIMEOUT_USEC 100000 /* 100 ms */
#define GAUDI_MAX_CLK_FREQ 2200000000ull /* 2200 MHz */
#define MAX_POWER_DEFAULT 200000 /* 200W */
#define GAUDI_CPU_TIMEOUT_USEC 15000000 /* 15s */
#define TPC_ENABLED_MASK 0xFF
#define GAUDI_HBM_SIZE_32GB 0x800000000ull
#define GAUDI_HBM_DEVICES 4
#define GAUDI_HBM_CHANNELS 8
#define GAUDI_HBM_CFG_BASE (mmHBM0_BASE - CFG_BASE)
#define GAUDI_HBM_CFG_OFFSET (mmHBM1_BASE - mmHBM0_BASE)
#define DMA_MAX_TRANSFER_SIZE U32_MAX
#define GAUDI_DEFAULT_CARD_NAME "HL2000"
#define PCI_DMA_NUMBER_OF_CHNLS 3
#define HBM_DMA_NUMBER_OF_CHNLS 5
#define DMA_NUMBER_OF_CHNLS (PCI_DMA_NUMBER_OF_CHNLS + \
HBM_DMA_NUMBER_OF_CHNLS)
#define MME_NUMBER_OF_SLAVE_ENGINES 2
#define MME_NUMBER_OF_ENGINES (MME_NUMBER_OF_MASTER_ENGINES + \
MME_NUMBER_OF_SLAVE_ENGINES)
#define MME_NUMBER_OF_QMANS (MME_NUMBER_OF_MASTER_ENGINES * \
QMAN_STREAMS)
#define QMAN_STREAMS 4
#define DMA_QMAN_OFFSET (mmDMA1_QM_BASE - mmDMA0_QM_BASE)
#define TPC_QMAN_OFFSET (mmTPC1_QM_BASE - mmTPC0_QM_BASE)
#define MME_QMAN_OFFSET (mmMME1_QM_BASE - mmMME0_QM_BASE)
#define NIC_MACRO_QMAN_OFFSET (mmNIC1_QM0_BASE - mmNIC0_QM0_BASE)
#define TPC_CFG_OFFSET (mmTPC1_CFG_BASE - mmTPC0_CFG_BASE)
#define DMA_CORE_OFFSET (mmDMA1_CORE_BASE - mmDMA0_CORE_BASE)
#define SIF_RTR_CTRL_OFFSET (mmSIF_RTR_CTRL_1_BASE - mmSIF_RTR_CTRL_0_BASE)
#define NIF_RTR_CTRL_OFFSET (mmNIF_RTR_CTRL_1_BASE - mmNIF_RTR_CTRL_0_BASE)
#define MME_ACC_OFFSET (mmMME1_ACC_BASE - mmMME0_ACC_BASE)
#define SRAM_BANK_OFFSET (mmSRAM_Y0_X1_RTR_BASE - mmSRAM_Y0_X0_RTR_BASE)
#define NUM_OF_SOB_IN_BLOCK \
(((mmSYNC_MNGR_E_N_SYNC_MNGR_OBJS_SOB_OBJ_2047 - \
mmSYNC_MNGR_E_N_SYNC_MNGR_OBJS_SOB_OBJ_0) + 4) >> 2)
#define NUM_OF_MONITORS_IN_BLOCK \
(((mmSYNC_MNGR_E_N_SYNC_MNGR_OBJS_MON_STATUS_511 - \
mmSYNC_MNGR_E_N_SYNC_MNGR_OBJS_MON_STATUS_0) + 4) >> 2)
/* DRAM Memory Map */
#define CPU_FW_IMAGE_SIZE 0x10000000 /* 256MB */
#define MMU_PAGE_TABLES_SIZE 0x0BF00000 /* 191MB */
#define MMU_CACHE_MNG_SIZE 0x00100000 /* 1MB */
#define RESERVED 0x04000000 /* 64MB */
#define CPU_FW_IMAGE_ADDR DRAM_PHYS_BASE
#define MMU_PAGE_TABLES_ADDR (CPU_FW_IMAGE_ADDR + CPU_FW_IMAGE_SIZE)
#define MMU_CACHE_MNG_ADDR (MMU_PAGE_TABLES_ADDR + MMU_PAGE_TABLES_SIZE)
#define DRAM_DRIVER_END_ADDR (MMU_CACHE_MNG_ADDR + MMU_CACHE_MNG_SIZE +\
RESERVED)
#define DRAM_BASE_ADDR_USER 0x20000000
#if (DRAM_DRIVER_END_ADDR > DRAM_BASE_ADDR_USER)
#error "Driver must reserve no more than 512MB"
#endif
/* Internal QMANs PQ sizes */
#define MME_QMAN_LENGTH 64
#define MME_QMAN_SIZE_IN_BYTES (MME_QMAN_LENGTH * QMAN_PQ_ENTRY_SIZE)
#define HBM_DMA_QMAN_LENGTH 64
#define HBM_DMA_QMAN_SIZE_IN_BYTES \
(HBM_DMA_QMAN_LENGTH * QMAN_PQ_ENTRY_SIZE)
#define TPC_QMAN_LENGTH 64
#define TPC_QMAN_SIZE_IN_BYTES (TPC_QMAN_LENGTH * QMAN_PQ_ENTRY_SIZE)
#define SRAM_USER_BASE_OFFSET GAUDI_DRIVER_SRAM_RESERVED_SIZE_FROM_START
/* Virtual address space */
#define VA_HOST_SPACE_START 0x1000000000000ull /* 256TB */
#define VA_HOST_SPACE_END 0x3FF8000000000ull /* 1PB - 1TB */
#define VA_HOST_SPACE_SIZE (VA_HOST_SPACE_END - \
VA_HOST_SPACE_START) /* 767TB */
#define HW_CAP_PLL 0x00000001
#define HW_CAP_HBM 0x00000002
#define HW_CAP_MMU 0x00000004
#define HW_CAP_MME 0x00000008
#define HW_CAP_CPU 0x00000010
#define HW_CAP_PCI_DMA 0x00000020
#define HW_CAP_MSI 0x00000040
#define HW_CAP_CPU_Q 0x00000080
#define HW_CAP_HBM_DMA 0x00000100
#define HW_CAP_CLK_GATE 0x00000200
#define HW_CAP_SRAM_SCRAMBLER 0x00000400
#define HW_CAP_HBM_SCRAMBLER 0x00000800
#define HW_CAP_TPC0 0x01000000
#define HW_CAP_TPC1 0x02000000
#define HW_CAP_TPC2 0x04000000
#define HW_CAP_TPC3 0x08000000
#define HW_CAP_TPC4 0x10000000
#define HW_CAP_TPC5 0x20000000
#define HW_CAP_TPC6 0x40000000
#define HW_CAP_TPC7 0x80000000
#define HW_CAP_TPC_MASK 0xFF000000
#define HW_CAP_TPC_SHIFT 24
#define GAUDI_CPU_PCI_MSB_ADDR(addr) (((addr) & GENMASK_ULL(49, 39)) >> 39)
#define GAUDI_PCI_TO_CPU_ADDR(addr) \
do { \
(addr) &= ~GENMASK_ULL(49, 39); \
(addr) |= BIT_ULL(39); \
} while (0)
#define GAUDI_CPU_TO_PCI_ADDR(addr, extension) \
do { \
(addr) &= ~GENMASK_ULL(49, 39); \
(addr) |= (u64) (extension) << 39; \
} while (0)
enum gaudi_dma_channels {
GAUDI_PCI_DMA_1,
GAUDI_PCI_DMA_2,
GAUDI_PCI_DMA_3,
GAUDI_HBM_DMA_1,
GAUDI_HBM_DMA_2,
GAUDI_HBM_DMA_3,
GAUDI_HBM_DMA_4,
GAUDI_HBM_DMA_5,
GAUDI_DMA_MAX
};
enum gaudi_tpc_mask {
GAUDI_TPC_MASK_TPC0 = 0x01,
GAUDI_TPC_MASK_TPC1 = 0x02,
GAUDI_TPC_MASK_TPC2 = 0x04,
GAUDI_TPC_MASK_TPC3 = 0x08,
GAUDI_TPC_MASK_TPC4 = 0x10,
GAUDI_TPC_MASK_TPC5 = 0x20,
GAUDI_TPC_MASK_TPC6 = 0x40,
GAUDI_TPC_MASK_TPC7 = 0x80,
GAUDI_TPC_MASK_ALL = 0xFF
};
/**
* struct gaudi_internal_qman_info - Internal QMAN information.
* @pq_kernel_addr: Kernel address of the PQ memory area in the host.
* @pq_dma_addr: DMA address of the PQ memory area in the host.
* @pq_size: Size of allocated host memory for PQ.
*/
struct gaudi_internal_qman_info {
void *pq_kernel_addr;
dma_addr_t pq_dma_addr;
size_t pq_size;
};
/**
* struct gaudi_device - ASIC specific manage structure.
* @armcp_info_get: get information on device from ArmCP
* @hw_queues_lock: protects the H/W queues from concurrent access.
* @clk_gate_mutex: protects code areas that require clock gating to be disabled
* temporarily
* @internal_qmans: Internal QMANs information. The array size is larger than
* the actual number of internal queues because they are not in
* consecutive order.
* @hbm_bar_cur_addr: current address of HBM PCI bar.
* @max_freq_value: current max clk frequency.
* @events: array that holds all event id's
* @events_stat: array that holds histogram of all received events.
* @events_stat_aggregate: same as events_stat but doesn't get cleared on reset
* @hw_cap_initialized: This field contains a bit per H/W engine. When that
* engine is initialized, that bit is set by the driver to
* signal we can use this engine in later code paths.
* Each bit is cleared upon reset of its corresponding H/W
* engine.
* @multi_msi_mode: whether we are working in multi MSI single MSI mode.
* Multi MSI is possible only with IOMMU enabled.
* @ext_queue_idx: helper index for external queues initialization.
*/
struct gaudi_device {
int (*armcp_info_get)(struct hl_device *hdev);
/* TODO: remove hw_queues_lock after moving to scheduler code */
spinlock_t hw_queues_lock;
struct mutex clk_gate_mutex;
struct gaudi_internal_qman_info internal_qmans[GAUDI_QUEUE_ID_SIZE];
u64 hbm_bar_cur_addr;
u64 max_freq_value;
u32 events[GAUDI_EVENT_SIZE];
u32 events_stat[GAUDI_EVENT_SIZE];
u32 events_stat_aggregate[GAUDI_EVENT_SIZE];
u32 hw_cap_initialized;
u8 multi_msi_mode;
u8 ext_queue_idx;
};
void gaudi_init_security(struct hl_device *hdev);
void gaudi_add_device_attr(struct hl_device *hdev,
struct attribute_group *dev_attr_grp);
void gaudi_set_pll_profile(struct hl_device *hdev, enum hl_pll_frequency freq);
int gaudi_debug_coresight(struct hl_device *hdev, void *data);
void gaudi_halt_coresight(struct hl_device *hdev);
int gaudi_get_clk_rate(struct hl_device *hdev, u32 *cur_clk, u32 *max_clk);
#endif /* GAUDIP_H_ */
This diff is collapsed.
// SPDX-License-Identifier: GPL-2.0
/*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*/
#include "gaudiP.h"
#include "include/gaudi/gaudi_fw_if.h"
void gaudi_set_pll_profile(struct hl_device *hdev, enum hl_pll_frequency freq)
{
struct gaudi_device *gaudi = hdev->asic_specific;
if (freq == PLL_LAST)
hl_set_frequency(hdev, MME_PLL, gaudi->max_freq_value);
}
int gaudi_get_clk_rate(struct hl_device *hdev, u32 *cur_clk, u32 *max_clk)
{
long value;
if (hl_device_disabled_or_in_reset(hdev))
return -ENODEV;
value = hl_get_frequency(hdev, MME_PLL, false);
if (value < 0) {
dev_err(hdev->dev, "Failed to retrieve device max clock %ld\n",
value);
return value;
}
*max_clk = (value / 1000 / 1000);
value = hl_get_frequency(hdev, MME_PLL, true);
if (value < 0) {
dev_err(hdev->dev,
"Failed to retrieve device current clock %ld\n",
value);
return value;
}
*cur_clk = (value / 1000 / 1000);
return 0;
}
static ssize_t clk_max_freq_mhz_show(struct device *dev,
struct device_attribute *attr, char *buf)
{
struct hl_device *hdev = dev_get_drvdata(dev);
struct gaudi_device *gaudi = hdev->asic_specific;
long value;
if (hl_device_disabled_or_in_reset(hdev))
return -ENODEV;
value = hl_get_frequency(hdev, MME_PLL, false);
gaudi->max_freq_value = value;
return sprintf(buf, "%lu\n", (value / 1000 / 1000));
}
static ssize_t clk_max_freq_mhz_store(struct device *dev,
struct device_attribute *attr, const char *buf, size_t count)
{
struct hl_device *hdev = dev_get_drvdata(dev);
struct gaudi_device *gaudi = hdev->asic_specific;
int rc;
u64 value;
if (hl_device_disabled_or_in_reset(hdev)) {
count = -ENODEV;
goto fail;
}
rc = kstrtoull(buf, 0, &value);
if (rc) {
count = -EINVAL;
goto fail;
}
gaudi->max_freq_value = value * 1000 * 1000;
hl_set_frequency(hdev, MME_PLL, gaudi->max_freq_value);
fail:
return count;
}
static ssize_t clk_cur_freq_mhz_show(struct device *dev,
struct device_attribute *attr, char *buf)
{
struct hl_device *hdev = dev_get_drvdata(dev);
long value;
if (hl_device_disabled_or_in_reset(hdev))
return -ENODEV;
value = hl_get_frequency(hdev, MME_PLL, true);
return sprintf(buf, "%lu\n", (value / 1000 / 1000));
}
static DEVICE_ATTR_RW(clk_max_freq_mhz);
static DEVICE_ATTR_RO(clk_cur_freq_mhz);
static struct attribute *gaudi_dev_attrs[] = {
&dev_attr_clk_max_freq_mhz.attr,
&dev_attr_clk_cur_freq_mhz.attr,
NULL,
};
void gaudi_add_device_attr(struct hl_device *hdev,
struct attribute_group *dev_attr_grp)
{
dev_attr_grp->attrs = gaudi_dev_attrs;
}
This diff is collapsed.
This diff is collapsed.
...@@ -45,7 +45,7 @@ ...@@ -45,7 +45,7 @@
#define CORESIGHT_TIMEOUT_USEC 100000 /* 100 ms */ #define CORESIGHT_TIMEOUT_USEC 100000 /* 100 ms */
#define GOYA_CPU_TIMEOUT_USEC 10000000 /* 10s */ #define GOYA_CPU_TIMEOUT_USEC 15000000 /* 15s */
#define TPC_ENABLED_MASK 0xFF #define TPC_ENABLED_MASK 0xFF
...@@ -149,11 +149,6 @@ ...@@ -149,11 +149,6 @@
#define HW_CAP_GOLDEN 0x00000400 #define HW_CAP_GOLDEN 0x00000400
#define HW_CAP_TPC 0x00000800 #define HW_CAP_TPC 0x00000800
enum goya_fw_component {
FW_COMP_UBOOT,
FW_COMP_PREBOOT
};
struct goya_device { struct goya_device {
/* TODO: remove hw_queues_lock after moving to scheduler code */ /* TODO: remove hw_queues_lock after moving to scheduler code */
spinlock_t hw_queues_lock; spinlock_t hw_queues_lock;
...@@ -221,7 +216,8 @@ void goya_handle_eqe(struct hl_device *hdev, struct hl_eq_entry *eq_entry); ...@@ -221,7 +216,8 @@ void goya_handle_eqe(struct hl_device *hdev, struct hl_eq_entry *eq_entry);
void *goya_get_events_stat(struct hl_device *hdev, bool aggregate, u32 *size); void *goya_get_events_stat(struct hl_device *hdev, bool aggregate, u32 *size);
void goya_add_end_of_cb_packets(struct hl_device *hdev, u64 kernel_address, void goya_add_end_of_cb_packets(struct hl_device *hdev, u64 kernel_address,
u32 len, u64 cq_addr, u32 cq_val, u32 msix_vec); u32 len, u64 cq_addr, u32 cq_val, u32 msix_vec,
bool eb);
int goya_cs_parser(struct hl_device *hdev, struct hl_cs_parser *parser); int goya_cs_parser(struct hl_device *hdev, struct hl_cs_parser *parser);
void *goya_get_int_queue_base(struct hl_device *hdev, u32 queue_id, void *goya_get_int_queue_base(struct hl_device *hdev, u32 queue_id,
dma_addr_t *dma_handle, u16 *queue_len); dma_addr_t *dma_handle, u16 *queue_len);
...@@ -234,5 +230,7 @@ void goya_cpu_accessible_dma_pool_free(struct hl_device *hdev, size_t size, ...@@ -234,5 +230,7 @@ void goya_cpu_accessible_dma_pool_free(struct hl_device *hdev, size_t size,
void goya_mmu_remove_device_cpu_mappings(struct hl_device *hdev); void goya_mmu_remove_device_cpu_mappings(struct hl_device *hdev);
int goya_get_clk_rate(struct hl_device *hdev, u32 *cur_clk, u32 *max_clk); int goya_get_clk_rate(struct hl_device *hdev, u32 *cur_clk, u32 *max_clk);
u32 goya_get_queue_id_for_cq(struct hl_device *hdev, u32 cq_idx);
u64 goya_get_device_time(struct hl_device *hdev);
#endif /* GOYAP_H_ */ #endif /* GOYAP_H_ */
...@@ -266,7 +266,7 @@ static int goya_config_stm(struct hl_device *hdev, ...@@ -266,7 +266,7 @@ static int goya_config_stm(struct hl_device *hdev,
WREG32(base_reg + 0xDF4, 0x80); WREG32(base_reg + 0xDF4, 0x80);
WREG32(base_reg + 0xE8C, input->frequency); WREG32(base_reg + 0xE8C, input->frequency);
WREG32(base_reg + 0xE90, 0x7FF); WREG32(base_reg + 0xE90, 0x7FF);
WREG32(base_reg + 0xE80, 0x7 | (input->id << 16)); WREG32(base_reg + 0xE80, 0x27 | (input->id << 16));
} else { } else {
WREG32(base_reg + 0xE80, 4); WREG32(base_reg + 0xE80, 4);
WREG32(base_reg + 0xD64, 0); WREG32(base_reg + 0xD64, 0);
......
...@@ -683,7 +683,6 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -683,7 +683,6 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask = 1 << ((mmTPC0_CFG_SEMAPHORE & 0x7F) >> 2); mask = 1 << ((mmTPC0_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_VFLAGS & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_SFLAGS & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_LFSR_POLYNOM & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_STATUS & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -695,7 +694,6 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -695,7 +694,6 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC0_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_TPC_STALL & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_MSS_CONFIG & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC0_CFG_TPC_INTR_CAUSE & 0x7F) >> 2); mask |= 1 << ((mmTPC0_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
...@@ -875,6 +873,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -875,6 +873,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC1_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC1_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC1_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC1_WR_REGULATOR_BASE);
pb_addr = (mmTPC1_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC1_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC1_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC1_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC1_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC1_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC1_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -882,6 +890,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -882,6 +890,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC1_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC1_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC1_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC1_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC1_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1057,6 +1069,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1057,6 +1069,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC2_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC2_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC2_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC2_WR_REGULATOR_BASE);
pb_addr = (mmTPC2_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC2_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC2_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC2_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC2_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC2_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC2_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -1064,6 +1086,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1064,6 +1086,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC2_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC2_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC2_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC2_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC2_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1239,6 +1265,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1239,6 +1265,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC3_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC3_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC3_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC3_WR_REGULATOR_BASE);
pb_addr = (mmTPC3_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC3_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC3_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC3_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC3_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC3_CFG_CFG_BASE_ADDRESS_HIGH word_offset = ((mmTPC3_CFG_CFG_BASE_ADDRESS_HIGH
& PROT_BITS_OFFS) >> 7) << 2; & PROT_BITS_OFFS) >> 7) << 2;
...@@ -1246,6 +1282,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1246,6 +1282,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC3_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC3_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC3_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC3_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC3_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1421,6 +1461,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1421,6 +1461,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC4_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC4_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC4_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC4_WR_REGULATOR_BASE);
pb_addr = (mmTPC4_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC4_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC4_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC4_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC4_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC4_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC4_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -1428,6 +1478,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1428,6 +1478,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC4_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC4_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC4_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC4_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC4_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1603,6 +1657,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1603,6 +1657,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC5_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC5_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC5_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC5_WR_REGULATOR_BASE);
pb_addr = (mmTPC5_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC5_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC5_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC5_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC5_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC5_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC5_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -1610,6 +1674,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1610,6 +1674,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC5_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC5_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC5_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC5_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC5_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1785,6 +1853,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1785,6 +1853,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC6_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC6_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC6_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC6_WR_REGULATOR_BASE);
pb_addr = (mmTPC6_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC6_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC6_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC6_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC6_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC6_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC6_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -1792,6 +1870,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1792,6 +1870,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC6_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC6_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC6_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC6_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC6_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
...@@ -1967,6 +2049,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1967,6 +2049,16 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
goya_pb_set_block(hdev, mmTPC7_RD_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC7_RD_REGULATOR_BASE);
goya_pb_set_block(hdev, mmTPC7_WR_REGULATOR_BASE); goya_pb_set_block(hdev, mmTPC7_WR_REGULATOR_BASE);
pb_addr = (mmTPC7_CFG_SEMAPHORE & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC7_CFG_SEMAPHORE & PROT_BITS_OFFS) >> 7) << 2;
mask = 1 << ((mmTPC7_CFG_SEMAPHORE & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_VFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_SFLAGS & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_STATUS & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask);
pb_addr = (mmTPC7_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS; pb_addr = (mmTPC7_CFG_CFG_BASE_ADDRESS_HIGH & ~0xFFF) + PROT_BITS_OFFS;
word_offset = ((mmTPC7_CFG_CFG_BASE_ADDRESS_HIGH & word_offset = ((mmTPC7_CFG_CFG_BASE_ADDRESS_HIGH &
PROT_BITS_OFFS) >> 7) << 2; PROT_BITS_OFFS) >> 7) << 2;
...@@ -1974,6 +2066,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev) ...@@ -1974,6 +2066,10 @@ static void goya_init_tpc_protection_bits(struct hl_device *hdev)
mask |= 1 << ((mmTPC7_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2); mask |= 1 << ((mmTPC7_CFG_CFG_SUBTRACT_VALUE & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2); mask |= 1 << ((mmTPC7_CFG_SM_BASE_ADDRESS_LOW & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2); mask |= 1 << ((mmTPC7_CFG_SM_BASE_ADDRESS_HIGH & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_TPC_STALL & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_MSS_CONFIG & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_TPC_INTR_CAUSE & 0x7F) >> 2);
mask |= 1 << ((mmTPC7_CFG_TPC_INTR_MASK & 0x7F) >> 2);
WREG32(pb_addr + word_offset, ~mask); WREG32(pb_addr + word_offset, ~mask);
......
This diff is collapsed.
...@@ -47,6 +47,7 @@ static const struct pci_device_id ids[] = { ...@@ -47,6 +47,7 @@ static const struct pci_device_id ids[] = {
{ PCI_DEVICE(PCI_VENDOR_ID_HABANALABS, PCI_IDS_GAUDI), }, { PCI_DEVICE(PCI_VENDOR_ID_HABANALABS, PCI_IDS_GAUDI), },
{ 0, } { 0, }
}; };
MODULE_DEVICE_TABLE(pci, ids);
/* /*
* get_asic_type - translate device id to asic type * get_asic_type - translate device id to asic type
...@@ -171,6 +172,7 @@ int hl_device_open(struct inode *inode, struct file *filp) ...@@ -171,6 +172,7 @@ int hl_device_open(struct inode *inode, struct file *filp)
put_pid(hpriv->taskpid); put_pid(hpriv->taskpid);
kfree(hpriv); kfree(hpriv);
return rc; return rc;
} }
...@@ -230,8 +232,15 @@ static void set_driver_behavior_per_device(struct hl_device *hdev) ...@@ -230,8 +232,15 @@ static void set_driver_behavior_per_device(struct hl_device *hdev)
hdev->fw_loading = 1; hdev->fw_loading = 1;
hdev->cpu_queues_enable = 1; hdev->cpu_queues_enable = 1;
hdev->heartbeat = 1; hdev->heartbeat = 1;
hdev->clock_gating = 1;
hdev->reset_pcilink = 0; hdev->reset_pcilink = 0;
hdev->axi_drain = 0;
hdev->sram_scrambler_enable = 1;
hdev->dram_scrambler_enable = 1;
hdev->rl_enable = 1;
hdev->bmc_enable = 1;
hdev->hard_reset_on_fw_events = 1;
} }
/* /*
...@@ -267,11 +276,6 @@ int create_hdev(struct hl_device **dev, struct pci_dev *pdev, ...@@ -267,11 +276,6 @@ int create_hdev(struct hl_device **dev, struct pci_dev *pdev,
dev_err(&pdev->dev, "Unsupported ASIC\n"); dev_err(&pdev->dev, "Unsupported ASIC\n");
rc = -ENODEV; rc = -ENODEV;
goto free_hdev; goto free_hdev;
} else if (hdev->asic_type == ASIC_GAUDI) {
dev_err(&pdev->dev,
"GAUDI is not supported by the current kernel\n");
rc = -ENODEV;
goto free_hdev;
} }
} else { } else {
hdev->asic_type = asic_type; hdev->asic_type = asic_type;
......
...@@ -71,6 +71,8 @@ static int hw_ip_info(struct hl_device *hdev, struct hl_info_args *args) ...@@ -71,6 +71,8 @@ static int hw_ip_info(struct hl_device *hdev, struct hl_info_args *args)
min(CARD_NAME_MAX_LEN, HL_INFO_CARD_NAME_MAX_LEN)); min(CARD_NAME_MAX_LEN, HL_INFO_CARD_NAME_MAX_LEN));
hw_ip.armcp_cpld_version = le32_to_cpu(prop->armcp_info.cpld_version); hw_ip.armcp_cpld_version = le32_to_cpu(prop->armcp_info.cpld_version);
hw_ip.module_id = le32_to_cpu(prop->armcp_info.card_location);
hw_ip.psoc_pci_pll_nr = prop->psoc_pci_pll_nr; hw_ip.psoc_pci_pll_nr = prop->psoc_pci_pll_nr;
hw_ip.psoc_pci_pll_nf = prop->psoc_pci_pll_nf; hw_ip.psoc_pci_pll_nf = prop->psoc_pci_pll_nf;
hw_ip.psoc_pci_pll_od = prop->psoc_pci_pll_od; hw_ip.psoc_pci_pll_od = prop->psoc_pci_pll_od;
...@@ -258,6 +260,22 @@ static int get_reset_count(struct hl_device *hdev, struct hl_info_args *args) ...@@ -258,6 +260,22 @@ static int get_reset_count(struct hl_device *hdev, struct hl_info_args *args)
min((size_t) max_size, sizeof(reset_count))) ? -EFAULT : 0; min((size_t) max_size, sizeof(reset_count))) ? -EFAULT : 0;
} }
static int time_sync_info(struct hl_device *hdev, struct hl_info_args *args)
{
struct hl_info_time_sync time_sync = {0};
u32 max_size = args->return_size;
void __user *out = (void __user *) (uintptr_t) args->return_pointer;
if ((!max_size) || (!out))
return -EINVAL;
time_sync.device_time = hdev->asic_funcs->get_device_time(hdev);
time_sync.host_time = ktime_get_raw_ns();
return copy_to_user(out, &time_sync,
min((size_t) max_size, sizeof(time_sync))) ? -EFAULT : 0;
}
static int _hl_info_ioctl(struct hl_fpriv *hpriv, void *data, static int _hl_info_ioctl(struct hl_fpriv *hpriv, void *data,
struct device *dev) struct device *dev)
{ {
...@@ -315,6 +333,9 @@ static int _hl_info_ioctl(struct hl_fpriv *hpriv, void *data, ...@@ -315,6 +333,9 @@ static int _hl_info_ioctl(struct hl_fpriv *hpriv, void *data,
rc = get_clk_rate(hdev, args); rc = get_clk_rate(hdev, args);
break; break;
case HL_INFO_TIME_SYNC:
return time_sync_info(hdev, args);
default: default:
dev_err(dev, "Invalid request %d\n", args->op); dev_err(dev, "Invalid request %d\n", args->op);
rc = -ENOTTY; rc = -ENOTTY;
......
...@@ -111,7 +111,7 @@ static int ext_queue_sanity_checks(struct hl_device *hdev, ...@@ -111,7 +111,7 @@ static int ext_queue_sanity_checks(struct hl_device *hdev,
bool reserve_cq_entry) bool reserve_cq_entry)
{ {
atomic_t *free_slots = atomic_t *free_slots =
&hdev->completion_queue[q->hw_queue_id].free_slots_cnt; &hdev->completion_queue[q->cq_id].free_slots_cnt;
int free_slots_cnt; int free_slots_cnt;
/* Check we have enough space in the queue */ /* Check we have enough space in the queue */
...@@ -194,7 +194,7 @@ static int hw_queue_sanity_checks(struct hl_device *hdev, struct hl_hw_queue *q, ...@@ -194,7 +194,7 @@ static int hw_queue_sanity_checks(struct hl_device *hdev, struct hl_hw_queue *q,
int num_of_entries) int num_of_entries)
{ {
atomic_t *free_slots = atomic_t *free_slots =
&hdev->completion_queue[q->hw_queue_id].free_slots_cnt; &hdev->completion_queue[q->cq_id].free_slots_cnt;
/* /*
* Check we have enough space in the completion queue. * Check we have enough space in the completion queue.
...@@ -308,13 +308,14 @@ static void ext_queue_schedule_job(struct hl_cs_job *job) ...@@ -308,13 +308,14 @@ static void ext_queue_schedule_job(struct hl_cs_job *job)
* No need to check if CQ is full because it was already * No need to check if CQ is full because it was already
* checked in ext_queue_sanity_checks * checked in ext_queue_sanity_checks
*/ */
cq = &hdev->completion_queue[q->hw_queue_id]; cq = &hdev->completion_queue[q->cq_id];
cq_addr = cq->bus_address + cq->pi * sizeof(struct hl_cq_entry); cq_addr = cq->bus_address + cq->pi * sizeof(struct hl_cq_entry);
hdev->asic_funcs->add_end_of_cb_packets(hdev, cb->kernel_address, len, hdev->asic_funcs->add_end_of_cb_packets(hdev, cb->kernel_address, len,
cq_addr, cq_addr,
le32_to_cpu(cq_pkt.data), le32_to_cpu(cq_pkt.data),
q->hw_queue_id); q->msi_vec,
job->contains_dma_pkt);
q->shadow_queue[hl_pi_2_offset(q->pi)] = job; q->shadow_queue[hl_pi_2_offset(q->pi)] = job;
...@@ -401,21 +402,111 @@ static void hw_queue_schedule_job(struct hl_cs_job *job) ...@@ -401,21 +402,111 @@ static void hw_queue_schedule_job(struct hl_cs_job *job)
* No need to check if CQ is full because it was already * No need to check if CQ is full because it was already
* checked in hw_queue_sanity_checks * checked in hw_queue_sanity_checks
*/ */
cq = &hdev->completion_queue[q->hw_queue_id]; cq = &hdev->completion_queue[q->cq_id];
cq->pi = hl_cq_inc_ptr(cq->pi); cq->pi = hl_cq_inc_ptr(cq->pi);
ext_and_hw_queue_submit_bd(hdev, q, ctl, len, ptr); ext_and_hw_queue_submit_bd(hdev, q, ctl, len, ptr);
} }
/* /*
* hl_hw_queue_schedule_cs - schedule a command submission * init_signal_wait_cs - initialize a signal/wait CS
* * @cs: pointer to the signal/wait CS
* @job : pointer to the CS
* *
* H/W queues spinlock should be taken before calling this function
*/
static void init_signal_wait_cs(struct hl_cs *cs)
{
struct hl_ctx *ctx = cs->ctx;
struct hl_device *hdev = ctx->hdev;
struct hl_hw_queue *hw_queue;
struct hl_cs_compl *cs_cmpl =
container_of(cs->fence, struct hl_cs_compl, base_fence);
struct hl_hw_sob *hw_sob;
struct hl_cs_job *job;
u32 q_idx;
/* There is only one job in a signal/wait CS */
job = list_first_entry(&cs->job_list, struct hl_cs_job,
cs_node);
q_idx = job->hw_queue_id;
hw_queue = &hdev->kernel_queues[q_idx];
if (cs->type & CS_TYPE_SIGNAL) {
hw_sob = &hw_queue->hw_sob[hw_queue->curr_sob_offset];
cs_cmpl->hw_sob = hw_sob;
cs_cmpl->sob_val = hw_queue->next_sob_val++;
dev_dbg(hdev->dev,
"generate signal CB, sob_id: %d, sob val: 0x%x, q_idx: %d\n",
cs_cmpl->hw_sob->sob_id, cs_cmpl->sob_val, q_idx);
hdev->asic_funcs->gen_signal_cb(hdev, job->patched_cb,
cs_cmpl->hw_sob->sob_id);
kref_get(&hw_sob->kref);
/* check for wraparound */
if (hw_queue->next_sob_val == HL_MAX_SOB_VAL) {
/*
* Decrement as we reached the max value.
* The release function won't be called here as we've
* just incremented the refcount.
*/
kref_put(&hw_sob->kref, hl_sob_reset_error);
hw_queue->next_sob_val = 1;
/* only two SOBs are currently in use */
hw_queue->curr_sob_offset =
(hw_queue->curr_sob_offset + 1) %
HL_RSVD_SOBS_IN_USE;
dev_dbg(hdev->dev, "switched to SOB %d, q_idx: %d\n",
hw_queue->curr_sob_offset, q_idx);
}
} else if (cs->type & CS_TYPE_WAIT) {
struct hl_cs_compl *signal_cs_cmpl;
signal_cs_cmpl = container_of(cs->signal_fence,
struct hl_cs_compl,
base_fence);
/* copy the the SOB id and value of the signal CS */
cs_cmpl->hw_sob = signal_cs_cmpl->hw_sob;
cs_cmpl->sob_val = signal_cs_cmpl->sob_val;
dev_dbg(hdev->dev,
"generate wait CB, sob_id: %d, sob_val: 0x%x, mon_id: %d, q_idx: %d\n",
cs_cmpl->hw_sob->sob_id, cs_cmpl->sob_val,
hw_queue->base_mon_id, q_idx);
hdev->asic_funcs->gen_wait_cb(hdev, job->patched_cb,
cs_cmpl->hw_sob->sob_id,
cs_cmpl->sob_val,
hw_queue->base_mon_id,
q_idx);
kref_get(&cs_cmpl->hw_sob->kref);
/*
* Must put the signal fence after the SOB refcnt increment so
* the SOB refcnt won't turn 0 and reset the SOB before the
* wait CS was submitted.
*/
mb();
dma_fence_put(cs->signal_fence);
cs->signal_fence = NULL;
}
}
/*
* hl_hw_queue_schedule_cs - schedule a command submission
* @cs: pointer to the CS
*/ */
int hl_hw_queue_schedule_cs(struct hl_cs *cs) int hl_hw_queue_schedule_cs(struct hl_cs *cs)
{ {
struct hl_device *hdev = cs->ctx->hdev; struct hl_ctx *ctx = cs->ctx;
struct hl_device *hdev = ctx->hdev;
struct hl_cs_job *job, *tmp; struct hl_cs_job *job, *tmp;
struct hl_hw_queue *q; struct hl_hw_queue *q;
int rc = 0, i, cq_cnt; int rc = 0, i, cq_cnt;
...@@ -461,6 +552,9 @@ int hl_hw_queue_schedule_cs(struct hl_cs *cs) ...@@ -461,6 +552,9 @@ int hl_hw_queue_schedule_cs(struct hl_cs *cs)
} }
} }
if ((cs->type == CS_TYPE_SIGNAL) || (cs->type == CS_TYPE_WAIT))
init_signal_wait_cs(cs);
spin_lock(&hdev->hw_queues_mirror_lock); spin_lock(&hdev->hw_queues_mirror_lock);
list_add_tail(&cs->mirror_node, &hdev->hw_queues_mirror_list); list_add_tail(&cs->mirror_node, &hdev->hw_queues_mirror_list);
...@@ -569,6 +663,9 @@ static int ext_and_cpu_queue_init(struct hl_device *hdev, struct hl_hw_queue *q, ...@@ -569,6 +663,9 @@ static int ext_and_cpu_queue_init(struct hl_device *hdev, struct hl_hw_queue *q,
q->ci = 0; q->ci = 0;
q->pi = 0; q->pi = 0;
if (!is_cpu_queue)
hdev->asic_funcs->ext_queue_init(hdev, q->hw_queue_id);
return 0; return 0;
free_queue: free_queue:
...@@ -791,5 +888,8 @@ void hl_hw_queue_reset(struct hl_device *hdev, bool hard_reset) ...@@ -791,5 +888,8 @@ void hl_hw_queue_reset(struct hl_device *hdev, bool hard_reset)
((!hard_reset) && (q->queue_type == QUEUE_TYPE_CPU))) ((!hard_reset) && (q->queue_type == QUEUE_TYPE_CPU)))
continue; continue;
q->pi = q->ci = 0; q->pi = q->ci = 0;
if (q->queue_type == QUEUE_TYPE_EXT)
hdev->asic_funcs->ext_queue_reset(hdev, q->hw_queue_id);
} }
} }
...@@ -200,6 +200,7 @@ static int hl_write(struct device *dev, enum hwmon_sensor_types type, ...@@ -200,6 +200,7 @@ static int hl_write(struct device *dev, enum hwmon_sensor_types type,
case hwmon_temp: case hwmon_temp:
switch (attr) { switch (attr) {
case hwmon_temp_offset: case hwmon_temp_offset:
case hwmon_temp_reset_history:
break; break;
default: default:
return -EINVAL; return -EINVAL;
...@@ -216,6 +217,24 @@ static int hl_write(struct device *dev, enum hwmon_sensor_types type, ...@@ -216,6 +217,24 @@ static int hl_write(struct device *dev, enum hwmon_sensor_types type,
} }
hl_set_pwm_info(hdev, channel, attr, val); hl_set_pwm_info(hdev, channel, attr, val);
break; break;
case hwmon_in:
switch (attr) {
case hwmon_in_reset_history:
break;
default:
return -EINVAL;
}
hl_set_voltage(hdev, channel, attr, val);
break;
case hwmon_curr:
switch (attr) {
case hwmon_curr_reset_history:
break;
default:
return -EINVAL;
}
hl_set_current(hdev, channel, attr, val);
break;
default: default:
return -EINVAL; return -EINVAL;
} }
...@@ -237,6 +256,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type, ...@@ -237,6 +256,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type,
return 0444; return 0444;
case hwmon_temp_offset: case hwmon_temp_offset:
return 0644; return 0644;
case hwmon_temp_reset_history:
return 0200;
} }
break; break;
case hwmon_in: case hwmon_in:
...@@ -246,6 +267,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type, ...@@ -246,6 +267,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type,
case hwmon_in_max: case hwmon_in_max:
case hwmon_in_highest: case hwmon_in_highest:
return 0444; return 0444;
case hwmon_in_reset_history:
return 0200;
} }
break; break;
case hwmon_curr: case hwmon_curr:
...@@ -255,6 +278,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type, ...@@ -255,6 +278,8 @@ static umode_t hl_is_visible(const void *data, enum hwmon_sensor_types type,
case hwmon_curr_max: case hwmon_curr_max:
case hwmon_curr_highest: case hwmon_curr_highest:
return 0444; return 0444;
case hwmon_curr_reset_history:
return 0200;
} }
break; break;
case hwmon_fan: case hwmon_fan:
...@@ -462,6 +487,56 @@ void hl_set_pwm_info(struct hl_device *hdev, int sensor_index, u32 attr, ...@@ -462,6 +487,56 @@ void hl_set_pwm_info(struct hl_device *hdev, int sensor_index, u32 attr,
sensor_index, rc); sensor_index, rc);
} }
int hl_set_voltage(struct hl_device *hdev,
int sensor_index, u32 attr, long value)
{
struct armcp_packet pkt;
int rc;
memset(&pkt, 0, sizeof(pkt));
pkt.ctl = cpu_to_le32(ARMCP_PACKET_VOLTAGE_SET <<
ARMCP_PKT_CTL_OPCODE_SHIFT);
pkt.sensor_index = __cpu_to_le16(sensor_index);
pkt.type = __cpu_to_le16(attr);
pkt.value = __cpu_to_le64(value);
rc = hdev->asic_funcs->send_cpu_message(hdev, (u32 *) &pkt, sizeof(pkt),
SENSORS_PKT_TIMEOUT, NULL);
if (rc)
dev_err(hdev->dev,
"Failed to set voltage of sensor %d, error %d\n",
sensor_index, rc);
return rc;
}
int hl_set_current(struct hl_device *hdev,
int sensor_index, u32 attr, long value)
{
struct armcp_packet pkt;
int rc;
memset(&pkt, 0, sizeof(pkt));
pkt.ctl = cpu_to_le32(ARMCP_PACKET_CURRENT_SET <<
ARMCP_PKT_CTL_OPCODE_SHIFT);
pkt.sensor_index = __cpu_to_le16(sensor_index);
pkt.type = __cpu_to_le16(attr);
pkt.value = __cpu_to_le64(value);
rc = hdev->asic_funcs->send_cpu_message(hdev, (u32 *) &pkt, sizeof(pkt),
SENSORS_PKT_TIMEOUT, NULL);
if (rc)
dev_err(hdev->dev,
"Failed to set current of sensor %d, error %d\n",
sensor_index, rc);
return rc;
}
int hl_hwmon_init(struct hl_device *hdev) int hl_hwmon_init(struct hl_device *hdev)
{ {
struct device *dev = hdev->pdev ? &hdev->pdev->dev : hdev->dev; struct device *dev = hdev->pdev ? &hdev->pdev->dev : hdev->dev;
......
/* SPDX-License-Identifier: GPL-2.0 /* SPDX-License-Identifier: GPL-2.0
* *
* Copyright 2016-2019 HabanaLabs, Ltd. * Copyright 2016-2020 HabanaLabs, Ltd.
* All Rights Reserved. * All Rights Reserved.
* *
*/ */
...@@ -35,7 +35,8 @@ struct hl_eq_entry { ...@@ -35,7 +35,8 @@ struct hl_eq_entry {
enum pq_init_status { enum pq_init_status {
PQ_INIT_STATUS_NA = 0, PQ_INIT_STATUS_NA = 0,
PQ_INIT_STATUS_READY_FOR_CP, PQ_INIT_STATUS_READY_FOR_CP,
PQ_INIT_STATUS_READY_FOR_HOST PQ_INIT_STATUS_READY_FOR_HOST,
PQ_INIT_STATUS_READY_FOR_CP_SINGLE_MSI
}; };
/* /*
...@@ -193,6 +194,16 @@ enum pq_init_status { ...@@ -193,6 +194,16 @@ enum pq_init_status {
* Set the value of the offset property of a specified thermal sensor. * Set the value of the offset property of a specified thermal sensor.
* The packet's arguments specify the desired sensor and the field to * The packet's arguments specify the desired sensor and the field to
* set. * set.
*
* ARMCP_PACKET_VOLTAGE_SET -
* Trigger the reset_history property of a specified voltage sensor.
* The packet's arguments specify the desired sensor and the field to
* set.
*
* ARMCP_PACKET_CURRENT_SET -
* Trigger the reset_history property of a specified current sensor.
* The packet's arguments specify the desired sensor and the field to
* set.
*/ */
enum armcp_packet_id { enum armcp_packet_id {
...@@ -220,6 +231,8 @@ enum armcp_packet_id { ...@@ -220,6 +231,8 @@ enum armcp_packet_id {
ARMCP_PACKET_EEPROM_DATA_GET, /* sysfs */ ARMCP_PACKET_EEPROM_DATA_GET, /* sysfs */
ARMCP_RESERVED, ARMCP_RESERVED,
ARMCP_PACKET_TEMPERATURE_SET, /* sysfs */ ARMCP_PACKET_TEMPERATURE_SET, /* sysfs */
ARMCP_PACKET_VOLTAGE_SET, /* sysfs */
ARMCP_PACKET_CURRENT_SET, /* sysfs */
}; };
#define ARMCP_PACKET_FENCE_VAL 0xFE8CE7A5 #define ARMCP_PACKET_FENCE_VAL 0xFE8CE7A5
...@@ -288,21 +301,24 @@ enum armcp_temp_type { ...@@ -288,21 +301,24 @@ enum armcp_temp_type {
armcp_temp_crit, armcp_temp_crit,
armcp_temp_crit_hyst, armcp_temp_crit_hyst,
armcp_temp_offset = 19, armcp_temp_offset = 19,
armcp_temp_highest = 22 armcp_temp_highest = 22,
armcp_temp_reset_history = 23
}; };
enum armcp_in_attributes { enum armcp_in_attributes {
armcp_in_input, armcp_in_input,
armcp_in_min, armcp_in_min,
armcp_in_max, armcp_in_max,
armcp_in_highest = 7 armcp_in_highest = 7,
armcp_in_reset_history
}; };
enum armcp_curr_attributes { enum armcp_curr_attributes {
armcp_curr_input, armcp_curr_input,
armcp_curr_min, armcp_curr_min,
armcp_curr_max, armcp_curr_max,
armcp_curr_highest = 7 armcp_curr_highest = 7,
armcp_curr_reset_history
}; };
enum armcp_fan_attributes { enum armcp_fan_attributes {
...@@ -335,11 +351,24 @@ struct armcp_sensor { ...@@ -335,11 +351,24 @@ struct armcp_sensor {
__le32 flags; __le32 flags;
}; };
/**
* struct armcp_card_types - ASIC card type.
* @armcp_card_type_pci: PCI card.
* @armcp_card_type_pmc: PCI Mezzanine Card.
*/
enum armcp_card_types {
armcp_card_type_pci,
armcp_card_type_pmc
};
/** /**
* struct armcp_info - Info from ArmCP that is necessary to the host's driver * struct armcp_info - Info from ArmCP that is necessary to the host's driver
* @sensors: available sensors description. * @sensors: available sensors description.
* @kernel_version: ArmCP linux kernel version. * @kernel_version: ArmCP linux kernel version.
* @reserved: reserved field. * @reserved: reserved field.
* @card_type: card configuration type.
* @card_location: in a server, each card has different connections topology
* depending on its location (relevant for PMC card type)
* @cpld_version: CPLD programmed F/W version. * @cpld_version: CPLD programmed F/W version.
* @infineon_version: Infineon main DC-DC version. * @infineon_version: Infineon main DC-DC version.
* @fuse_version: silicon production FUSE information. * @fuse_version: silicon production FUSE information.
...@@ -351,7 +380,9 @@ struct armcp_sensor { ...@@ -351,7 +380,9 @@ struct armcp_sensor {
struct armcp_info { struct armcp_info {
struct armcp_sensor sensors[ARMCP_MAX_SENSORS]; struct armcp_sensor sensors[ARMCP_MAX_SENSORS];
__u8 kernel_version[VERSION_MAX_LEN]; __u8 kernel_version[VERSION_MAX_LEN];
__le32 reserved[3]; __le32 reserved;
__le32 card_type;
__le32 card_location;
__le32 cpld_version; __le32 cpld_version;
__le32 infineon_version; __le32 infineon_version;
__u8 fuse_version[VERSION_MAX_LEN]; __u8 fuse_version[VERSION_MAX_LEN];
......
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
/************************************
** This is an auto-generated file **
** DO NOT EDIT BELOW **
************************************/
#ifndef ASIC_REG_CPU_IF_REGS_H_
#define ASIC_REG_CPU_IF_REGS_H_
/*
*****************************************
* CPU_IF (Prototype: CPU_IF)
*****************************************
*/
#define mmCPU_IF_ARUSER_OVR 0x442104
#define mmCPU_IF_ARUSER_OVR_EN 0x442108
#define mmCPU_IF_AWUSER_OVR 0x44210C
#define mmCPU_IF_AWUSER_OVR_EN 0x442110
#define mmCPU_IF_AXCACHE_OVR 0x442114
#define mmCPU_IF_LOCK_OVR 0x442118
#define mmCPU_IF_PROT_OVR 0x44211C
#define mmCPU_IF_MAX_OUTSTANDING 0x442120
#define mmCPU_IF_EARLY_BRESP_EN 0x442124
#define mmCPU_IF_FORCE_RSP_OK 0x442128
#define mmCPU_IF_CPU_MSB_ADDR 0x44212C
#define mmCPU_IF_AXI_SPLIT_INTR 0x442130
#define mmCPU_IF_TOTAL_WR_CNT 0x442140
#define mmCPU_IF_INFLIGHT_WR_CNT 0x442144
#define mmCPU_IF_TOTAL_RD_CNT 0x442150
#define mmCPU_IF_INFLIGHT_RD_CNT 0x442154
#define mmCPU_IF_PF_PQ_PI 0x442200
#define mmCPU_IF_PQ_BASE_ADDR_LOW 0x442204
#define mmCPU_IF_PQ_BASE_ADDR_HIGH 0x442208
#define mmCPU_IF_PQ_LENGTH 0x44220C
#define mmCPU_IF_CQ_BASE_ADDR_LOW 0x442210
#define mmCPU_IF_CQ_BASE_ADDR_HIGH 0x442214
#define mmCPU_IF_CQ_LENGTH 0x442218
#define mmCPU_IF_EQ_BASE_ADDR_LOW 0x442220
#define mmCPU_IF_EQ_BASE_ADDR_HIGH 0x442224
#define mmCPU_IF_EQ_LENGTH 0x442228
#define mmCPU_IF_EQ_RD_OFFS 0x44222C
#define mmCPU_IF_QUEUE_INIT 0x442230
#define mmCPU_IF_TPC_SERR_INTR_STS 0x442300
#define mmCPU_IF_TPC_SERR_INTR_CLR 0x442304
#define mmCPU_IF_TPC_SERR_INTR_MASK 0x442308
#define mmCPU_IF_TPC_DERR_INTR_STS 0x442310
#define mmCPU_IF_TPC_DERR_INTR_CLR 0x442314
#define mmCPU_IF_TPC_DERR_INTR_MASK 0x442318
#define mmCPU_IF_DMA_SERR_INTR_STS 0x442320
#define mmCPU_IF_DMA_SERR_INTR_CLR 0x442324
#define mmCPU_IF_DMA_SERR_INTR_MASK 0x442328
#define mmCPU_IF_DMA_DERR_INTR_STS 0x442330
#define mmCPU_IF_DMA_DERR_INTR_CLR 0x442334
#define mmCPU_IF_DMA_DERR_INTR_MASK 0x442338
#define mmCPU_IF_SRAM_SERR_INTR_STS 0x442340
#define mmCPU_IF_SRAM_SERR_INTR_CLR 0x442344
#define mmCPU_IF_SRAM_SERR_INTR_MASK 0x442348
#define mmCPU_IF_SRAM_DERR_INTR_STS 0x442350
#define mmCPU_IF_SRAM_DERR_INTR_CLR 0x442354
#define mmCPU_IF_SRAM_DERR_INTR_MASK 0x442358
#define mmCPU_IF_NIC_SERR_INTR_STS 0x442360
#define mmCPU_IF_NIC_SERR_INTR_CLR 0x442364
#define mmCPU_IF_NIC_SERR_INTR_MASK 0x442368
#define mmCPU_IF_NIC_DERR_INTR_STS 0x442370
#define mmCPU_IF_NIC_DERR_INTR_CLR 0x442374
#define mmCPU_IF_NIC_DERR_INTR_MASK 0x442378
#define mmCPU_IF_DMA_IF_SERR_INTR_STS 0x442380
#define mmCPU_IF_DMA_IF_SERR_INTR_CLR 0x442384
#define mmCPU_IF_DMA_IF_SERR_INTR_MASK 0x442388
#define mmCPU_IF_DMA_IF_DERR_INTR_STS 0x442390
#define mmCPU_IF_DMA_IF_DERR_INTR_CLR 0x442394
#define mmCPU_IF_DMA_IF_DERR_INTR_MASK 0x442398
#define mmCPU_IF_HBM_SERR_INTR_STS 0x4423A0
#define mmCPU_IF_HBM_SERR_INTR_CLR 0x4423A4
#define mmCPU_IF_HBM_SERR_INTR_MASK 0x4423A8
#define mmCPU_IF_HBM_DERR_INTR_STS 0x4423B0
#define mmCPU_IF_HBM_DERR_INTR_CLR 0x4423B4
#define mmCPU_IF_HBM_DERR_INTR_MASK 0x4423B8
#define mmCPU_IF_PLL_SEI_INTR_STS 0x442400
#define mmCPU_IF_PLL_SEI_INTR_CLR 0x442404
#define mmCPU_IF_PLL_SEI_INTR_MASK 0x442408
#define mmCPU_IF_NIC_SEI_INTR_STS 0x442410
#define mmCPU_IF_NIC_SEI_INTR_CLR 0x442414
#define mmCPU_IF_NIC_SEI_INTR_MASK 0x442418
#define mmCPU_IF_DMA_SEI_INTR_STS 0x442420
#define mmCPU_IF_DMA_SEI_INTR_CLR 0x442424
#define mmCPU_IF_DMA_SEI_INTR_MASK 0x442428
#define mmCPU_IF_DMA_IF_SEI_INTR_STS 0x442430
#define mmCPU_IF_DMA_IF_SEI_INTR_CLR 0x442434
#define mmCPU_IF_DMA_IF_SEI_INTR_MASK 0x442438
#endif /* ASIC_REG_CPU_IF_REGS_H_ */
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
/************************************
** This is an auto-generated file **
** DO NOT EDIT BELOW **
************************************/
#ifndef ASIC_REG_DMA0_CORE_REGS_H_
#define ASIC_REG_DMA0_CORE_REGS_H_
/*
*****************************************
* DMA0_CORE (Prototype: DMA_CORE)
*****************************************
*/
#define mmDMA0_CORE_CFG_0 0x500000
#define mmDMA0_CORE_CFG_1 0x500004
#define mmDMA0_CORE_LBW_MAX_OUTSTAND 0x500008
#define mmDMA0_CORE_SRC_BASE_LO 0x500014
#define mmDMA0_CORE_SRC_BASE_HI 0x500018
#define mmDMA0_CORE_DST_BASE_LO 0x50001C
#define mmDMA0_CORE_DST_BASE_HI 0x500020
#define mmDMA0_CORE_SRC_TSIZE_1 0x50002C
#define mmDMA0_CORE_SRC_STRIDE_1 0x500030
#define mmDMA0_CORE_SRC_TSIZE_2 0x500034
#define mmDMA0_CORE_SRC_STRIDE_2 0x500038
#define mmDMA0_CORE_SRC_TSIZE_3 0x50003C
#define mmDMA0_CORE_SRC_STRIDE_3 0x500040
#define mmDMA0_CORE_SRC_TSIZE_4 0x500044
#define mmDMA0_CORE_SRC_STRIDE_4 0x500048
#define mmDMA0_CORE_SRC_TSIZE_0 0x50004C
#define mmDMA0_CORE_DST_TSIZE_1 0x500054
#define mmDMA0_CORE_DST_STRIDE_1 0x500058
#define mmDMA0_CORE_DST_TSIZE_2 0x50005C
#define mmDMA0_CORE_DST_STRIDE_2 0x500060
#define mmDMA0_CORE_DST_TSIZE_3 0x500064
#define mmDMA0_CORE_DST_STRIDE_3 0x500068
#define mmDMA0_CORE_DST_TSIZE_4 0x50006C
#define mmDMA0_CORE_DST_STRIDE_4 0x500070
#define mmDMA0_CORE_DST_TSIZE_0 0x500074
#define mmDMA0_CORE_COMMIT 0x500078
#define mmDMA0_CORE_WR_COMP_WDATA 0x50007C
#define mmDMA0_CORE_WR_COMP_ADDR_LO 0x500080
#define mmDMA0_CORE_WR_COMP_ADDR_HI 0x500084
#define mmDMA0_CORE_WR_COMP_AWUSER_31_11 0x500088
#define mmDMA0_CORE_TE_NUMROWS 0x500094
#define mmDMA0_CORE_PROT 0x5000B8
#define mmDMA0_CORE_SECURE_PROPS 0x5000F0
#define mmDMA0_CORE_NON_SECURE_PROPS 0x5000F4
#define mmDMA0_CORE_RD_MAX_OUTSTAND 0x500100
#define mmDMA0_CORE_RD_MAX_SIZE 0x500104
#define mmDMA0_CORE_RD_ARCACHE 0x500108
#define mmDMA0_CORE_RD_ARUSER_31_11 0x500110
#define mmDMA0_CORE_RD_INFLIGHTS 0x500114
#define mmDMA0_CORE_WR_MAX_OUTSTAND 0x500120
#define mmDMA0_CORE_WR_MAX_AWID 0x500124
#define mmDMA0_CORE_WR_AWCACHE 0x500128
#define mmDMA0_CORE_WR_AWUSER_31_11 0x500130
#define mmDMA0_CORE_WR_INFLIGHTS 0x500134
#define mmDMA0_CORE_RD_RATE_LIM_CFG_0 0x500150
#define mmDMA0_CORE_RD_RATE_LIM_CFG_1 0x500154
#define mmDMA0_CORE_WR_RATE_LIM_CFG_0 0x500158
#define mmDMA0_CORE_WR_RATE_LIM_CFG_1 0x50015C
#define mmDMA0_CORE_ERR_CFG 0x500160
#define mmDMA0_CORE_ERR_CAUSE 0x500164
#define mmDMA0_CORE_ERRMSG_ADDR_LO 0x500170
#define mmDMA0_CORE_ERRMSG_ADDR_HI 0x500174
#define mmDMA0_CORE_ERRMSG_WDATA 0x500178
#define mmDMA0_CORE_STS0 0x500190
#define mmDMA0_CORE_STS1 0x500194
#define mmDMA0_CORE_RD_DBGMEM_ADD 0x500200
#define mmDMA0_CORE_RD_DBGMEM_DATA_WR 0x500204
#define mmDMA0_CORE_RD_DBGMEM_DATA_RD 0x500208
#define mmDMA0_CORE_RD_DBGMEM_CTRL 0x50020C
#define mmDMA0_CORE_RD_DBGMEM_RC 0x500210
#define mmDMA0_CORE_DBG_HBW_AXI_AR_CNT 0x500220
#define mmDMA0_CORE_DBG_HBW_AXI_AW_CNT 0x500224
#define mmDMA0_CORE_DBG_LBW_AXI_AW_CNT 0x500228
#define mmDMA0_CORE_DBG_DESC_CNT 0x50022C
#define mmDMA0_CORE_DBG_STS 0x500230
#define mmDMA0_CORE_DBG_RD_DESC_ID 0x500234
#define mmDMA0_CORE_DBG_WR_DESC_ID 0x500238
#endif /* ASIC_REG_DMA0_CORE_REGS_H_ */
This diff is collapsed.
This diff is collapsed.
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
/************************************
** This is an auto-generated file **
** DO NOT EDIT BELOW **
************************************/
#ifndef ASIC_REG_DMA1_CORE_REGS_H_
#define ASIC_REG_DMA1_CORE_REGS_H_
/*
*****************************************
* DMA1_CORE (Prototype: DMA_CORE)
*****************************************
*/
#define mmDMA1_CORE_CFG_0 0x520000
#define mmDMA1_CORE_CFG_1 0x520004
#define mmDMA1_CORE_LBW_MAX_OUTSTAND 0x520008
#define mmDMA1_CORE_SRC_BASE_LO 0x520014
#define mmDMA1_CORE_SRC_BASE_HI 0x520018
#define mmDMA1_CORE_DST_BASE_LO 0x52001C
#define mmDMA1_CORE_DST_BASE_HI 0x520020
#define mmDMA1_CORE_SRC_TSIZE_1 0x52002C
#define mmDMA1_CORE_SRC_STRIDE_1 0x520030
#define mmDMA1_CORE_SRC_TSIZE_2 0x520034
#define mmDMA1_CORE_SRC_STRIDE_2 0x520038
#define mmDMA1_CORE_SRC_TSIZE_3 0x52003C
#define mmDMA1_CORE_SRC_STRIDE_3 0x520040
#define mmDMA1_CORE_SRC_TSIZE_4 0x520044
#define mmDMA1_CORE_SRC_STRIDE_4 0x520048
#define mmDMA1_CORE_SRC_TSIZE_0 0x52004C
#define mmDMA1_CORE_DST_TSIZE_1 0x520054
#define mmDMA1_CORE_DST_STRIDE_1 0x520058
#define mmDMA1_CORE_DST_TSIZE_2 0x52005C
#define mmDMA1_CORE_DST_STRIDE_2 0x520060
#define mmDMA1_CORE_DST_TSIZE_3 0x520064
#define mmDMA1_CORE_DST_STRIDE_3 0x520068
#define mmDMA1_CORE_DST_TSIZE_4 0x52006C
#define mmDMA1_CORE_DST_STRIDE_4 0x520070
#define mmDMA1_CORE_DST_TSIZE_0 0x520074
#define mmDMA1_CORE_COMMIT 0x520078
#define mmDMA1_CORE_WR_COMP_WDATA 0x52007C
#define mmDMA1_CORE_WR_COMP_ADDR_LO 0x520080
#define mmDMA1_CORE_WR_COMP_ADDR_HI 0x520084
#define mmDMA1_CORE_WR_COMP_AWUSER_31_11 0x520088
#define mmDMA1_CORE_TE_NUMROWS 0x520094
#define mmDMA1_CORE_PROT 0x5200B8
#define mmDMA1_CORE_SECURE_PROPS 0x5200F0
#define mmDMA1_CORE_NON_SECURE_PROPS 0x5200F4
#define mmDMA1_CORE_RD_MAX_OUTSTAND 0x520100
#define mmDMA1_CORE_RD_MAX_SIZE 0x520104
#define mmDMA1_CORE_RD_ARCACHE 0x520108
#define mmDMA1_CORE_RD_ARUSER_31_11 0x520110
#define mmDMA1_CORE_RD_INFLIGHTS 0x520114
#define mmDMA1_CORE_WR_MAX_OUTSTAND 0x520120
#define mmDMA1_CORE_WR_MAX_AWID 0x520124
#define mmDMA1_CORE_WR_AWCACHE 0x520128
#define mmDMA1_CORE_WR_AWUSER_31_11 0x520130
#define mmDMA1_CORE_WR_INFLIGHTS 0x520134
#define mmDMA1_CORE_RD_RATE_LIM_CFG_0 0x520150
#define mmDMA1_CORE_RD_RATE_LIM_CFG_1 0x520154
#define mmDMA1_CORE_WR_RATE_LIM_CFG_0 0x520158
#define mmDMA1_CORE_WR_RATE_LIM_CFG_1 0x52015C
#define mmDMA1_CORE_ERR_CFG 0x520160
#define mmDMA1_CORE_ERR_CAUSE 0x520164
#define mmDMA1_CORE_ERRMSG_ADDR_LO 0x520170
#define mmDMA1_CORE_ERRMSG_ADDR_HI 0x520174
#define mmDMA1_CORE_ERRMSG_WDATA 0x520178
#define mmDMA1_CORE_STS0 0x520190
#define mmDMA1_CORE_STS1 0x520194
#define mmDMA1_CORE_RD_DBGMEM_ADD 0x520200
#define mmDMA1_CORE_RD_DBGMEM_DATA_WR 0x520204
#define mmDMA1_CORE_RD_DBGMEM_DATA_RD 0x520208
#define mmDMA1_CORE_RD_DBGMEM_CTRL 0x52020C
#define mmDMA1_CORE_RD_DBGMEM_RC 0x520210
#define mmDMA1_CORE_DBG_HBW_AXI_AR_CNT 0x520220
#define mmDMA1_CORE_DBG_HBW_AXI_AW_CNT 0x520224
#define mmDMA1_CORE_DBG_LBW_AXI_AW_CNT 0x520228
#define mmDMA1_CORE_DBG_DESC_CNT 0x52022C
#define mmDMA1_CORE_DBG_STS 0x520230
#define mmDMA1_CORE_DBG_RD_DESC_ID 0x520234
#define mmDMA1_CORE_DBG_WR_DESC_ID 0x520238
#endif /* ASIC_REG_DMA1_CORE_REGS_H_ */
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
/************************************
** This is an auto-generated file **
** DO NOT EDIT BELOW **
************************************/
#ifndef ASIC_REG_MMU_UP_REGS_H_
#define ASIC_REG_MMU_UP_REGS_H_
/*
*****************************************
* MMU_UP (Prototype: MMU)
*****************************************
*/
#define mmMMU_UP_MMU_ENABLE 0xC1100C
#define mmMMU_UP_FORCE_ORDERING 0xC11010
#define mmMMU_UP_FEATURE_ENABLE 0xC11014
#define mmMMU_UP_VA_ORDERING_MASK_31_7 0xC11018
#define mmMMU_UP_VA_ORDERING_MASK_49_32 0xC1101C
#define mmMMU_UP_LOG2_DDR_SIZE 0xC11020
#define mmMMU_UP_SCRAMBLER 0xC11024
#define mmMMU_UP_MEM_INIT_BUSY 0xC11028
#define mmMMU_UP_SPI_MASK 0xC1102C
#define mmMMU_UP_SPI_CAUSE 0xC11030
#define mmMMU_UP_PAGE_ERROR_CAPTURE 0xC11034
#define mmMMU_UP_PAGE_ERROR_CAPTURE_VA 0xC11038
#define mmMMU_UP_ACCESS_ERROR_CAPTURE 0xC1103C
#define mmMMU_UP_ACCESS_ERROR_CAPTURE_VA 0xC11040
#define mmMMU_UP_SPI_INTERRUPT_CLR 0xC11044
#define mmMMU_UP_SPI_INTERRUPT_MASK 0xC11048
#define mmMMU_UP_DBG_MEM_WRAP_RM 0xC1104C
#define mmMMU_UP_SPI_CAUSE_CLR 0xC11050
#define mmMMU_UP_SLICE_CREDIT 0xC11054
#define mmMMU_UP_PIPE_CREDIT 0xC11058
#define mmMMU_UP_RAZWI_WRITE_VLD 0xC1105C
#define mmMMU_UP_RAZWI_WRITE_ID 0xC11060
#define mmMMU_UP_RAZWI_READ_VLD 0xC11064
#define mmMMU_UP_RAZWI_READ_ID 0xC11068
#define mmMMU_UP_MMU_BYPASS 0xC1106C
#endif /* ASIC_REG_MMU_UP_REGS_H_ */
This diff is collapsed.
/* SPDX-License-Identifier: GPL-2.0
*
* Copyright 2016-2018 HabanaLabs, Ltd.
* All Rights Reserved.
*
*/
/************************************
** This is an auto-generated file **
** DO NOT EDIT BELOW **
************************************/
#ifndef ASIC_REG_PSOC_TIMESTAMP_REGS_H_
#define ASIC_REG_PSOC_TIMESTAMP_REGS_H_
/*
*****************************************
* PSOC_TIMESTAMP (Prototype: TIMESTAMP)
*****************************************
*/
#define mmPSOC_TIMESTAMP_CNTCR 0xC49000
#define mmPSOC_TIMESTAMP_CNTSR 0xC49004
#define mmPSOC_TIMESTAMP_CNTCVL 0xC49008
#define mmPSOC_TIMESTAMP_CNTCVU 0xC4900C
#define mmPSOC_TIMESTAMP_CNTFID0 0xC49020
#define mmPSOC_TIMESTAMP_PIDR4 0xC49FD0
#define mmPSOC_TIMESTAMP_PIDR5 0xC49FD4
#define mmPSOC_TIMESTAMP_PIDR6 0xC49FD8
#define mmPSOC_TIMESTAMP_PIDR7 0xC49FDC
#define mmPSOC_TIMESTAMP_PIDR0 0xC49FE0
#define mmPSOC_TIMESTAMP_PIDR1 0xC49FE4
#define mmPSOC_TIMESTAMP_PIDR2 0xC49FE8
#define mmPSOC_TIMESTAMP_PIDR3 0xC49FEC
#define mmPSOC_TIMESTAMP_CIDR0 0xC49FF0
#define mmPSOC_TIMESTAMP_CIDR1 0xC49FF4
#define mmPSOC_TIMESTAMP_CIDR2 0xC49FF8
#define mmPSOC_TIMESTAMP_CIDR3 0xC49FFC
#endif /* ASIC_REG_PSOC_TIMESTAMP_REGS_H_ */
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment